我是Pandas的新手,作为练习,我正在移动一些旧的代码/解决方案以从中学习。在这种情况下,我试图计算产品价格的综合指数,该指数在使用SQL进行计算之前。
这是我在数据框中的数据:
id weight date price
0 1 0.002796 2005-11-15 0.998298
1 1 0.002796 2005-11-16 1.014242
2 1 0.002796 2005-11-17 1.016452
3 1 0.002796 2005-11-18 1.026396
4 1 0.002796 2005-11-19 1.026047
5 1 0.002796 2005-11-20 1.024285
6 1 0.002796 2005-11-21 1.018764
7 1 0.002796 2005-11-22 1.033175
8 1 0.002796 2005-11-23 1.058509
9 1 0.002796 2005-11-24 1.061231
10 1 0.002796 2005-11-25 1.058137
11 1 0.002796 2005-11-26 0.999380
12 1 0.002796 2005-11-27 0.990504
13 1 0.002796 2005-11-28 0.993764
14 1 0.002796 2005-11-29 0.978754
15 1 0.002796 2005-11-30 0.992070
... ... ... ... ...
4085 1 0.002796 2017-01-21 0.857420
4086 1 0.002796 2017-01-22 0.848195
4087 1 0.002796 2017-01-23 0.791784
4088 1 0.002796 2017-01-24 0.846603
4089 1 0.002796 2017-01-25 0.878104
4090 1 0.002796 2017-01-26 0.806651
4091 1 0.002796 2017-01-27 0.849316
4092 1 0.002796 2017-01-28 0.826550
4093 1 0.002796 2017-01-29 0.848651
4094 1 0.002796 2017-01-30 0.829643
4095 1 0.002796 2017-01-31 0.837094
4096 1 0.002796 2017-02-01 0.846572
4097 1 0.002796 2017-02-02 0.800163
4098 1 0.002796 2017-02-03 0.820356
4099 1 0.002796 2017-02-04 0.818924
4100 1 0.002796 2017-02-05 0.822157
4101 1 0.002796 2017-02-06 0.787123
4102 1 0.002796 2017-02-07 0.796264
4103 1 0.002796 2017-02-08 0.797241
4104 1 0.002796 2017-02-09 0.818499
4105 1 0.002796 2017-02-10 0.810928
综合指数是根据每日收益计算的,即每日收益:
Rt =(Price_day / Price_day_before)-1
我一直在阅读有关熊猫,时间序列等的信息,但我一直在努力了解要在此处执行的具体操作;这是滚动吗?如何获取给定日期和之前日期的数据?
您要使用pct_change()方法的IIUC:
In [196]: x
Out[196]:
id weight date price
0 1 0.002796 2005-11-15 0.998298
1 1 0.002796 2005-11-16 1.014242
2 1 0.002796 2005-11-17 1.016452
3 1 0.002796 2005-11-18 1.026396
4 1 0.002796 2005-11-19 1.026047
5 1 0.002796 2005-11-20 1.024285
6 1 0.002796 2005-11-21 1.018764
7 1 0.002796 2005-11-22 1.033175
8 1 0.002796 2005-11-23 1.058509
9 1 0.002796 2005-11-24 1.061231
10 1 0.002796 2005-11-25 1.058137
11 1 0.002796 2005-11-26 0.999380
12 1 0.002796 2005-11-27 0.990504
13 1 0.002796 2005-11-28 0.993764
14 1 0.002796 2005-11-29 0.978754
15 1 0.002796 2005-11-30 0.992070
In [197]: x['Rt'] = x['price'].pct_change()
In [198]: x
Out[198]:
id weight date price Rt
0 1 0.002796 2005-11-15 0.998298 NaN
1 1 0.002796 2005-11-16 1.014242 0.015971
2 1 0.002796 2005-11-17 1.016452 0.002179
3 1 0.002796 2005-11-18 1.026396 0.009783
4 1 0.002796 2005-11-19 1.026047 -0.000340
5 1 0.002796 2005-11-20 1.024285 -0.001717
6 1 0.002796 2005-11-21 1.018764 -0.005390
7 1 0.002796 2005-11-22 1.033175 0.014146
8 1 0.002796 2005-11-23 1.058509 0.024521
9 1 0.002796 2005-11-24 1.061231 0.002572
10 1 0.002796 2005-11-25 1.058137 -0.002915
11 1 0.002796 2005-11-26 0.999380 -0.055529
12 1 0.002796 2005-11-27 0.990504 -0.008882
13 1 0.002796 2005-11-28 0.993764 0.003291
14 1 0.002796 2005-11-29 0.978754 -0.015104
15 1 0.002796 2005-11-30 0.992070 0.013605
另一个解决方案(使用shift()方法):
In [199]: x['Rt2'] = x['price'] / x['price'].shift() - 1
In [200]: x
Out[200]:
id weight date price Rt Rt2
0 1 0.002796 2005-11-15 0.998298 NaN NaN
1 1 0.002796 2005-11-16 1.014242 0.015971 0.015971
2 1 0.002796 2005-11-17 1.016452 0.002179 0.002179
3 1 0.002796 2005-11-18 1.026396 0.009783 0.009783
4 1 0.002796 2005-11-19 1.026047 -0.000340 -0.000340
5 1 0.002796 2005-11-20 1.024285 -0.001717 -0.001717
6 1 0.002796 2005-11-21 1.018764 -0.005390 -0.005390
7 1 0.002796 2005-11-22 1.033175 0.014146 0.014146
8 1 0.002796 2005-11-23 1.058509 0.024521 0.024521
9 1 0.002796 2005-11-24 1.061231 0.002572 0.002572
10 1 0.002796 2005-11-25 1.058137 -0.002915 -0.002915
11 1 0.002796 2005-11-26 0.999380 -0.055529 -0.055529
12 1 0.002796 2005-11-27 0.990504 -0.008882 -0.008882
13 1 0.002796 2005-11-28 0.993764 0.003291 0.003291
14 1 0.002796 2005-11-29 0.978754 -0.015104 -0.015104
15 1 0.002796 2005-11-30 0.992070 0.013605 0.013605
本文收集自互联网,转载请注明来源。
如有侵权,请联系 [email protected] 删除。
我来说两句