我正在尝试学习Python的Pandas库,然后我遇到了用于时间序列分析的“滚动窗口”的概念。我从来都不是统计学的好学生,所以我有点迷茫。
请解释这个概念,最好是使用一个简单的例子,也许是一个代码片段。
发布于 2017-08-20 20:11:51
演示:
设置:
In [11]: df = pd.DataFrame({'a':np.arange(10, 17)})
In [12]: df
Out[12]:
a
0 10
1 11
2 12
3 13
4 14
5 15
6 162 rows窗口的滚动和:
In [13]: df['a'].rolling(2).sum()
Out[13]:
0 NaN # sum of the current and previous value: 10 + NaN = NaN
1 21.0 # sum of the current and previous value: 10 + 11
2 23.0 # sum of the current and previous value: 11 + 12
3 25.0 # ...
4 27.0
5 29.0
6 31.0
Name: a, dtype: float643 rows窗口的滚动和:
In [14]: df['a'].rolling(3).sum()
Out[14]:
0 NaN # sum of current value and two preceeding rows: 10 + NaN + Nan
1 NaN # sum of current value and two preceeding rows: 10 + 11 + Nan
2 33.0 # sum of current value and two preceeding rows: 10 + 11 + 12
3 36.0 # ...
4 39.0
5 42.0
6 45.0
Name: a, dtype: float64https://stackoverflow.com/questions/45784628
复制相似问题