小编dbo*_*ose的帖子

如何找出不连续的日期时间索引？如何取连续指数的平均值？

我有一个时间序列数据。但是数据不连续。（2005-03-02 02:08:00丢失）。

我需要一个新的C列C(i)=A(i)+B(i)+average，其中我的平均值是B直到不连续为止的平均值(02:08:00)。

average=Data.loc['2005-03-02 02:05:30':'2005-03-02 02:07:30',['B']].mean(axis=0)  
After discontinuity we have to again recalculate average till next discontinuity  
average=Data.loc['2005-03-02 02:08:30':'2005-03-02 02:11:00',['B']].mean(axis=0)

Run Code Online (Sandbox Code Playgroud)

输入项

Date,A,B  
2005-03-02 02:05:30,1,3   
2005-03-02 02:06:00,2,4   
2005-03-02 02:06:30,3,5  
2005-03-02 02:07:00,4,6  
2005-03-02 02:07:30,5,7  
2005-03-02 02:08:30,7,9  
2005-03-02 02:09:00,7,9  
2005-03-02 02:09:30,7,9  
2005-03-02 02:10:00,8,12  
2005-03-02 02:10:30,9,13  
2005-03-02 02:11:00,10,14

Run Code Online (Sandbox Code Playgroud)

输出量

Date,A,B,C  
2005-03-02 02:05:30,1,3,9  
2005-03-02 02:06:00,2,4,11  
2005-03-02 02:06:30,3,5,13  
2005-03-02 02:07:00,4,6,15  
2005-03-02 02:07:30,5,7,17  
2005-03-02 02:08:30,7,9,28  
2005-03-02 02:09:00,7,9,28  
2005-03-02 02:09:30,7,9,28  
2005-03-02 02:10:00,8,12,32  
2005-03-02 02:10:30,9,13,34  
2005-03-02 02:11:00,10,14,36

Run Code Online (Sandbox Code Playgroud)

如何找出索引中的不连续性？

我如何使用熊猫来做整个事情？

python pandas pandas-groupby

dbo*_*ose

2018 05-03

4
推荐指数

2
解决办法

497
查看次数