jer*_*han 5 sorting stack time-series pandas
我有以下几天的时间序列数据..我想按“日期”拆开它。但我使用 .unstack() 然后自动排序时间..(日期/时间是多索引)
Date Time a b c d e
2015-12-06 22:00:00 21.26 0 2.62 242.195 0
2015-12-06 22:15:00 21.14 0 2.55 255.516 0
2015-12-06 22:30:00 21.2 0 2.49 241.261 0
2015-12-06 22:45:00 21.18 0 2.48 232.058 0
2015-12-06 23:00:00 21.12 0 2.38 239.661 0
2015-12-06 23:15:00 21 0 2.23 228.324 0
2015-12-06 23:30:00 21.13 0 2.29 0 0
2015-12-06 23:45:00 21.12 0 2.29 0 0
2015-12-06 0:00:00 21.02 0 2.17 0 0
2015-12-06 0:15:00 21.09 0 2.13 0 0
2015-12-06 0:30:00 20.96 0 2.21 0 0
2015-12-06 0:45:00 20.92 0 2.19 0 0
2015-12-06 1:00:00 20.99 0 2.13 0 0
2015-12-06 1:15:00 20.92 0 2.14 0 0
2015-12-06 1:30:00 20.97 0 2.13 0 0
2015-12-06 1:45:00 20.85 0 2.11 0 0
2015-12-06 2:00:00 20.76 0 1.72 0 0
Run Code Online (Sandbox Code Playgroud)
我想要的结果如下。我该怎么做?
a a a a...
Date 2015-12-06 0:00 2015-12-13 0:00 2015-12-20 0:00 2015-12-23 0:00...
Time
22:00:00 21.02 21.26 20.75 22.61
22:15:15:00 21.09 21.36 20.74 22.65
..
0:00:00 20.92 21.2 20.79 22.37
0:15:00 20.99 21.33 20.77 22.44
0:30:00 20.92 21.24 20.76 22.28
..
Run Code Online (Sandbox Code Playgroud)
您需要unstack按第一级,然后reindex按unique第二级的值,最后按列中sort_index的第二级的值:MutiIndex
df = (df
.unstack(0)
.reindex(pd.unique(df.index.get_level_values(1)))
.sort_index(axis=1, level=1)
)
print(df)
Run Code Online (Sandbox Code Playgroud)
a b c c e
Date 2015-12-06 2015-12-06 2015-12-06 2015-12-06 2015-12-06
Time
22:00:00 21.26 0 2.62 242.195 0
22:15:00 21.14 0 2.55 255.516 0
22:30:00 21.20 0 2.49 241.261 0
22:45:00 21.18 0 2.48 232.058 0
23:00:00 21.12 0 2.38 239.661 0
23:15:00 21.00 0 2.23 228.324 0
23:30:00 21.13 0 2.29 0.000 0
23:45:00 21.12 0 2.29 0.000 0
00:00:00 21.02 0 2.17 0.000 0
00:15:00 21.09 0 2.13 0.000 0
00:30:00 20.96 0 2.21 0.000 0
00:45:00 20.92 0 2.19 0.000 0
01:00:00 20.99 0 2.13 0.000 0
01:15:00 20.92 0 2.14 0.000 0
01:30:00 20.97 0 2.13 0.000 0
01:45:00 20.85 0 2.11 0.000 0
02:00:00 20.76 0 1.72 0.000 0
Run Code Online (Sandbox Code Playgroud)
编辑:
idx = (pd.date_range('2015-01-01', '2015-01-01 23:45:00', freq='15T') +
pd.to_timedelta('22:00:00')
).time
df = df.unstack(0).reindex(idx)
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
2082 次 |
| 最近记录: |