从熊猫时间序列生成星期几箱图的最佳方法

dka*_*fry 8 python time-series dayofweek boxplot pandas

我正在尝试为时间序列创建一组星期几的箱形图(例如,5分钟的温度观测值).

我的代码:

# ts is our timeseries
ts = df.SomeColumn

dow_map = {}
days = ['MON','TUE','WED','THU','FRI','SAT','SUN']
dow_idx = ts.index.dayofweek

i = 0
for d in days:
    dow_map[d] = ts[dow_idx == i]
    i = i + 1

df = pd.DataFrame(dow_map)
df.boxplot()
Run Code Online (Sandbox Code Playgroud)

结果是:

---------------------------------------------------------------------------
Exception                                 Traceback (most recent call last)
<ipython-input-898-6070c45e4c4b> in <module>()
     41     i = i + 1
     42 
---> 43 df = pd.DataFrame(dow_map)
     44 df.boxplot()
...
Exception: Reindexing only valid with uniquely valued Index objects
Run Code Online (Sandbox Code Playgroud)

我确实通过为每个星期创建DataFrames然后将它们连接到最终的DataFrame来获得成功,但这似乎效率低下......

Joo*_*oop 11

1st创建数据框并使用工作日方法获取星期几:

import pandas as pd
import numpy.random as random
n=1000
df = pd.DataFrame(random.randn(n), pd.date_range('2010-01-01', periods=n), columns=["data"])
df['Dates'] = df.index
df['week_days'] =df.index.weekday
df
Run Code Online (Sandbox Code Playgroud)

现在转动该表,以便将week_days作为列(也可以将needdays更改为字符串格式的天数,但留给您.

x =df.pivot(index='Dates', columns='week_days', values='data')
x.boxplot()
Run Code Online (Sandbox Code Playgroud)

在此输入图像描述