I have a df:
date category subcategory order_id product_id branch
2021-05-04 A aa 10 5 web
2021-06-04 A dd 10 2 web
2021-05-06 B aa 18 3 shop
2021-07-06 A aa 50 10 web
2021-07-06 C cc 10 15 web
2021-07-05 A ff 101 30 shop
2021-10-04 D aa 100 15 shop
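To make the example reproducible, the frame above could be built roughly like this. It is only a sketch: it assumes the columns are separated by single spaces and that `date` should be parsed as a datetime (which `resample` needs); the name `raw` is purely illustrative.

```python
import io

import pandas as pd

# The sample rows from the question, one record per line (space-separated).
raw = """date category subcategory order_id product_id branch
2021-05-04 A aa 10 5 web
2021-06-04 A dd 10 2 web
2021-05-06 B aa 18 3 shop
2021-07-06 A aa 50 10 web
2021-07-06 C cc 10 15 web
2021-07-05 A ff 101 30 shop
2021-10-04 D aa 100 15 shop
"""

# parse_dates turns the 'date' column into datetime64 values.
df = pd.read_csv(io.StringIO(raw), sep=" ", parse_dates=["date"])
print(df.dtypes)
```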
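I am trying to answer this question: which product categories and subcategories are bought together each week, by branch type? I am thinking of grouping by order_id and aggregating category and subcategory into lists, as shown here: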
a = (df.set_index('date')
       .groupby(['order_id', 'branch'])
       .resample('W-MON', label='left')
       .agg({'category': list, 'subcategory': list}))
This returns:
category subcategory
order_id branch date [A, A, …
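For comparison, the same weekly grouping can also be sketched with `pd.Grouper` in place of `resample`. This is an alternative formulation, not the code from the post: it assumes `date` has been converted to datetime, and the name `weekly` is only illustrative. When the time grouper is combined with other keys like this, only weeks that actually contain rows should appear in the result.

```python
import pandas as pd

# Sample data from the question, rebuilt so the block runs on its own.
df = pd.DataFrame({
    "date": ["2021-05-04", "2021-06-04", "2021-05-06", "2021-07-06",
             "2021-07-06", "2021-07-05", "2021-10-04"],
    "category": ["A", "A", "B", "A", "C", "A", "D"],
    "subcategory": ["aa", "dd", "aa", "aa", "cc", "ff", "aa"],
    "order_id": [10, 10, 18, 50, 10, 101, 100],
    "product_id": [5, 2, 3, 10, 15, 30, 15],
    "branch": ["web", "web", "shop", "web", "web", "shop", "shop"],
})
df["date"] = pd.to_datetime(df["date"])  # assumption: dates arrive as strings

# Group by order, branch, and week (weeks anchored on Monday, labelled by
# the left bin edge), then collect the categories and subcategories as lists.
weekly = (
    df.groupby(["order_id", "branch",
                pd.Grouper(key="date", freq="W-MON", label="left")])
      .agg({"category": list, "subcategory": list})
)
print(weekly)
```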