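I have a pandas DataFrame that looks like this:
Is there a way to add the numbers in the last column without having to iterate over the DataFrame?
I tried the answers from "Grouping and auto incrementing group id in pandas", but could not make them fit my purpose.
Here is the code that generates the DataFrame: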
import pandas as pd
columns = ['Product','SubProd', 'NeedThis']
Index=['4/20/2012','4/27/2012','5/4/2012','5/11/2012','5/18/2012','4/20/2012',
'4/27/2012','5/4/2012','5/11/2012','5/18/2012','5/25/2012','10/31/2014','11/7/2014',
'11/14/2014','11/21/2014','11/28/2014']
datas = {'Product' : ['A','A','A','A','A','A','A','A','A','A','A','B','B','B','B','B'],
'SubProd' : ['BL','BL','BL','BL','BL','lk','lk','lk','lk','lk','lk','po','po','po','po','po']}
df = pd.DataFrame(data=datas, index=Index)
print(df)
Output:
Product SubProd
4/20/2012 A BL
4/27/2012 A BL
5/4/2012 A BL
5/11/2012 A BL
5/18/2012 A BL
4/20/2012 A lk
4/27/2012 A lk
5/4/2012 A lk
5/11/2012 A lk
5/18/2012 A lk
5/25/2012 A lk
10/31/2014 B po
11/7/2014 B po
11/14/2014 B po
11/21/2014 B po
11/28/2014 B po
Thanks
Jef*_*eff 11
In [10]: df['counter'] = df.groupby(['Product','SubProd']).cumcount()+1
In [11]: df
Out[11]:
Product SubProd counter
4/20/2012 A BL 1
4/27/2012 A BL 2
5/4/2012 A BL 3
5/11/2012 A BL 4
5/18/2012 A BL 5
4/20/2012 A lk 1
4/27/2012 A lk 2
5/4/2012 A lk 3
5/11/2012 A lk 4
5/18/2012 A lk 5
5/25/2012 A lk 6
10/31/2014 B po 1
11/7/2014 B po 2
11/14/2014 B po 3
11/21/2014 B po 4
11/28/2014 B po 5
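To spell out what the one-liner does: `cumcount()` numbers the rows inside each group starting from 0, in the order they appear, so adding 1 gives a 1-based counter per (Product, SubProd) pair with no explicit loop. Below is a minimal sketch on a smaller frame; the five-row data is made up purely for illustration and is not the frame from the question.

```python
import pandas as pd

# Small illustrative frame (hypothetical data, not the question's frame).
df = pd.DataFrame({
    "Product": ["A", "A", "A", "B", "B"],
    "SubProd": ["BL", "BL", "lk", "po", "po"],
})

# cumcount() yields 0, 1, 2, ... within each group; +1 makes it 1-based.
df["counter"] = df.groupby(["Product", "SubProd"]).cumcount() + 1

counters = df["counter"].tolist()
print(counters)  # [1, 2, 1, 1, 2]
```

The counter follows the existing row order within each group, so if the numbering should follow the dates, the frame would need to be sorted accordingly before the `groupby` call.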