基于pandas中的列进行分组和自动增量

kiz*_*lax 7 python pandas

我有一个看起来像这样的熊猫框架:

在此处输入图片说明

有没有办法在最后一列中添加数字而不必遍历数据框?

在大熊猫中使用分组和自动递增组 ID的结果,但没有使它适合我的目的

这是生成数据帧的代码

import pandas as pd
columns = ['Product','SubProd', 'NeedThis']
Index=['4/20/2012','4/27/2012','5/4/2012','5/11/2012','5/18/2012','4/20/2012',
'4/27/2012','5/4/2012','5/11/2012','5/18/2012','5/25/2012','10/31/2014','11/7/2014',
'11/14/2014','11/21/2014','11/28/2014']
datas = {'Product' : ['A','A','A','A','A','A','A','A','A','A','A','B','B','B','B','B'],
      'SubProd' : ['BL','BL','BL','BL','BL','lk','lk','lk','lk','lk','lk','po','po','po','po','po']}
df = pd.DataFrame(data=datas, index=Index)
print(df)
Run Code Online (Sandbox Code Playgroud)

输出:

           Product SubProd
4/20/2012        A      BL
4/27/2012        A      BL
5/4/2012         A      BL
5/11/2012        A      BL
5/18/2012        A      BL
4/20/2012        A      lk
4/27/2012        A      lk
5/4/2012         A      lk
5/11/2012        A      lk
5/18/2012        A      lk
5/25/2012        A      lk
10/31/2014       B      po
11/7/2014        B      po
11/14/2014       B      po
11/21/2014       B      po
11/28/2014       B      po
Run Code Online (Sandbox Code Playgroud)

谢谢

Jef*_*eff 11

In [10]: df['counter'] = df.groupby(['Product','SubProd']).cumcount()+1

In [11]: df
Out[11]: 
           Product SubProd  counter
4/20/2012        A      BL        1
4/27/2012        A      BL        2
5/4/2012         A      BL        3
5/11/2012        A      BL        4
5/18/2012        A      BL        5
4/20/2012        A      lk        1
4/27/2012        A      lk        2
5/4/2012         A      lk        3
5/11/2012        A      lk        4
5/18/2012        A      lk        5
5/25/2012        A      lk        6
10/31/2014       B      po        1
11/7/2014        B      po        2
11/14/2014       B      po        3
11/21/2014       B      po        4
11/28/2014       B      po        5
Run Code Online (Sandbox Code Playgroud)