Ofe*_*Ron 10 python pandas pandas-groupby
拥有df至少列C1,C2,C3的pandas数据框如何将所有唯一的C1,C2,C3值作为新的DataFrame?
换句话说,类似于:
SELECT C1,C2,C3
FROM T
GROUP BY C1,C2,C3
Run Code Online (Sandbox Code Playgroud)
试过了
print df.groupby(by=['C1','C2','C3'])
Run Code Online (Sandbox Code Playgroud)
但我得到了
<pandas.core.groupby.DataFrameGroupBy object at 0x000000000769A9E8>
Run Code Online (Sandbox Code Playgroud)
jez*_*ael 20
drop_duplicates如果想要所有独特的三元组,我相信你需要:
df = df.drop_duplicates(subset=['C1','C2','C3'])
Run Code Online (Sandbox Code Playgroud)
如果想要使用groupby添加first:
df = df.groupby(by=['C1','C2','C3'], as_index=False).first()
Run Code Online (Sandbox Code Playgroud)