熊猫：将细胞分裂成多个列，写为NaN

Question

熊猫：将细胞分裂成多个列，写为NaN

我有以下内容df：

df = pd.DataFrame({'Category': ['root:catA', 'root:catA:catB'], 'Age':[32, 34]})

Run Code Online (Sandbox Code Playgroud)

现在我要进行拆分的列Category由:进，共4个，列。由于row0将导致4个填充列，而row1在5个填充列中，因此列数将减少，因此其余列应使用填充NaN。

预期产量：

df = pd.DataFrame({'Category': ['root:catA', 'root:catA:catB'], 'Age':[32, 34], 'Cat1':['root', 'root'], 'Cat2':['catA', 'catA'], 'Cat3':['NaN', 'CatB'], 'Cat4':['NaN', 'NaN']})

Run Code Online (Sandbox Code Playgroud)

我尝试了以下方法：

df[['cat1','cat2', "cat3", "cat4"]] = df.Category.str.split(":",expand=True)

Run Code Online (Sandbox Code Playgroud)

但是我得到了 ValueError: Columns must be same length as key

Answer 1

cs9*_*s95 5

几乎在那里，您只需要执行其他重新索引步骤：

df['Category'].str.split(':', expand=True).reindex(range(4), axis=1)                                                   

      0     1     2   3
0  root  catA  None NaN
1  root  catA  catB NaN

Run Code Online (Sandbox Code Playgroud)

现在，分配就可以了。

或者，join他们在一起。

(df['Category'].str.split(':', expand=True)
               .reindex(range(4), axis=1)
               .rename(lambda x: f'cat{x+1}', axis=1)
               .join(df))

   cat1  cat2  cat3  cat4        Category  Age
0  root  catA  None   NaN       root:catA   32
1  root  catA  catB   NaN  root:catA:catB   34

Run Code Online (Sandbox Code Playgroud)

归档时间：	6 年，8 月前
查看次数：	40 次
最近记录：	6 年，8 月前