Ann*_*ine 7 python dataframe pandas
我正在尝试清理 Excel 文件以进行进一步研究。我遇到的问题,我想合并第一行和第二行。我现在拥有的代码:
xl = pd.ExcelFile("nanonose.xls")
df = xl.parse("Sheet1")
df = df.drop('Unnamed: 2', axis=1)
## Tried this line but no luck
##print(df.head().combine_first(df.iloc[[0]]))
Run Code Online (Sandbox Code Playgroud)
这个的输出是:
Nanonose Unnamed: 1 A B C D E \
0 Sample type Concentration NaN NaN NaN NaN NaN
1 Water 9200 95.5 21.0 6.0 11.942308 64.134615
2 Water 9200 94.5 17.0 5.0 5.484615 63.205769
3 Water 9200 92.0 16.0 3.0 11.057692 62.586538
4 Water 4600 53.0 7.5 2.5 3.538462 35.163462
F G H
0 NaN NaN NaN
1 21.498560 5.567840 1.174135
2 19.658560 4.968000 1.883444
3 19.813120 5.192480 0.564835
4 6.876207 1.641724 0.144654
Run Code Online (Sandbox Code Playgroud)
所以,我的目标是合并第一行和第二行以获得:样本类型 | 浓度| 一个 | 乙 | C | D | E | F | G | H
有人可以帮我合并这两行吗?
我想你需要numpy.concatenate,像c???s??类似的原理?回答:
df.columns = np.concatenate([df.iloc[0, :2], df.columns[2:]])
df = df.iloc[1:].reset_index(drop=True)
print (df)
Sample type Concentration A B C D E F \
0 Water 9200 95.5 21.0 6.0 11.942308 64.134615 21.498560
1 Water 9200 94.5 17.0 5.0 5.484615 63.205769 19.658560
2 Water 9200 92.0 16.0 3.0 11.057692 62.586538 19.813120
3 Water 4600 53.0 7.5 2.5 3.538462 35.163462 6.876207
G H
0 5.567840 1.174135
1 4.968000 1.883444
2 5.192480 0.564835
3 1.641724 0.144654
Run Code Online (Sandbox Code Playgroud)
只需重新分配即可df.columns。
df.columns = np.append(df.iloc[0, :2], df.columns[2:])
Run Code Online (Sandbox Code Playgroud)
或者,
df.columns = df.iloc[0, :2].tolist() + (df.columns[2:]).tolist()
Run Code Online (Sandbox Code Playgroud)
接下来,跳过第一行。
df = df.iloc[1:].reset_index(drop=True)
df
Sample type Concentration A B C D E F \
0 Water 9200 95.5 21.0 6.0 11.942308 64.134615 21.498560
1 Water 9200 94.5 17.0 5.0 5.484615 63.205769 19.658560
2 Water 9200 92.0 16.0 3.0 11.057692 62.586538 19.813120
3 Water 4600 53.0 7.5 2.5 3.538462 35.163462 6.876207
G H
0 5.567840 1.174135
1 4.968000 1.883444
2 5.192480 0.564835
3 1.641724 0.144654
Run Code Online (Sandbox Code Playgroud)
reset_index如果您希望最终输出的索引为 0,则该选项是可选的。
| 归档时间: |
|
| 查看次数: |
9446 次 |
| 最近记录: |