war*_*nry 2 python concat dataframe pandas
我正在尝试连接两个数据框:
df2:
CU Pmt 2017-02-01
h b 15
h d 12
h a 13
Run Code Online (Sandbox Code Playgroud)
和 df1:
CU Pmt 'Total/Max/Min'
h b 20
h d 23
h a 22
a b 16
a d 13
a a 14
Run Code Online (Sandbox Code Playgroud)
这样df3:
CU Pmt 2017-02-01 2017-02-02
h b 15 20
h d 12 23
h a 13 22
a b NaN 16
a d NaN 13
a a Nan 14
Run Code Online (Sandbox Code Playgroud)
我对两者都使用了 index_col = [0,1] 的多索引
这就是我所拥有的:
date = '2017-02-02'
df1 = pd.read_csv(r'Data\2017-02\2017-02-02\Aggregated\Aggregated_Daily_All.csv', usecols=['CU', 'Parameters', 'Total/Max/Min'], index_col =[0,1])
df1 = df1.rename(columns = {'Total/Max/Min':date})
df2 = pd.read_csv(r'Data\2017-02\MonthlyData\February2017.csv', index_col = [0,1])
df3 = pd.concat([df2, df1], axis=1)
df3.to_csv(r'Data\2017-02\MonthlyData\February2017.csv')
Run Code Online (Sandbox Code Playgroud)
但是,df3 的结果是:
CU Pmt 2017-02-01 2017-02-02
a a NaN 14
a b NaN 16
a d Nan 13
h a 13 22
h b 15 20
h d 12 23
Run Code Online (Sandbox Code Playgroud)
其中有CU
和Pmt
(两个索引列)按字母顺序排列。如何保持原始顺序,以便为新日期添加的所有新索引都添加到底部?
reindex
如果值df1.index
包含以下值,您可以尝试df2.index
:
df3 = pd.concat([df2, df1], axis=1).reindex(df1.index)
print (df3)
2017-02-01 'Total/Max/Min'
CU Pmt
h b 15.0 20
d 12.0 23
a 13.0 22
a b NaN 16
d NaN 13
a NaN 14
Run Code Online (Sandbox Code Playgroud)