Mar*_*ana 4 python numpy dataframe pandas data-science
我在 gmt 中有两列日期时间,我需要减去这个日期时间的三个小时。例如在第 4 行中,我需要在 3 小时内减去 startdate,结果是:08/02/2018 17:20:0。并且在同一行 4 ai 需要在 3 小时内减去 enddate,结果是:08/02/2018 21:50:0。
cpf day startdate enddate
1234 1 08/01/2018 12:50:0 08/01/2018 15:30:0
1234 1 08/01/2018 14:30:0 08/01/2018 15:40:0
1234 1 08/01/2018 14:50:0 08/01/2018 15:50:0
1234 2 08/02/2018 20:20:0 08/03/2018 00:50:0
1234 3 08/03/2018 01:00:0 08/03/2018 03:50:0
1235 1 08/01/2018 11:50:0 08/01/2018 15:20:0
5212 1 08/01/2018 14:50:0 08/01/2018 15:20:0
Run Code Online (Sandbox Code Playgroud)
cpf day startdate enddate
1234 1 08/01/2018 09:50:0 08/01/2018 10:30:0
1234 1 08/01/2018 11:30:0 08/01/2018 10:40:0
1234 1 08/01/2018 11:50:0 08/01/2018 10:50:0
1234 2 08/02/2018 17:20:0 08/02/2018 21:50:0
1234 3 08/02/2018 22:00:0 08/03/2018 00:50:0
1235 1 08/01/2018 08:50:0 08/01/2018 10:20:0
5212 1 08/01/2018 11:50:0 08/01/2018 10:20:0
Run Code Online (Sandbox Code Playgroud)
我怎么能在python中做到这一点?
PS:请注意结果。谢谢!
您可以使用 timedelta
from datetime import timedelta
df['startdate'] = pd.to_datetime(df['startdate']) - timedelta(hours=3)
df['enddate'] = pd.to_datetime(df['enddate']) - timedelta(hours=3)
Run Code Online (Sandbox Code Playgroud)
我相信您需要转换列to_datetime
并减去3
小时 timedelta:
cols = ['startdate','enddate']
td = pd.Timedelta(3, unit='h')
df[cols] = df[cols].apply(lambda x: pd.to_datetime(x, format='%d/%m/%Y %H:%M:%S') - td
Run Code Online (Sandbox Code Playgroud)
如果要分别为每一列应用解决方案:
td = pd.Timedelta(3, unit='h')
df['startdate'] = pd.to_datetime(df['startdate'], format='%d/%m/%Y %H:%M:%S') - td
df['enddate'] = pd.to_datetime(df['enddate'], format='%d/%m/%Y %H:%M:%S') - td
Run Code Online (Sandbox Code Playgroud)
print (df)
cpf day startdate enddate
0 1234 1 2018-01-08 09:50:00 2018-01-08 12:30:00
1 1234 1 2018-01-08 11:30:00 2018-01-08 12:40:00
2 1234 1 2018-01-08 11:50:00 2018-01-08 12:50:00
3 1234 2 2018-02-08 17:20:00 2018-03-07 21:50:00
4 1234 3 2018-03-07 22:00:00 2018-03-08 00:50:00
5 1235 1 2018-01-08 08:50:00 2018-01-08 12:20:00
6 5212 1 2018-01-08 11:50:00 2018-01-08 12:20:00
Run Code Online (Sandbox Code Playgroud)
最后如果需要将日期时间转换为自定义格式:
df['startdate'] = df['startdate'].dt.strftime('%d/%m/%Y %H:%M:%S')
df['enddate'] = df['enddate'].dt.strftime('%d/%m/%Y %H:%M:%S')
print (df)
cpf day startdate enddate
0 1234 1 08/01/2018 09:50:00 08/01/2018 12:30:00
1 1234 1 08/01/2018 11:30:00 08/01/2018 12:40:00
2 1234 1 08/01/2018 11:50:00 08/01/2018 12:50:00
3 1234 2 08/02/2018 17:20:00 07/03/2018 21:50:00
4 1234 3 07/03/2018 22:00:00 08/03/2018 00:50:00
5 1235 1 08/01/2018 08:50:00 08/01/2018 12:20:00
6 5212 1 08/01/2018 11:50:00 08/01/2018 12:20:00
Run Code Online (Sandbox Code Playgroud)
归档时间: |
|
查看次数: |
13575 次 |
最近记录: |