嗨,我遇到了这个问题,我的数据源事件看起来像这样:
event_id device_id timestamp longitude latitude
0 1 29182687948017175 2016-05-01 00:55:25 121.38 31.24
1 2 -6401643145415154744 2016-05-01 00:54:12 103.65 30.97
2 3 -4833982096941402721 2016-05-01 00:08:05 106.60 29.7
Run Code Online (Sandbox Code Playgroud)
我试图通过device_id对事件进行分组,然后使用该device_id获取变量的sum/mean/std:
events['latitude_mean'] = events.groupby(['device_id'])['latitude'].aggregate(np.sum)
Run Code Online (Sandbox Code Playgroud)
但我的输出总是:
event_id device_id timestamp longitude latitude
0 1 29182687948017175 2016-05-01 00:55:25 121.38 31.24
1 2 -6401643145415154744 2016-05-01 00:54:12 103.65 30.97
2 3 -4833982096941402721 2016-05-01 00:08:05 106.60 29.70
3 4 -6815121365017318426 2016-05-01 00:06:40 104.27 23.28
4 5 -5373797595892518570 2016-05-01 00:07:18 115.88 28.66
latitude_mean
0 NaN
1 NaN
2 NaN
3 …Run Code Online (Sandbox Code Playgroud)