Dee*_*k M 4 python dataframe pandas
I want to filter rows out of a DataFrame using multiple conditions on multiple columns. I tried this:
arrival_delayed_weather = [[[flight_data_finalcopy["ArrDelay"] > 0]] & [[flight_data_finalcopy["WeatherDelay"]>0]]]
arrival_delayed_weather_filter = arrival_delayed_weather[["UniqueCarrier", "AirlineID"]]
print arrival_delayed_weather_filter
But I get this error message:
TypeError: unsupported operand type(s) for &: 'list' and 'list'
How can I fix this?
Thanks in advance.
You need () instead of []:
arrival_delayed_weather = ((flight_data_finalcopy["ArrDelay"] > 0) &
                           (flight_data_finalcopy["WeatherDelay"] > 0))
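To see why the brackets matter: `[series]` builds a plain Python list that contains the Series, and `&` is not defined between two lists. Parentheses are also needed around each comparison because `&` binds more tightly than `>`. A minimal sketch, assuming a made-up two-row frame (the name `df` and its values are illustrative only, not from the question):

```python
import pandas as pd

# Hypothetical two-row frame, used only for illustration.
df = pd.DataFrame({"ArrDelay": [0, 3], "WeatherDelay": [0, 6]})

# Square brackets build Python lists, and lists do not support `&`.
try:
    [df["ArrDelay"] > 0] & [df["WeatherDelay"] > 0]
    list_error = None
except TypeError as exc:
    list_error = str(exc)

# Parentheses group each comparison, so `&` combines two boolean Series.
mask = (df["ArrDelay"] > 0) & (df["WeatherDelay"] > 0)

print(list_error)
print(mask.tolist())
```

The first print shows the same TypeError reported in the question; the second shows the element-wise result of the combined condition.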
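But it looks like you also need loc to select the columns UniqueCarrier and AirlineID by the mask, a slightly modified form of boolean indexing (ix, used in older pandas versions, was deprecated in 0.20 and removed in 1.0):
mask = ((flight_data_finalcopy["ArrDelay"] > 0) &
        (flight_data_finalcopy["WeatherDelay"] > 0))
arrival_delayed_weather_filter = flight_data_finalcopy.loc[mask, ["UniqueCarrier", "AirlineID"]]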
Sample:
import pandas as pd

flight_data_finalcopy = pd.DataFrame({'ArrDelay': [0, 2, 3],
                                      'WeatherDelay': [0, 0, 6],
                                      'UniqueCarrier': ['s', 'a', 'w'],
                                      'AirlineID': [1515, 3546, 5456]})
print (flight_data_finalcopy)
   AirlineID  ArrDelay UniqueCarrier  WeatherDelay
0       1515         0             s             0
1       3546         2             a             0
2       5456         3             w             6
mask = (flight_data_finalcopy["ArrDelay"] > 0) & (flight_data_finalcopy["WeatherDelay"]>0)
print (mask)
0    False
1    False
2     True
dtype: bool
arrival_delayed_weather_filter = flight_data_finalcopy.loc[mask, ["UniqueCarrier", "AirlineID"]]
print (arrival_delayed_weather_filter)
  UniqueCarrier  AirlineID
2             w       5456
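As an aside, the same filter can probably also be expressed with `DataFrame.query`, which some people find easier to read when several conditions are combined. This is an alternative sketch, not the approach used in the answer; it rebuilds the sample frame so that it runs on its own, and the names `by_loc` and `by_query` are introduced here only for the comparison:

```python
import pandas as pd

flight_data_finalcopy = pd.DataFrame({
    "ArrDelay": [0, 2, 3],
    "WeatherDelay": [0, 0, 6],
    "UniqueCarrier": ["s", "a", "w"],
    "AirlineID": [1515, 3546, 5456],
})

# Boolean mask with .loc: rows chosen by the mask, columns by a label list.
mask = (flight_data_finalcopy["ArrDelay"] > 0) & (flight_data_finalcopy["WeatherDelay"] > 0)
by_loc = flight_data_finalcopy.loc[mask, ["UniqueCarrier", "AirlineID"]]

# The same conditions written as a query string, followed by column selection.
by_query = flight_data_finalcopy.query("ArrDelay > 0 and WeatherDelay > 0")[
    ["UniqueCarrier", "AirlineID"]
]

# Both approaches should select the same single row.
print(by_loc.equals(by_query))
```

The mask form keeps the conditions as ordinary Python expressions, which is handy when they are built up programmatically; the query form is more compact when the conditions are fixed.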