use*_*345 5 python boolean numpy dataframe pandas
输入解释: 我有一个数据框“df”,其中包含“Space”和“Threshold”列。
Space Threshold
TRUE 0.1
TRUE 0.25
FALSE 0.5
FALSE 0.6
Run Code Online (Sandbox Code Playgroud)
需要考虑的场景: 当 df['Space'] 为 TRUE 时,检查 df['Threshold']<=0.2,如果两个条件都满足,则生成一个名为 df['Space_Test'] 且值为 PASS/FAIL 的新列。如果 df['Space'] 值为 FALSE,则将 'FALSE' 作为值放入新生成的列 df['Space_Test']。
预期输出:
Space Threshold Space_Test
TRUE 0.1 PASS
TRUE 0.25 FAIL
FALSE 0.5 FALSE
FALSE 0.6 FALSE
Run Code Online (Sandbox Code Playgroud)
尝试过的代码: 已针对上述场景尝试了下面提到的代码行,但不起作用。
df['Space_Test'] = np.where(df['Space'] == 'TRUE',np.where(df['Threshold'] <= 0.2, 'Pass', 'Fail'),'FALSE')
Run Code Online (Sandbox Code Playgroud)
需要帮助来解决这个问题。提前致谢!
另一种解决方案
from pandas import DataFrame
names = {
'Space': ['TRUE','TRUE','FALSE','FALSE'],
'Threshold': [0.1, 0.25, 1, 2]
}
df = DataFrame(names,columns=['Space','Threshold'])
df.loc[(df['Space'] == 'TRUE') & (df['Threshold'] <= 0.2), 'Space_Test'] = 'Pass'
df.loc[(df['Space'] != 'TRUE') | (df['Threshold'] > 0.2), 'Space_Test'] = 'Fail'
print (df)
Run Code Online (Sandbox Code Playgroud)
如果TRUE是布尔值,您的解决方案只需通过比较来简化df['Space']:
df['Space_Test'] = np.where(df['Space'],
np.where(df['Threshold'] <= 0.2, 'Pass', 'Fail'),'FALSE')
print (df)
Space Threshold Space_Test
0 True 0.10 Pass
1 True 0.25 Fail
2 False 0.50 FALSE
3 False 0.60 FALSE
Run Code Online (Sandbox Code Playgroud)
替代方案numpy.select:
m1 = df['Space']
m2 = df['Threshold'] <= 0.2
df['Space_Test'] = np.select([m1 & m2, m1 & ~m2], ['Pass', 'Fail'],'FALSE')
print (df)
Space Threshold Space_Test
0 True 0.10 Pass
1 True 0.25 Fail
2 False 0.50 FALSE
3 False 0.60 FALSE
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
23550 次 |
| 最近记录: |