use*_*236 7 python numpy pandas
I am trying to reproduce the code provided here: https://github.com/IdoZehori/Credit-Score/blob/master/Credit%20score.ipynb
The function below does not run and raises an error. Can someone help me fix it?
def replaceOutlier(data, method = outlierVote, replace='median'):
    '''replace: median (auto)
                'minUpper' which is the upper bound of the outlier detection'''
    vote = outlierVote(data)
    x = pd.DataFrame(zip(data, vote), columns=['annual_income', 'outlier'])
    if replace == 'median':
        replace = x.debt.median()
    elif replace == 'minUpper':
        replace = min([val for (val, vote) in list(zip(data, vote)) if vote == True])
        if replace < data.mean():
            return 'There are outliers lower than the sample mean'
    debtNew = []
    for i in range(x.shape[0]):
        if x.iloc[i][1] == True:
            debtNew.append(replace)
        else:
            debtNew.append(x.iloc[i][0])
    return debtNew
Function call:
incomeNew = replaceOutlier(df.annual_income, replace='minUpper')
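Error:
x = pd.DataFrame(zip(data, vote), columns=['annual_income', 'outlier'])
TypeError: data argument can't be an iterator
PS: I know this has been asked before, but I tried the techniques suggested there and the error persists.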
PRM*_*reu 15
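zip cannot be passed directly. In Python 3 it returns an iterator, which the DataFrame constructor rejects here, so wrap the result in a list: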
x = pd.DataFrame(list(zip(data, vote)), columns=['annual_income', 'outlier'])
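For context, here is a minimal sketch of how the whole function might look with the list(zip(...)) fix applied. It is not the notebook author's code: the name replace_outliers, the precomputed vote argument (standing in for the result of outlierVote(data), which is not shown in the question) and the sample numbers are assumptions made for illustration. The sketch also reads the median from the annual_income column, because the DataFrame in the question has no debt column and x.debt.median() would fail once the constructor succeeds. The check against the sample mean is left out to keep the example short.

```python
import pandas as pd


def replace_outliers(data, vote, replace="median"):
    """Return a list in which values flagged as outliers are replaced.

    data: pandas Series of numbers.
    vote: iterable of booleans, True marks an outlier. This is a hypothetical
          stand-in for outlierVote(data), which the question does not show.
    """
    # Materialise the zip iterator so the DataFrame constructor receives a list.
    x = pd.DataFrame(list(zip(data, vote)), columns=["annual_income", "outlier"])

    if replace == "median":
        value = x["annual_income"].median()
    elif replace == "minUpper":
        # Smallest value among the rows flagged as outliers.
        value = x.loc[x["outlier"], "annual_income"].min()
    else:
        raise ValueError("replace must be 'median' or 'minUpper'")

    # Series.mask swaps in `value` wherever the outlier flag is True,
    # which replaces the row-by-row loop.
    return x["annual_income"].mask(x["outlier"], value).tolist()


# Made-up sample data, for illustration only.
income = pd.Series([30, 35, 40, 45, 1000])
vote = [False, False, False, False, True]
income_new = replace_outliers(income, vote)
```

Using mask instead of the explicit loop is a design choice of this sketch, not something the answer requires. The only change needed to get past the TypeError is the list(...) wrapper.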