Har*_*der 4 python numpy machine-learning pandas
我有一个数据集,其中包含一些分类变量,但它们缺少一些(NA/Null)。我想用该列的模式填充这些 NA/Null。我厌倦了以下事情,但这不起作用
MD=Data['Gender'].mode()
Data['Gender'].fillna(value=MD,inplace=True)
MD=Data['Married'].mode()
Data['Married'].fillna(value=MD,inplace=True)
MD=Data['Dependents'].mode()
Data['Dependents'].fillna(value=MD,inplace=True)
MD=Data['Self_Employed'].mode()
Data['Self_Employed'].fillna(value=MD,inplace=True)
MD=Data['Credit_History'].mode()
Data['Credit_History'].fillna(value=MD,inplace=True)
Gender 26
Married 6
Dependents 30
Education 0
Self_Employed 64
ApplicantIncome 0
CoapplicantIncome 0
LoanAmount 0
Loan_Amount_Term 0
Credit_History 100
Property_Area 0
Loan_Status 0
Run Code Online (Sandbox Code Playgroud)
仍然显示缺失值。
尝试这个:
Data['Married'].fillna(Data['Married'].mode(), inplace=True)
Run Code Online (Sandbox Code Playgroud)
或者试试这个:
Data['Married'].fillna(Data['Married'].value_counts().index[0], inplace=True)
Run Code Online (Sandbox Code Playgroud)
确保分类变量的 dtype 是对象或类别。
| 归档时间: |
|
| 查看次数: |
4342 次 |
| 最近记录: |