尝试使用看起来像这样的东西:
CATEGORY | NUMBER VALUE | ID
FRUIT | 15 | XCD
VEGGIES | 12 | ZYK
from sklearn.preprocessing import LabelEncoder, OneHotEncoder
data = data.iloc[:,:].values
enc = LabelEncoder()
data[:,0]=enc .fit_transform(data[:,0])
data
Run Code Online (Sandbox Code Playgroud)
array([[1, 15, 'XCD'],
[2, 12, 'ZYK']])
Run Code Online (Sandbox Code Playgroud)
然后...
from sklearn.compose import ColumnTransformer
ct = ColumnTransformer(transformers=[('encode',OneHotEncoder,[0])],remainder='passthrough')
dataset = np.array(ct.fit_transform(data))
Run Code Online (Sandbox Code Playgroud)
给
TypeError: Cannot clone object. You should provide an instance of scikit-learn estimator instead of a class.
Run Code Online (Sandbox Code Playgroud)