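Newbie here. I am just getting started with machine learning and am working through a simple example to learn. I want to use a classifier to classify the files on my disk by file type. The code I wrote is: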
import sklearn
import numpy as np
#Importing a local data set from the desktop
import pandas as pd
mydata = pd.read_csv('file_format.csv',skipinitialspace=True)
print mydata
x_train = mydata.script
y_train = mydata.label
#print x_train
#print y_train
x_test = mydata.script
from sklearn import tree
classi = tree.DecisionTreeClassifier()
classi.fit(x_train, y_train)
predictions = classi.predict(x_test)
print predictions
And the output and error I get are:
   script  class  div   label
0       5      6    7    html
1       0      0    0  python
2       1      1    1     csv
Traceback (most recent call last):
  File "newtest.py", line 21, …
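The traceback above is cut off, so the actual error message is unknown. One possibility worth checking, offered as a guess and not a diagnosis: `mydata.script` is a one-dimensional Series, while scikit-learn estimators expect a two-dimensional feature matrix of shape (n_samples, n_features). The minimal sketch below assumes that is the problem. It rebuilds the three printed rows inline instead of reading `file_format.csv`, and it assumes all three numeric columns are meant to be features; `feature_columns` is a name introduced here for illustration.

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

# Inline copy of the three rows printed above, standing in for file_format.csv.
mydata = pd.DataFrame({
    "script": [5, 0, 1],
    "class": [6, 0, 1],
    "div": [7, 0, 1],
    "label": ["html", "python", "csv"],
})

# Assumption: every numeric column is a feature.
# Indexing with a list of names returns a 2-D DataFrame, not a 1-D Series.
feature_columns = ["script", "class", "div"]
x_train = mydata[feature_columns]
y_train = mydata["label"]

classi = DecisionTreeClassifier(random_state=0)
classi.fit(x_train, y_train)

# Predicting on the training rows only checks that the calls run end to end;
# it says nothing about how well the model generalizes.
predictions = classi.predict(x_train)
print(predictions)
```

If only the `script` column should be used, `mydata[["script"]]` (double brackets) keeps it two-dimensional. With real data, the test rows would normally be held out from training instead of reusing the training set.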