Lon*_*guy 4 python scikit-learn
这就是我打电话给TFIDFVectorizer的方式:
vectorizer = TfidfVectorizer(
vocabulary=selected_vocabulary,
stop_words='english',
use_idf=True,
norm=norm,
tokenizer=self.tokenize,
lowercase=True,
smooth_idf=True)
Run Code Online (Sandbox Code Playgroud)
当我打电话时收到此错误
vectorizer.transform(data_to_vectorize)
Run Code Online (Sandbox Code Playgroud)
错误:
File "/root/anaconda/lib/python2.7/site-packages/sklearn/feature_extraction/text.py", line 1305, in transform
return self._tfidf.transform(X, copy=False)
File "/root/anaconda/lib/python2.7/site-packages/sklearn/feature_extraction/text.py", line 1024, in transform
raise ValueError("idf vector not fitted")
ValueError: idf vector not fitted
Run Code Online (Sandbox Code Playgroud)
这个错误在这里意味着什么?
在转换任意文本之前,您需要首先拟合模型(例如,根据数据构建词汇表):
vectorizer.fit(data_to_vectorize)
X = vectorizer.transform(data_to_vectorize)
Run Code Online (Sandbox Code Playgroud)
要么
X = vectorizer.fit_transform(data_to_vectorize)
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
2042 次 |
| 最近记录: |