Posts by Man*_*h V

How to manually install the NLTK stopwords package

Here is my code:

from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize

example_sent = "This is a sample sentence, showing off the stop words filtration."

stop_words = set(stopwords.words('english'))

word_tokens = word_tokenize(example_sent)

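# Filter out stop words with a list comprehension.
filtered_sentence = [w for w in word_tokens if w not in stop_words]

# Equivalent explicit loop; it rebuilds the same list.
filtered_sentence = []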

for w in word_tokens:
    if w not in stop_words:
        filtered_sentence.append(w)


print(word_tokens)
print(filtered_sentence)


But when I run the code, I get this error:

Resource stopwords not found.
Please use the NLTK Downloader to obtain the resource


If I try to fetch the data with the NLTK Downloader, I get the following error:

[nltk_data] Error loading popular: <urlopen error [WinError …

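Since the downloader itself fails with a network error, one possible workaround is to fetch the stopwords archive by some other route and unpack it by hand. The sketch below assumes a `stopwords.zip` has already been obtained (for example from the nltk_data repository on GitHub via a browser) and that it unpacks to a top-level `stopwords/` folder. The helper names `install_corpus_zip` and `register_data_dir` and all paths are hypothetical, made up for illustration.

```python
import zipfile
from pathlib import Path


def install_corpus_zip(zip_path, data_dir):
    """Unpack a manually downloaded corpus archive into <data_dir>/corpora/.

    Assumption: the archive contains a top-level folder named after the
    corpus (e.g. stopwords/), so the files end up in
    <data_dir>/corpora/stopwords/.
    """
    corpora_dir = Path(data_dir) / "corpora"
    corpora_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(corpora_dir)
    return corpora_dir


def register_data_dir(data_dir):
    """Add data_dir to NLTK's search path (requires nltk to be installed)."""
    import nltk.data

    if str(data_dir) not in nltk.data.path:
        nltk.data.path.append(str(data_dir))


# Example usage with hypothetical paths (not executed here):
# install_corpus_zip("C:/Downloads/stopwords.zip", "C:/nltk_data")
# register_data_dir("C:/nltk_data")
# from nltk.corpus import stopwords
# stop_words = set(stopwords.words("english"))
```

If the archive is unpacked into one of the directories NLTK already searches (the list in `nltk.data.path`), the `register_data_dir` step should not be needed. Note that `word_tokenize` depends on a separate tokenizer resource, which may have to be installed the same way under `tokenizers/` rather than `corpora/`.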

python-3.x

Man*_*h V

2018-08-29

3 votes

1 answer

5569 views
