Python - 查找给定文件中出现n次的单词列表

erk*_*rol 0 python counter list python-3.x

我想找到一个在给定文件中出现n次(例如200)的单词列表.为此,我使用以下代码获取文件中的每个唯一令牌,但我无法理解如何获得具有n次出现条件的那些令牌.

from collections import Counter
import re

seen = list()
words = re.findall(r'[\w+]+', open('deneme.txt').read())
seen = Counter(words).most_common()
Run Code Online (Sandbox Code Playgroud)

输出是:

[('Erke', 4), ('aç+Noun', 4), ('Antalya', 3), ('123', 3), ('ol+Verb', 3), ('Varol', 2), ('Koleji', 1), ('asdfsdf', 1), ('birak+Verb', 1)]
Run Code Online (Sandbox Code Playgroud)

例如,我想获得3次出现的令牌.我怎样才能做到这一点.我无法达到列表中的出现次数.

Dan*_*ejo 5

你可以使用列表理解:

from collections import Counter
import re

seen = list()
words = re.findall(r'[\w+]+', open('deneme.txt').read())
seen = Counter(words).most_common()

print([w for w, c in seen if c == 3])
Run Code Online (Sandbox Code Playgroud)

产量

 ['123', 'Antalya', 'ol+Verb']
Run Code Online (Sandbox Code Playgroud)