所以,我想知道是否有人知道如何组合多个术语来在NLTK中的标记器中创建单个术语..
例如,当我这样做时:
nltk.pos_tag(nltk.word_tokenize('Apple Incorporated is the largest company'))
Run Code Online (Sandbox Code Playgroud)
它给了我:
[('Apple', 'NNP'), ('Incorporated', 'NNP'), ('is', 'VBZ'), ('the', 'DT'), ('largest', 'JJS'), ('company', 'NN')]
Run Code Online (Sandbox Code Playgroud)
我如何将它与'Apple'和'Incorporated'放在一起 ('Apple Incorporated','NNP')