Spacy中的命名实体识别

sha*_*han 11 python named-entity-recognition spacy

我试图找到一个句子的命名实体,如下所示

import spacy.lang.en
parser = spacy.lang.en.English()
ParsedSentence = parser(u"Alphabet is a new startup in China")
for Entity in  ParsedSentence.ents:  
    print (Entity.label, Entity.label_, ' '.join(t.orth_ for t in Entity))
Run Code Online (Sandbox Code Playgroud)

我期待得到结果"字母","中国",但我得到一个空集作为结果.我在这做错了什么

Jan*_*nda 24

根据名称实体识别的spacy 文档,这里是提取名称实体的方法

import spacy
nlp = spacy.load('en') # install 'en' model (python3 -m spacy download en)
doc = nlp("Alphabet is a new startup in China")
print('Name Entity: {0}'.format(doc.ents))
Run Code Online (Sandbox Code Playgroud)

结果
Name Entity: (China,)
要将"Alphabet"标识为公司名称,请在其前添加"The",以便将其标识为"Noun"

doc = nlp("The Alphabet is a new startup in China")
print('Name Entity: {0}'.format(doc.ents))
Run Code Online (Sandbox Code Playgroud)

Name Entity: (Alphabet, China)