小编hib*_*ber的帖子

NLTK关系提取不返回任何内容

我最近正致力于使用nltk从文本中提取关系.所以我建立了一个示例文本:"汤姆是微软的联合创始人." 并使用以下程序测试并返回任何内容.我无法弄清楚为什么.

我使用的是NLTK版本:3.2.1,python版本:3.5.2.

这是我的代码:

import re
import nltk
from nltk.sem.relextract import extract_rels, rtuple
from nltk.tokenize import sent_tokenize, word_tokenize


def test():
    with open('sample.txt', 'r') as f:
        sample = f.read()   # "Tom is the cofounder of Microsoft"

    sentences = sent_tokenize(sample)
    tokenized_sentences = [word_tokenize(sentence) for sentence in sentences]
    tagged_sentences = [nltk.tag.pos_tag(sentence) for sentence in tokenized_sentences]

    OF = re.compile(r'.*\bof\b.*')

    for i, sent in enumerate(tagged_sentences):
        sent = nltk.chunk.ne_chunk(sent) # ne_chunk method expects one tagged sentence
        rels = extract_rels('PER', 'GPE', sent, corpus='ace', pattern=OF, window=10) 
        for rel …
Run Code Online (Sandbox Code Playgroud)

python nltk relation semantics knowledge-base-population

6
推荐指数
1
解决办法
2129
查看次数