找出段落中出现的单词

Chu*_*Nan 2 python regex string python-2.7

sentence = 'Alice was not a bit hurt, and she jumped up on to her feet in a moment.'
words = ['Alice','jumped','played']
Run Code Online (Sandbox Code Playgroud)

我可以使用filterpython中的函数来查找words所示的所有元素sentence:

print filter(lambda x: x in words,sentence.split())
Run Code Online (Sandbox Code Playgroud)

但如果在零件的空间words,.split()功能导致的错误:

words = ['Alice','jumped up','played']
Run Code Online (Sandbox Code Playgroud)

在这种情况下,'jumped up'无法找到sentence,这是不正确的.

是否有一个简单的方法可以解决问题(也许re包可以做到吗?)

Ash*_*ary 5

您可以使用正则表达式:

In [71]: import re

In [72]: words = ['Alice','jumped','played']

In [73]: [w for w in words if re.search(r'\b{}\b'.format(re.escape(w)), sentence)]
Out[73]: ['Alice', 'jumped']

In [74]: words = ['Alice','jumped up','played']

In [75]: [w for w in words if re.search(r'\b{}\b'.format(re.escape(w)), sentence)]
Out[75]: ['Alice', 'jumped up']
Run Code Online (Sandbox Code Playgroud)