将分割词和标点符号连接到正确的位置

use*_*501 9 python split join punctuation

因此,我尝试join()在将字符串拆分为单词和标点符号后使用,但它将字符串与单词和标点符号之间的空格连接起来。

b = ['Hello', ',', 'who', 'are', 'you', '?']
c = " ".join(b)

但这会返回:
c = 'Hello , who are you ?'

而且我要:
c = 'Hello, who are you?'

Mar*_*ers 8

您可以先加入标点符号:

def join_punctuation(seq, characters='.,;?!'):
    characters = set(characters)
    seq = iter(seq)
    current = next(seq)

    for nxt in seq:
        if nxt in characters:
            current += nxt
        else:
            yield current
            current = nxt

    yield current

c = ' '.join(join_punctuation(b))
Run Code Online (Sandbox Code Playgroud)

生成join_punctuation器生成已加入任何以下标点符号的字符串:

>>> b = ['Hello', ',', 'who', 'are', 'you', '?']
>>> list(join_punctuation(b))
['Hello,', 'who', 'are', 'you?']
>>> ' '.join(join_punctuation(b))
'Hello, who are you?'
Run Code Online (Sandbox Code Playgroud)