删除python中字符串中不是字母的第一个字符后面的任何内容

import re
re.split("[^A-Za-z ]|  ", "My string is #not very beautiful")[0].strip()
# 'My string is'

re.split("[^A-Za-z ]|  ", "this is the last  example")[0].strip()
# 'this is the last'

re.split("[^A-Za-z ]|  ", "Are you 9 years old?")[0].strip()
# 'Are you'

Run Code Online (Sandbox Code Playgroud)

[^A-Za-z ]|包含两种模式,第一种模式是单个字符,既不是字母也不是空格; 第二种模式是双白空间; 拆分这两种模式中的一种,拆分后的第一个元素应该是您正在寻找的.

Answer 2

ins*_*get 2

创建一个白名单，并在看到不在该白名单中的内容时停止：

import itertools
import string

def rstrip(s, whitelist=None):
    if whitelist is None:
        whitelist = set(string.ascii_letters + ' ')  # set the whitelist to a default of all letters A-Z and a-z and a space
    # split on double-whitespace and take the first split (this will work even if there's no double-whitespace in the string)
    # use `itertools.takewhile` to include the characters that in the whitelist
    # use `join` to join them inot one single string

    return ''.join(itertools.takewhile(whitelist.__contains__, s.split('  ', 1)[0]))

Run Code Online (Sandbox Code Playgroud)

归档时间：	9 年，1 月前
查看次数：	349 次
最近记录：	9 年，1 月前