
Python 3 regex with diacritics and ligatures

Names in the form "Ceasar, Julius" are to be split into First_name "Julius" and Surname "Ceasar".

Names may contain diacritics (áàé..) and ligatures (æ, ø).

This code seems to work fine in Python 3.3:

import re

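def doesmatch(pat, text):
    # re.search returns None when the pattern does not match
    match = re.search(pat, text)
    if match is None:
        print('no match for {0}'.format(text))
    else:
        # group(1) is the surname (before the comma), group(2) the first name
        print('Firstname {0} lastname {1}'.format(match.group(2), match.group(1)))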

s = 'Révèrberë, Harry'
t = 'Åapö, Renée'
u = 'C3po, Robby'
v = 'Mærsk, Efraïm'
w = 'MacDønald, Ron'
x = 'Sträßle, Mpopo'

pat = r'^([^\d\s]+), ([^\d\s]+)'
# intended to match any letter, diacritic or ligature inside the (), but not digits;
# note that [^\d\s] only excludes digits and whitespace, so punctuation is accepted too

for i in s, t, u, v, w, x:
    doesmatch(pat, i) …
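For comparison, here is a minimal sketch of a stricter variant that accepts letters only. It is not part of the original question. It assumes that each name is a single run of letters, that the input may arrive with decomposed accents (so it is normalized to NFC first), and that the whole string must match. The names `NAME_PAT` and `split_name` are hypothetical. The character class `[^\W\d_]` means "a word character that is neither a digit nor an underscore", which for `str` patterns in Python 3 comes down to Unicode letters.

```python
import re
import unicodedata

# Hypothetical stricter pattern: letters only, anchored at both ends.
# [^\W\d_] = word character that is not a digit and not an underscore.
NAME_PAT = re.compile(r'^([^\W\d_]+), ([^\W\d_]+)$')

def split_name(full_name):
    """Return (first_name, last_name) for 'Last, First' input, else None."""
    # Assumption: compose base letter + combining accent into one code point,
    # because combining marks on their own are not matched by \w.
    normalized = unicodedata.normalize('NFC', full_name)
    match = NAME_PAT.search(normalized)
    if match is None:
        return None
    return match.group(2), match.group(1)

names = ['Révèrberë, Harry', 'Åapö, Renée', 'C3po, Robby',
         'Mærsk, Efraïm', 'MacDønald, Ron', 'Sträßle, Mpopo']

for name in names:
    print(name, '->', split_name(name))
```

Under these assumptions the sketch also rejects names with apostrophes, hyphens or spaces (for example "O'Neil"), so the character class would need extending if such names are expected.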

python regex diacritics

Score: 7 · Answers: 1 · Views: 553
