我正在将C#函数转换为Python.对于与存在功能兼容的bug应该是bug.
这是该函数的正则表达式:http://[a-zA-Z0-9]+(\.[a-zA-Z0-9]+)+([-A-Z0-9a-z_$.+!*()/\\\,:;@&=?~#%]*)*.但Python无法编译它:
>>> re.compile(r"http://[a-zA-Z0-9]+(\.[a-zA-Z0-9]+)+([-A-Z0-9a-z_$.+!*()/\\\,:;@&=?~#%]*)*")
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/usr/lib/python3.3/re.py", line 214, in compile
return _compile(pattern, flags)
File "/usr/lib/python3.3/re.py", line 281, in _compile
p = sre_compile.compile(pattern, flags)
File "/usr/lib/python3.3/sre_compile.py", line 498, in compile
code = _code(p, flags)
File "/usr/lib/python3.3/sre_compile.py", line 483, in _code
_compile(code, p.data, flags)
File "/usr/lib/python3.3/sre_compile.py", line 75, in _compile
elif _simple(av) and op is not REPEAT:
File "/usr/lib/python3.3/sre_compile.py", line 362, in _simple
raise error("nothing to repeat")
sre_constants.error: nothing to repeat
Run Code Online (Sandbox Code Playgroud)
注意:该正则表达式的JavaScript版本:/http:\/\/[a-zA-Z0-9]+(\.[a-zA-Z0-9]+)+([-A-Z0-9a-z\$\.\+\!\_\*\(\)\/\,\:;@&=\?~#%]*)*/gi.
我查了一下
对不起,这是重复的帖子.nothing to repeatError,但什么也没得到.
问题出在哪儿?
Fra*_*bot 10
我用以下内容复制了错误:
re.compile(r"([A]*)*")
Run Code Online (Sandbox Code Playgroud)
问题是[A]*可能匹配空字符串.猜猜当它试图匹配会发生什么([A]*)*的时候[A]*是空的?"没有什么可重复的".不过,正则表达式引擎不会等待实际发生.它失败了,因为它甚至可能远程发生.
这应该适合你:
r"http://[a-zA-Z0-9]+(\.[a-zA-Z0-9]+)+([-A-Z0-9a-z_$.+!*()/\\\,:;@&=?~#%]*)"
Run Code Online (Sandbox Code Playgroud)
我刚删除了最后一个*.