Rob*_*ton 37
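This may go much deeper than you need and still not fully cover your use case, but the Unicode Consortium had to deal with attacks on internationalized domain names and came up with this list of homographs (characters with the same or similar rendering):
http://www.unicode.org/Public/security/latest/confusables.txt
It could at least serve as a starting point.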
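As a rough sketch of how that file might be consumed, the snippet below groups lookalike characters under the character they imitate. It assumes the `source ; target ; type # comment` layout with hexadecimal code points; `SAMPLE` is a handful of illustrative lines written in that layout rather than a verbatim excerpt, and `parse_confusables` is a hypothetical helper name.

```python
from collections import defaultdict

# Illustrative lines in the assumed layout: source ; target ; type # comment
SAMPLE = """\
# comment lines and blank lines are skipped

0441 ; 0063 ; MA # CYRILLIC SMALL LETTER ES -> LATIN SMALL LETTER C
0435 ; 0065 ; MA # CYRILLIC SMALL LETTER IE -> LATIN SMALL LETTER E
043E ; 006F ; MA # CYRILLIC SMALL LETTER O -> LATIN SMALL LETTER O
03BF ; 006F ; MA # GREEK SMALL LETTER OMICRON -> LATIN SMALL LETTER O
"""

def parse_confusables(text):
    """Map each target string to the set of source strings that resemble it."""
    lookalikes = defaultdict(set)
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        fields = [field.strip() for field in line.split(";")]
        if len(fields) < 2:
            continue  # not a data line
        # Each field may hold several space-separated hex code points.
        source = "".join(chr(int(cp, 16)) for cp in fields[0].split())
        target = "".join(chr(int(cp, 16)) for cp in fields[1].split())
        lookalikes[target].add(source)
    return lookalikes

table = parse_confusables(SAMPLE)
print(sorted(table["o"]))
```

For the real file, the same function could be fed the text of a downloaded copy, for example one opened with `encoding="utf-8-sig"` so that a leading byte order mark is dropped.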
spn*_*nzr 13
http://en.wikipedia.org/wiki/Letterlike_Symbols
It is far less comprehensive, but easier to understand.
I created a Python class to do exactly this, based on Robin's Unicode "confusables" link:
https://github.com/wanderingstan/Confusables
For example, "Hello" would be expanded into the following set of regex character classes:
[H\?\?\?\?\\\\\\\\\\\?\\\\\\?\?\?\?\?\\?\?\?\?\?]
[e\?\?\?\?\\\\\\\\\\\\\?\?\?\?\?]
[l\?\|\?\?\?1\?\?\\?\\\\\I\?\?\?\?\\\\\\\\\\\\?\?\?\?\\\\\\\\\\\\\\?\?\\\\\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\?\?\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?]
[l\?\|\?\?\?1\?\?\\?\\\\\I\?\?\?\?\\\\\\\\\\\\?\?\?\?\\\\\\\\\\\\\\?\?\\\\\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\?\?\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?]
[o\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\\\\\\\\\\\\?\?\?\?\\\\\\?\\\\\\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\?\\\\\?\ø\?\?\?\?\?\?\?\?\?\?\œ\?\?\?\?\?\?]
This regex will match "?1?"
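The expansion idea can be sketched in a few lines. This is not the code from the linked repository: `LOOKALIKES` is a tiny hand-picked table and `confusable_regex` is a hypothetical helper, both assumptions made for illustration. A real table would be built from confusables.txt.

```python
import re

# Hypothetical, hand-picked lookalikes (assumption, not the full Unicode list).
LOOKALIKES = {
    "H": "\u041d\u0397",   # Cyrillic En, Greek Eta
    "e": "\u0435",         # Cyrillic ie
    "l": "1I|",            # digit one, capital I, vertical bar
    "o": "0\u043e\u03bf",  # digit zero, Cyrillic o, Greek omicron
}

def confusable_regex(word):
    """Expand each character of word into a class of its lookalikes."""
    parts = []
    for ch in word:
        members = ch + LOOKALIKES.get(ch, "")
        # Escape every member so characters like | or ] stay literal.
        parts.append("[" + "".join(re.escape(m) for m in members) + "]")
    return "".join(parts)

pattern = re.compile(confusable_regex("Hello"))
print(bool(pattern.fullmatch("He11o")))            # digit one standing in for l
print(bool(pattern.fullmatch("\u041d\u0435llo")))  # Cyrillic En and ie
print(bool(pattern.fullmatch("Jello")))            # J is not a lookalike of H
```

Building one character class per input character keeps the pattern length linear in the word, at the cost of only handling one-to-one substitutions. Lookalikes that span several characters would need alternation groups instead of classes.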