Java REGEX代码验证印度语言字符不起作用？

Question

Java REGEX代码验证印度语言字符不起作用？

为什么下面的代码与印度语言不起作用(导致错误)？

System.out.println(Charset.forName("UTF-8").encode("??????")
                .asCharBuffer().toString().matches("\\p{L}+"));

System.out.println(Charset.forName("UTF-8").encode("??????")
                .asCharBuffer().toString().matches("\\p{L}+"));

System.out.println(Charset.forName("UTF-8").encode("???????")
                .asCharBuffer().toString().matches("\\p{L}+"));

Run Code Online (Sandbox Code Playgroud)

以上所有代码都返回false.这个正则表达式有什么问题？如何验证世界上任何unicode角色？

Answer 1

You*_*maa 5

\p{Letter}只捕获字母，但您还需要标记，您可以使用\p{Mark}.

System.out.println("??????".matches("[\\pL\\pM]+"));

Run Code Online (Sandbox Code Playgroud)

归档时间：	12 年，8 月前
查看次数：	1242 次
最近记录：	12 年，8 月前