有没有办法通过转义不需要的HTML而不是完全删除它来让jsoup清理带有HTML的字符串?我的例子:
String dirty = "This is <b>REALLY</b> dirty code from <a href="www.rubbish.url.zzzz">haxors-r-us</a>
String clean = Jsoup.clean(dirty, new Whitelist().addTags("a").addAttributes("a", "href", "name", "rel", "target"));
Run Code Online (Sandbox Code Playgroud)
这给出了一个"干净"的字符串:
This is REALLY dirty code from <a href="www.rubbish.url.zzzz">haxors-r-us</a>
Run Code Online (Sandbox Code Playgroud)
我想要的是"干净"字符串:
"This is <b>REALLY</b> dirty code from <a href="www.rubbish.url.zzzz">haxors-r-us</a>
Run Code Online (Sandbox Code Playgroud) jsoup ×1