在PHP中将ASCII转换为纯文本

e_r*_*e_r 9 html php ascii plaintext

我正在抓取一些网站,并有ASCII文本,我想转换为纯文本存储在数据库中.比如我想要的

I have got to tell anyone who will listen that this is
one of THE best adventure movies I've ever seen.
It's almost impossible to convey how pumped I am
now that I've seen it.
Run Code Online (Sandbox Code Playgroud)

转换成

I have got to tell anyone who will listen that this is
one of THE best adventure movies I've ever seen. It's
almost impossible to convey how pumped I am now that
I've seen it.
Run Code Online (Sandbox Code Playgroud)

我用手指搜索血腥,有什么帮助吗?

ash*_*108 22

你可以使用html_entity_decode:

echo html_entity_decode('...', ENT_QUOTES, 'UTF-8');
Run Code Online (Sandbox Code Playgroud)

几点说明:

  • 请注意,您实际上希望将HTML编码的字符串(具有类似实体)转换为ASCII AKA明文.

  • 此示例转换为UTF-8,它是所有ASCII字符的ASCII兼容字符编码(即char代码低于128).如果你真的想要纯ASCII(因此丢失所有带外语的重音字符和字符),你应该分别删除所有违规字符.

  • 最后一个参数('UTF-8')是保持与不同PHP版本的兼容性所必需的,因为自PHP 5.4.0以来默认值已经改变.

更新:您的文本在ideone中的示例.

Update2:按@ Daan的建议将ENT_COMPAT更改为ENT_QUOTES.