将大量无效HTML转换为有效的HTML

mat*_*ndr 0 html validation

我正在翻新一个包含大量无效HTML的网站,看起来很像这样:

<p>I was written by someone who knows a little, but not enough, 
   HTML, & now I need to be cleaned.</p>
Run Code Online (Sandbox Code Playgroud)

我需要能够转换此HTML以使其有效.转换需要是智能的,而不是内容.有什么东西可以很容易地大规模地完成这项工作吗?

Kug*_*gel 5

HTML Tidy可能会有所帮助.