How do I implement Mozilla's readability.js on my website?

Mar*_*cel 5 html javascript mozilla readability

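https://github.com/mozilla/readability (readability.js is used to create a reader view for web pages)

How can I get readability.js working on this test page? The problem is that readability.js removes the elements of this page that I want to keep, and leaves the ones that should be removed. I hope someone can help. Thanks! Is there any documentation on how to use readability.js?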

<html><head>
<title>Reader View shows only the browser in reader view</title>
    <script src="https://raw.githack.com/mozilla/readability/master/Readability.js"></script>
</head>
<body>
Everything outside the main div tag vanishes in Reader View<br>
<img class="no-print" src="http://dummyimage.com/1024x100/000/ffffff&text=This+banner+should+vanish+in+print+view">
<div>
   <h1>H1 tags outside of a p tag are hidden in reader view</h1>
   <img class="no-print" src="http://dummyimage.com/1024x100/000/ffffff&text=This+banner+is+resized+in+print+view">
   <p>
 123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789
 123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789
 123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789
 123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789
 123456789012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789
 123456789 123456
</p>
</div>
    <script>
    // Parse a clone so that Readability does not modify the live page.
    var article = new Readability(document.cloneNode(true)).parse();
    </script>
</body>
</html>

Test page source: Optimize website to show Reader View in Firefox

akk*_*hil 6

You can use DOMPurify together with Readability, as mentioned in their documentation:

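import { Readability } from '@mozilla/readability';
import DOMPurify from 'dompurify';

// Run Readability on a document and return the parsed article (or null).
function readable(doc) {
  const reader = new Readability(doc);
  return reader.parse();
}

// Readability modifies the document it is given, so parse a clone.
const cloneDoc = document.cloneNode(true);
const parsed = readable(cloneDoc);
// parse() returns null when no article is found.
const markup = parsed ? DOMPurify.sanitize(parsed.content) : '';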

markup will be an HTML string of the readable content. Try console.log(parsed) to see the available properties.
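As a rough illustration of what console.log(parsed) shows, the sketch below lists a few of the fields a parse() result carries. summarizeArticle is a hypothetical helper, not part of Readability, and the sample object is a hand-written stand-in for a real result so the snippet runs without a DOM.

```javascript
// Hypothetical helper: lists the non-empty metadata fields of a parse() result.
function summarizeArticle(parsed) {
  // parse() returns null when no article content could be extracted.
  if (parsed === null || parsed === undefined) {
    return 'No readable article found';
  }
  const fields = ['title', 'byline', 'excerpt', 'siteName', 'lang', 'dir', 'length'];
  return fields
    .filter((name) => parsed[name] !== null && parsed[name] !== undefined && parsed[name] !== '')
    .map((name) => `${name}: ${parsed[name]}`)
    .join('\n');
}

// Hand-written stand-in for a parse() result (assumed values, not real output).
const sample = {
  title: 'Reader View test',
  byline: null,
  content: '<p>123456789</p>',
  textContent: '123456789',
  length: 9,
};
console.log(summarizeArticle(sample));
```

The content field holds the article HTML and textContent the same text with the tags stripped; the remaining fields are metadata and may be null on pages that do not provide them.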


Mar*_*cel 0

OK....

    // Assumes the page has an element with id="body"; use document.body otherwise.
    document.getElementById("body").innerHTML =
        "<font face='Calibri' size='4'><h1>" + article.title + "</h1>" + article.content + "</font>";
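A slightly more defensive version of the same idea might look like the sketch below. It is only one possible approach: escapeHtml and renderReaderView are hypothetical helper names, the title is escaped because it is plain text, and article.content is inserted as-is, so it should be passed through a sanitizer such as DOMPurify first if the page is not trusted.

```javascript
// Hypothetical helper: escape plain text before placing it inside HTML.
function escapeHtml(text) {
  return String(text)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;')
    .replace(/'/g, '&#39;');
}

// Hypothetical helper: build the reader-view markup from a parse() result.
function renderReaderView(article) {
  // parse() returns null when Readability finds no article.
  if (!article) {
    return '<p>No readable content was found on this page.</p>';
  }
  return (
    '<article style="font-family: Calibri, sans-serif; font-size: 1.1em">' +
    '<h1>' + escapeHtml(article.title || '') + '</h1>' +
    (article.content || '') +
    '</article>'
  );
}

// Browser-only part: skipped when there is no DOM or Readability is not loaded.
if (typeof document !== 'undefined' && typeof Readability !== 'undefined') {
  // Parse a clone so the live page is untouched until it is replaced.
  const article = new Readability(document.cloneNode(true)).parse();
  document.body.innerHTML = renderReaderView(article);
}
```

Building the string in a separate function keeps the DOM write to a single line and makes the markup easy to check outside the browser.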