Render pdf.js pages as real HTML elements instead of canvas or SVG?

Ntw*_*ste 4 html javascript html5-canvas pdf.js pdfjs-dist

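I'm trying to build a simple mobile UI for reading PDFs, but I plan to add many features by writing my own PDF reader rather than just using the viewer made by the pdf.js team. So I'm asking whether there is any way to render a PDF as HTML with real elements, the way they do in their viewer. I'm not comfortable with canvas. Any help is appreciated, thanks in advance.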

Ntw*_*ste 7

OK, I finally found a pdf.js method called getTextContent(), which you call on each page once you have rendered it.

First, get each page of the document and render it:

PDFJS.getDocument(url)
  .then(function(pdf) {

    // Get div#container and cache it for later use
    var container = document.getElementById("container");

    // Loop from 1 to total_number_of_pages in PDF document
    for (var i = 1; i <= pdf.numPages; i++) {

        // Get desired page
        pdf.getPage(i).then(function(page) {

          var scale = 1.5;
          var viewport = page.getViewport(scale);
          var div = document.createElement("div");

          // Set id attribute with page-#{pdf_page_number} format
          div.setAttribute("id", "page-" + (page.pageIndex + 1));

          // This will keep positions of child elements as per our needs
          div.setAttribute("style", "position: relative");

          // Append div within div#container
          container.appendChild(div);

          // Create a new Canvas element
          var canvas = document.createElement("canvas");

          // Append Canvas within div#page-#{pdf_page_number}
          div.appendChild(canvas);

          var context = canvas.getContext('2d');
          canvas.height = viewport.height;
          canvas.width = viewport.width;

          var renderContext = {
            canvasContext: context,
            viewport: viewport
          };

          // Render PDF page
          page.render(renderContext);
        });
    }
});
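One thing to watch in the loop above: each page div is appended inside an asynchronous callback, so the code itself does not control the order in which the divs land in the container. A minimal sketch of one way around that is to create all the page containers up front, in page order, and look them up later. The helper name `createPageContainers` and its `doc` parameter are made up for this illustration and are not part of pdf.js.

```javascript
// Hypothetical helper (not part of pdf.js): create one placeholder div per
// page, in page order, before any page has been fetched.
// `doc` is the DOM document; it is passed in so the function is easy to test.
function createPageContainers(doc, container, numPages) {
  var pageDivs = [];
  for (var i = 1; i <= numPages; i++) {
    var div = doc.createElement("div");

    // Same id and style conventions as the loop above
    div.setAttribute("id", "page-" + i);
    div.setAttribute("style", "position: relative");

    container.appendChild(div);
    pageDivs.push(div);
  }
  return pageDivs;
}
```

Inside the getPage callback you would then pick the existing div with `pageDivs[page.pageIndex]` instead of creating and appending a new one.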

Then get the text content of each page. Remember that this continues the code above: inside the loop, modify the page.render() call like this:

// Render PDF page
page.render(renderContext)
  .then(function() {
    // Get text-fragments
    return page.getTextContent();
  })
  .then(function(textContent) {
    // Create div which will hold text-fragments
    var textLayerDiv = document.createElement("div");

    // Set its class to textLayer, which has the required CSS styles
    textLayerDiv.setAttribute("class", "textLayer");

    // Append newly created div in `div#page-#{pdf_page_number}`
    div.appendChild(textLayerDiv);

    // Create new instance of TextLayerBuilder class
    var textLayer = new TextLayerBuilder({
      textLayerDiv: textLayerDiv, 
      pageIndex: page.pageIndex,
      viewport: viewport
    });

    // Set text-fragments
    textLayer.setTextContent(textContent);

    // Render text-fragments
    textLayer.render();
  });
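As far as I know, TextLayerBuilder comes with the pdf.js viewer components rather than the core library, so it has to be loaded separately. If you would rather build the elements yourself, here is a rough sketch of how the items returned by getTextContent() could be turned into absolutely positioned spans. It assumes each item has `str` and `transform` fields and that you pass in `viewport.transform`; it ignores rotated text and real font ascent metrics, so positions are approximate. The names `multiplyTransforms`, `layoutTextItems` and `appendTextSpans` are made up for this example.

```javascript
// Multiply two 2D affine matrices stored as [a, b, c, d, e, f],
// the same six-number layout pdf.js uses for its transforms.
function multiplyTransforms(m1, m2) {
  return [
    m1[0] * m2[0] + m1[2] * m2[1],
    m1[1] * m2[0] + m1[3] * m2[1],
    m1[0] * m2[2] + m1[2] * m2[3],
    m1[1] * m2[2] + m1[3] * m2[3],
    m1[0] * m2[4] + m1[2] * m2[5] + m1[4],
    m1[1] * m2[4] + m1[3] * m2[5] + m1[5]
  ];
}

// Hypothetical helper: map text items to plain position records in
// viewport (CSS pixel) coordinates. Assumes unrotated text.
function layoutTextItems(textContent, viewportTransform) {
  return textContent.items.map(function (item) {
    var tx = multiplyTransforms(viewportTransform, item.transform);

    // Approximate the font height from the vertical scale of the matrix
    var fontHeight = Math.sqrt(tx[2] * tx[2] + tx[3] * tx[3]);

    return {
      text: item.str,
      left: tx[4],
      // tx[5] is the baseline; move up by the font height (rough estimate)
      top: tx[5] - fontHeight,
      fontSize: fontHeight
    };
  });
}

// Hypothetical helper (browser only): create one <span> per record
// inside the page div, which must have position: relative.
function appendTextSpans(pageDiv, records) {
  records.forEach(function (r) {
    var span = document.createElement("span");
    span.textContent = r.text;
    span.style.position = "absolute";
    span.style.left = r.left + "px";
    span.style.top = r.top + "px";
    span.style.fontSize = r.fontSize + "px";
    pageDiv.appendChild(span);
  });
}
```

In the second `.then` above you could then call `appendTextSpans(div, layoutTextItems(textContent, viewport.transform))` instead of using TextLayerBuilder. The spans will not line up with the canvas as precisely as the viewer's own text layer does, since that one also corrects for font metrics and text width.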

For a complete tutorial on how to do this, go here.