I need to extract text from pdf-files and have used pdfminer.six with success, extracting both text paragraphs and tables. But now I get an error related to the line
from pdfminer.pdfparser import PDFParser, PDFDocument:
Run Code Online (Sandbox Code Playgroud)
ImportError: cannot import name 'PDFDocument' from 'pdfminer.pdfparser' (C:\Users[username]\Anaconda3\lib\site-packages\pdfminer\pdfparser.py)
I'm using Anaconda Jupyter. Python 3.7.3. Package pdfminer.six-20181108
The code I'm using is based on this: How to read pdf file using pdfminer3k?
Based on advice given below I've tried to uninstall and reinstall Anaconda and pdfminer.six and …
我正在尝试解压缩作为附件发送到我的电子邮件的 DMARC 报告。它适用于 zip 文件,但不适用于 gz 文件。
\n在我的代码中,我首先按主题获取正确的电子邮件。如果主题正确,则运行此脚本:
\nvar attachments = message.getAttachments();\n for(var k in attachments){\n var attachment = attachments[k];\n var attachmentBlob = attachment.copyBlob();\n var vedleggsnavn = attachment.getName();\n Logger.log(vedleggsnavn)\n var vedleggstype = attachment.getContentType();\n Logger.log(vedleggstype)\n if(vedleggstype==\'application/gzip\'){\n Logger.log("ja gzip");\n var files = Utilities.ungzip(attachmentBlob);\n }\n if(vedleggstype==\'application/zip\'){\n Logger.log("ja zip");\n var files = Utilities.unzip(attachmentBlob);\n }\nRun Code Online (Sandbox Code Playgroud)\n如果附件类型是应用程序/zip,则会解压缩并保存在我的 Google 云端硬盘中。如果它是应用程序/gzip,我会收到错误。这是我的日志:
\n