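I took some Python code from an earlier SO question, but it was written for an older version of PDFMiner (and PDFMiner seems to have changed significantly since then). I have made some changes to resolve the errors, but now I get the following error: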
C:\Users\xxxx\Documents\Programming\Python>pdfextractor.py
Traceback (most recent call last):
  File "C:\Users\xxxx\Documents\Programming\Python\pdfextractor.py", line 71, in <module>
    pdf_to_csv(sourcefile)
  File "C:\Users\xxxx\Documents\Programming\Python\pdfextractor.py", line 55, in pdf_to_csv
    for i, page in PDFPage.get_pages(doc):
  File "C:\Program Files\Python27\lib\site-packages\pdfminer\pdfpage.py", line 119, in get_pages
    parser = PDFParser(fp)
  File "C:\Program Files\Python27\lib\site-packages\pdfminer\pdfparser.py", line 43, in __init__
    PSStackParser.__init__(self, fp)
  File "C:\Program Files\Python27\lib\site-packages\pdfminer\psparser.py", line 495, in __init__
    PSBaseParser.__init__(self, fp)
  File "C:\Program Files\Python27\lib\site-packages\pdfminer\psparser.py", line 166, in __init__
    self.seek(0)
  File "C:\Program Files\Python27\lib\site-packages\pdfminer\psparser.py", line 507, in seek
    PSBaseParser.seek(self, pos)
  File "C:\Program …
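The traceback is cut off before the final error message, so what follows is only a guess based on the frames that are visible. `get_pages` builds its own `PDFParser(fp)` and the parser immediately calls `seek(0)`, which suggests `PDFPage.get_pages` expects an open file object rather than the `doc` passed on line 55. It also appears to yield bare page objects, so unpacking `i, page` would need `enumerate`. A minimal sketch under those assumptions (the helper name `iter_pdf_pages` is made up for illustration and is not part of PDFMiner):

```python
def iter_pdf_pages(path):
    """Yield (index, page) pairs for the PDF file at `path`.

    Hypothetical helper, not part of PDFMiner. It assumes a PDFMiner
    release where PDFPage.get_pages() takes an open binary file object.
    """
    # Imported here so the helper can be defined even where pdfminer
    # is not installed; the import only runs once iteration starts.
    from pdfminer.pdfpage import PDFPage

    with open(path, "rb") as fp:
        # Pass the file object itself, not a PDFDocument: get_pages()
        # creates the PDFParser and the document internally.
        # enumerate() supplies the index that `for i, page in ...` unpacks.
        for i, page in enumerate(PDFPage.get_pages(fp)):
            yield i, page
```

With this, the loop in `pdf_to_csv` could read `for i, page in iter_pdf_pages(filename):`. If a `PDFDocument` has already been built, `PDFPage.create_pages(doc)` is the call that accepts it instead.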