完成:
解析器-tika.py
import tika
from tika import parser
parsed = parser.from_file('/path/to/file')
print parsed["metadata"]
print parsed["content"]
Run Code Online (Sandbox Code Playgroud)
错误:导入错误:无法导入名称解析器
环境设置:
TIKA_VERSION=1.13.1
TIKA_SERVER_JAR=~/parserDev/tika/tika-server-1.13.jar
TIKA_SERVER_ENDPOINT=http://localhost:8989/tika
Run Code Online (Sandbox Code Playgroud)