"Unmatched ''"' when when decoding 'string'" error when reading a JSON file into a pandas DataFrame

abh*_*bhi 5 python pandas jupyter-notebook

I am trying to load Amazon review data (a JSON file) into a pandas DataFrame using pd.read_json(), and I get the following error: Unmatched ''"' when when decoding 'string'. I am using a Jupyter notebook.

Data format:

{"reviewerID": "AGL65XWV7MH3C", "asin": "B003FMUVKO", "reviewerName": "William B. Bebout \"Acknud\"", "helpful": [0, 1], "reviewText": "Too short. I would have rated it higher if it was long enough to hold my attention! It did have significant violence but not much else.", "overall": 3.0, "summary": "Short", "unixReviewTime": 1304985600, "reviewTime": "05 10, 2011"}

Python code:

import pandas as pd
data = pd.read_json('sample_data.json', lines=True)

Phi*_*yfi 5

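I just ran into the same error, and after trying every solution I could find, the simple fix turned out to be deleting the empty line at the end of the file.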

Leaving an empty line at the end of a file is a common convention, but for some reason it seems to make pandas choke.
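If editing the file by hand is not practical, the blank lines can be skipped in code instead. The following is only a minimal sketch, assuming the file is in JSON Lines format (one object per line, as in the question). The helper name `load_json_lines` is made up for illustration and is not part of pandas. It uses the standard `json` module rather than `pd.read_json`, so a malformed line is reported together with its line number, which helps when the real cause turns out to be something other than a trailing blank line.

```python
import json


def load_json_lines(path):
    """Parse a JSON Lines file into a list of dicts, skipping blank lines.

    Hypothetical helper for illustration; not a pandas function.
    """
    records = []
    with open(path, encoding="utf-8") as fh:
        for line_no, line in enumerate(fh, start=1):
            if not line.strip():
                # Skip empty or whitespace-only lines, including a trailing one.
                continue
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError as exc:
                # Point at the offending line instead of failing without context.
                raise ValueError(f"{path}, line {line_no}: {exc}") from exc
    return records
```

The returned list can then be handed to pandas, for example `data = pd.DataFrame(load_json_lines('sample_data.json'))`.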