abh*_*bhi 5 python pandas jupyter-notebook
I am trying to load Amazon review data (a JSON file) into a pandas DataFrame using pd.read_json(), and I get the following error: Unmatched ''"' when when decoding 'string'. I am using a Jupyter notebook.
Data format:
{"reviewerID": "AGL65XWV7MH3C", "asin": "B003FMUVKO", "reviewerName": "William B. Bebout \"Acknud\"", "helpful": [0, 1], "reviewText": "Too short. I would have rated it higher if it was long enough to hold my attention! It did have significant violence but not much else.", "overall": 3.0, "summary": "Short", "unixReviewTime": 1304985600, "reviewTime": "05 10, 2011"}
Python code:
import pandas as pd
data = pd.read_json('sample_data.json', lines=True)
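I just ran into the same error, and after trying every possible solution, the simple fix turned out to be deleting the blank line at the end of the file.
Leaving a blank line at the end of a file is a common convention, but for some reason it seems to make pandas choke here.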
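If editing the file by hand is not practical, the same idea can be applied in code. The following is only a minimal sketch, assuming the file holds one JSON object per line as in the sample above; load_json_lines is a hypothetical helper name, not part of pandas. It skips blank lines and reports the line number of any record that still fails to parse, which helps if the blank line turns out not to be the only problem.

```python
import json


def load_json_lines(path):
    """Parse a JSON Lines file into a list of dicts, skipping blank lines.

    Hypothetical helper for illustration; not a pandas function.
    """
    records = []
    with open(path, encoding="utf-8") as fh:
        for line_number, line in enumerate(fh, start=1):
            # Ignore empty or whitespace-only lines, e.g. a trailing blank line.
            if not line.strip():
                continue
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError as exc:
                # Point at the offending line instead of failing opaquely.
                raise ValueError(f"{path}, line {line_number}: {exc}") from exc
    return records
```

The resulting list can then be passed to pandas, for example data = pd.DataFrame(load_json_lines('sample_data.json')).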