Write hierarchical JSON data to Excel xls from Python?

aha*_*uel 3 python json xls python-2.7 xlsxwriter

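I want to write some data from Python to xlsx. I currently have it stored as JSON, but it doesn't matter what form it takes when it leaves Python. Here is what the JSON for a single article looks like: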

{
    'Word Count': 50,
    'Key Words': ['Blah blah blah', 'Foo', ... ],
    'Frequency': [9, 12, ... ],
    'Proper Nouns': ['UN', 'USA', ... ],
    'Location': 'Mordor'
}

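I've looked at the XlsxWriter module, but I can't figure out how to translate hierarchical data whose parts are not necessarily the same size (note the number of proper nouns differs between the two data "objects").

What I want the data to look like:

Excel screenshot

Any pointers?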

Mar*_*ans 5

As your structures can be arbitrarily nested, I would suggest using recursion to achieve this:

from collections import OrderedDict
import xlsxwriter
import json

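def json_to_excel(ws, data, row=0, col=0):
    # Writes data starting at (row, col) and returns the last row used.
    if isinstance(data, list):
        # List items go one below the other in the same column.
        row -= 1
        for value in data:
            row = json_to_excel(ws, value, row+1, col)
    elif isinstance(data, dict):
        # Each key gets its own column: key on top, value(s) beneath it.
        max_row = row
        start_row = row
        # .items() works on both Python 2 and 3 (.iteritems() is Python 2 only)
        for key, value in data.items():
            row = start_row
            ws.write(row, col, key)
            row = json_to_excel(ws, value, row+1, col)
            max_row = max(max_row, row)
            col += 1
        # Continue below the tallest column.
        row = max_row
    else:
        # Scalar value: write it into a single cell.
        ws.write(row, col, data)

    return row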

text = """
[
    {
        "Source ID": 123,
        "WordCount": 50,
        "Key Words": ["Blah blah blah", "Foo"],
        "Frequency": [9, 12, 1, 2, 3],
        "Proper Nouns": ["UN", "USA"],
        "Location": "Mordor"
    },
    {
        "Source ID": 124,
        "WordCount": 50,
        "Key Words": ["Blah blah blah", "Foo"],
        "Frequency": [9, 12, 1, 2, 3],
        "Proper Nouns": ["UN", "USA"],
        "Location": "Mordor"
    }
]
"""

data = json.loads(text, object_pairs_hook=OrderedDict)
wb = xlsxwriter.Workbook("output.xlsx")
ws = wb.add_worksheet()
json_to_excel(ws, data)
wb.close()  

This would give you an output file looking like:

Excel screenshot
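
The layout can also be inspected without opening Excel. The following is only a sketch: `RecordingSheet` is a hypothetical stand-in (not part of XlsxWriter) that records `write()` calls in a dict, and the sample data is shortened and deliberately ragged to mimic the question. The function is the same recursive one, using `.items()` so that it runs on Python 3 as well.

```python
from collections import OrderedDict
import json


class RecordingSheet(object):
    """Hypothetical stand-in for a worksheet: it only records write() calls."""

    def __init__(self):
        self.cells = {}

    def write(self, row, col, value):
        self.cells[(row, col)] = value


def json_to_excel(ws, data, row=0, col=0):
    # Writes data starting at (row, col) and returns the last row used.
    if isinstance(data, list):
        row -= 1
        for value in data:
            row = json_to_excel(ws, value, row + 1, col)
    elif isinstance(data, dict):
        max_row = row
        start_row = row
        for key, value in data.items():
            row = start_row
            ws.write(row, col, key)
            row = json_to_excel(ws, value, row + 1, col)
            max_row = max(max_row, row)
            col += 1
        row = max_row
    else:
        ws.write(row, col, data)
    return row


# Shortened, ragged sample data (made up for illustration).
text = """[
  {"Source ID": 123, "Key Words": ["Blah blah blah", "Foo"], "Frequency": [9, 12, 1]},
  {"Source ID": 124, "Key Words": ["Foo"], "Frequency": [9, 12]}
]"""

data = json.loads(text, object_pairs_hook=OrderedDict)
sheet = RecordingSheet()
last_row = json_to_excel(sheet, data)

# Print the recorded cells as a tab-separated grid; empty cells stay blank.
n_cols = max(c for _, c in sheet.cells) + 1
for r in range(last_row + 1):
    print("\t".join(str(sheet.cells.get((r, c), "")) for c in range(n_cols)))
```

Each record gets its own header row, and the next record starts below the longest list of the previous one, so here the second header row lands on row 4 (0-indexed). Shorter columns simply leave cells blank.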