aha*_*uel 3 python json xls python-2.7 xlsxwriter
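I want to write some data from Python to xlsx. I currently store it as JSON, but the format it comes out of Python in doesn't matter. The JSON for a single article looks like this: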
{
    'Word Count': 50,
    'Key Words': ['Blah blah blah', 'Foo', ... ],
    'Frequency': [9, 12, ... ],
    'Proper Nouns': ['UN', 'USA', ... ],
    'Location': 'Mordor'
}
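I looked at the XlsxWriter module, but couldn't figure out how to translate hierarchical data whose parts are not necessarily the same size (note the number of proper nouns in the two data "objects").
What I want the data to look like:
Any pointers?
Since your structure can be arbitrarily nested, I suggest using recursion to do this: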
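from collections import OrderedDict
import xlsxwriter
import json

def json_to_excel(ws, data, row=0, col=0):
    """Write data at (row, col) and return the last row that was used."""
    if isinstance(data, list):
        # List items go one below another in the same column.
        row -= 1
        for value in data:
            row = json_to_excel(ws, value, row+1, col)
    elif isinstance(data, dict):
        # Each key gets its own column: key on top, value(s) underneath.
        max_row = row
        start_row = row
        # items() works on both Python 2 and 3.
        for key, value in data.items():
            row = start_row
            ws.write(row, col, key)
            row = json_to_excel(ws, value, row+1, col)
            max_row = max(max_row, row)
            col += 1
        row = max_row
    else:
        # Scalar value: write it into a single cell.
        ws.write(row, col, data)

    return row
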
text = """
[
    {
        "Source ID": 123,
        "WordCount": 50,
        "Key Words": ["Blah blah blah", "Foo"],
        "Frequency": [9, 12, 1, 2, 3],
        "Proper Nouns": ["UN", "USA"],
        "Location": "Mordor"
    },
    {
        "Source ID": 124,
        "WordCount": 50,
        "Key Words": ["Blah blah blah", "Foo"],
        "Frequency": [9, 12, 1, 2, 3],
        "Proper Nouns": ["UN", "USA"],
        "Location": "Mordor"
    }
]
"""

data = json.loads(text, object_pairs_hook=OrderedDict)
wb = xlsxwriter.Workbook("output.xlsx")
ws = wb.add_worksheet()
json_to_excel(ws, data)
wb.close()
This will give you an output file that looks like this:
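The layout can also be checked without opening the file in Excel. The following is only a minimal sketch: `RecordingSheet` and `render` are hypothetical helpers (not part of XlsxWriter) that record each `write(row, col, value)` call and print the cells as tab-separated text. The traversal repeats the recursive approach from the answer, using `.items()` so it runs on Python 3, and the two-record sample is shortened for illustration.

```python
import json
from collections import OrderedDict


class RecordingSheet(object):
    """Hypothetical stand-in for a worksheet: it only records write() calls."""

    def __init__(self):
        self.cells = {}

    def write(self, row, col, value):
        self.cells[(row, col)] = value


def json_to_excel(ws, data, row=0, col=0):
    """Same recursive traversal as in the answer; returns the last row used."""
    if isinstance(data, list):
        row -= 1
        for value in data:
            row = json_to_excel(ws, value, row + 1, col)
    elif isinstance(data, dict):
        max_row = row
        start_row = row
        for key, value in data.items():
            row = start_row
            ws.write(row, col, key)
            row = json_to_excel(ws, value, row + 1, col)
            max_row = max(max_row, row)
            col += 1
        row = max_row
    else:
        ws.write(row, col, data)
    return row


def render(cells):
    """Return the recorded cells as tab-separated text, one line per row."""
    n_rows = max(r for r, _ in cells) + 1
    n_cols = max(c for _, c in cells) + 1
    lines = []
    for r in range(n_rows):
        lines.append("\t".join(str(cells.get((r, c), "")) for c in range(n_cols)))
    return "\n".join(lines)


# Shortened sample data (an assumption for illustration, not the full input above).
sample = json.loads(
    '[{"Source ID": 123, "Key Words": ["a", "b"]},'
    ' {"Source ID": 124, "Key Words": ["c"]}]',
    object_pairs_hook=OrderedDict,
)
sheet = RecordingSheet()
last_row = json_to_excel(sheet, sample)
print(render(sheet.cells))
```

With this sample, each record's keys are written as a header row directly above that record's values, every key gets its own column, and list items run downward, so the second record starts on the row below the longest list of the first.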