jbr*_*own 6 python json google-bigquery google-cloud-platform
我通过google-cloud-python如下查询BigQuery :
client = bigquery.Client()
query = """SELECT * FROM `{dataset}.{table}`
WHERE id=@id LIMIT 1""".format(dataset=dataset,
table=table)
param = ScalarQueryParameter('id', 'STRING', id)
query = client.run_sync_query(query, query_parameters=[param])
query.use_legacy_sql = False
query.timeout_ms = 1000
query.run()
assert query.complete
try:
results = query.rows[0]
except IndexError:
results = None
Run Code Online (Sandbox Code Playgroud)
返回的数据如下:
[
"Tue, 11 Apr 2017 03:18:52 GMT",
"A132",
"United Kingdom",
[
{
"endDate": "2012-12-05",
"startDate": "2011-12-27",
"statusCode": "Terminated"
}
]
]
Run Code Online (Sandbox Code Playgroud)
重复的字段已转换为JSON。但我也希望将其余数据也转换为JSON。我可以通过检查自己实现此功能,query.schema但是似乎应该在库中,因为对于重复的元素它已经发生了。
如何使用此库获取格式化为JSON的BigQuery查询结果?例如:
{
"timestamp": "Tue, 11 Apr 2017 03:18:52 GMT",
"id": "A132",
"country": "United Kingdom",
[
{
"endDate": "2012-12-05",
"startDate": "2011-12-27",
"statusCode": "Terminated"
}
]
}
Run Code Online (Sandbox Code Playgroud)
As it turns out, the code is simple enough:
field_names = [f.name for f in query.schema]
try:
raw_results = query.rows[0]
zipped_results = zip(field_names, raw_results)
results = {x[0]: x[1] for x in zipped_results}
except IndexError:
results = None
Run Code Online (Sandbox Code Playgroud)