使用google-cloud-python从BigQuery以JSON格式获取结果

jbr*_*own 6 python json google-bigquery google-cloud-platform

我通过google-cloud-python如下查询BigQuery :

client = bigquery.Client()

query = """SELECT * FROM `{dataset}.{table}`
  WHERE id=@id LIMIT 1""".format(dataset=dataset,
                                 table=table)

param = ScalarQueryParameter('id', 'STRING', id)
query = client.run_sync_query(query, query_parameters=[param])
query.use_legacy_sql = False
query.timeout_ms = 1000
query.run()

assert query.complete

try:
    results = query.rows[0]
except IndexError:
    results = None
Run Code Online (Sandbox Code Playgroud)

返回的数据如下:

[
    "Tue, 11 Apr 2017 03:18:52 GMT",
    "A132",
    "United Kingdom",
    [
        {
            "endDate": "2012-12-05",
            "startDate": "2011-12-27",
            "statusCode": "Terminated"
        }
    ]
]
Run Code Online (Sandbox Code Playgroud)

重复的字段已转换为JSON。但我也希望将其余数据也转换为JSON。我可以通过检查自己实现此功能,query.schema但是似乎应该在库中,因为对于重复的元素它已经发生了。

如何使用此库获取格式化为JSON的BigQuery查询结果?例如:

{
    "timestamp": "Tue, 11 Apr 2017 03:18:52 GMT",
    "id": "A132",
    "country": "United Kingdom",
    [
        {
            "endDate": "2012-12-05",
            "startDate": "2011-12-27",
            "statusCode": "Terminated"
        }
    ]
}
Run Code Online (Sandbox Code Playgroud)

jbr*_*own 2

As it turns out, the code is simple enough:

field_names = [f.name for f in query.schema]

try:
    raw_results = query.rows[0]
    zipped_results = zip(field_names, raw_results)
    results = {x[0]: x[1] for x in zipped_results}
except IndexError:
    results = None
Run Code Online (Sandbox Code Playgroud)