Dev*_*ter 6 alias join prefix google-bigquery bigquery-standard-sql
在Google BigQuery(使用#standardSQL)上,当2个表之间存在Join时,我需要对每个表的所有列应用固定的前缀。
这是场景,我有这样的结构:
#standardSQL
WITH user AS (
SELECT "john" as name, "smith" as surname, 1 as parent
UNION ALL
SELECT "maggie" as name, "smith" as surname, 2 as parent
),
parent AS (
SELECT 1 as id, "john" as name, "doe" as surname
UNION ALL
SELECT 2 as id, "jane" as name, "smith" as surname
)
Run Code Online (Sandbox Code Playgroud)
用户表
+-----+--------+---------+--------+
| Row | name | surname | parent |
+-----+--------+---------+--------+
| 1 | john | smith | 1 |
| 2 | maggie | smith | 2 |
+-----+--------+---------+--------+
Run Code Online (Sandbox Code Playgroud)
父表
+-----+----+------+---------+
| Row | id | name | surname |
+-----+----+------+---------+
| 1 | 1 | john | doe |
| 2 | 2 | jane | smith |
+-----+----+------+---------+
Run Code Online (Sandbox Code Playgroud)
这样的查询
SELECT u.*, p.* FROM user u JOIN parent p ON u.parent = p.id
Run Code Online (Sandbox Code Playgroud)
产生以下错误
Error: Duplicate column names in the result are not supported. Found duplicate(s): name, surname
Run Code Online (Sandbox Code Playgroud)
我想避免像这样对表执行自定义别名
SELECT
u.name as user_name,
u.surname as user_surname,
p.name as parent_name,
p.surname as parent_surname
FROM user u JOIN parent p ON u.parent = p.id
+-----+-----------+--------------+-------------+----------------+
| Row | user_name | user_surname | parent_name | parent_surname |
+-----+-----------+--------------+-------------+----------------+
| 1 | john | smith | john | doe |
| 2 | maggie | smith | jane | smith |
+-----+-----------+--------------+-------------+----------------+
Run Code Online (Sandbox Code Playgroud)
如果表将在字段上更改,我将需要每次都编辑该语句(或多个语句),以便应用具有给定前缀的新字段。所以这种使用固定列名的方法不是合适的方法
为了获得上面提到的表,查询运算符是否有办法自动应用前缀?就像是:
SELECT u.* AS user_*, p.* AS parent_*
FROM user u JOIN parent p ON u.parent = p.id
Run Code Online (Sandbox Code Playgroud)
到目前为止我能想到的唯一选择如下
#standardSQL
WITH user AS (
SELECT "john" AS name, "smith" AS surname, 1 AS parent UNION ALL
SELECT "maggie" AS name, "smith" AS surname, 2 AS parent
), parent AS (
SELECT 1 AS id, "john" AS name, "doe" AS surname UNION ALL
SELECT 2 AS id, "jane" AS name, "smith" AS surname
)
SELECT user, parent
FROM user
JOIN parent
ON user.parent = parent.id
Run Code Online (Sandbox Code Playgroud)
结果为
Row user.name user.surname user.parent parent.id parent.name parent.surname
1 john smith 1 1 john doe
2 maggie smith 2 2 jane smith
Run Code Online (Sandbox Code Playgroud)
它不是您所期望的,而是最接近它的,因为它将各个连接表中的每一行包装到各个 STRUCT 中 - 例如:
{
"user": {"name": "john", "surname": "smith","parent": "1"},
"parent": {"id": "1","name": "john","surname": "doe"}
}
Run Code Online (Sandbox Code Playgroud)