jua*_*osl 3 google-cloud-platform google-cloud-spanner
In a query, if I use LIKE '<value>%' on the primary key, it performs well and uses the index:
Operator | Rows returned | Executions | Latency
-- | -- | -- | --
Serialize Result | 32 | 1 | 1.80 ms
Sort | 32 | 1 | 1.78 ms
Hash Aggregate | 32 | 1 | 1.73 ms
Distributed union | 32 | 1 | 1.61 ms
Hash Aggregate | 32 | 1 | 1.56 ms
Distributed union | 128 | 1 | 1.34 ms
Compute | - | - | -
FilterScan | 128 | 1 | 1.33 ms
Table Scan: <tablename> | 128 | 1 | 1.30 ms
However, using LIKE '<value>_' performs a full table scan:
Operator | Rows returned | Executions | Latency
-- | -- | -- | --
Serialize Result | 32 | 1 | 76.27 s
Sort | 32 | 1 | 76.27 s
Hash Aggregate | 32 | 1 | 76.27 s
Distributed union | 32 | 1 | 76.27 s
Hash Aggregate | 32 | 2 | ~72.18 s
Distributed union | 128 | 2 | ~72.18 s
Compute | - | - | -
FilterScan | 128 | 2 | ~72.18 s
Table Scan: <tablename> (full scan: true) | 13802624 | 2 | ~69.97 s
The query looks like this:
SELECT
'aggregated-quadkey AS quadkey' AS quadkey, day,
SUM(a_value_1), SUM(a_value_2), AVG(a_value_3), SUM(a_value_4), SUM(a_value_5), AVG(a_value_6), AVG(a_value_6), AVG(a_value_7), SUM(a_value_8), SUM(a_value_9), AVG(a_value_10), SUM(a_value_11), SUM(a_value_12), AVG(a_value_13), AVG(a_value_14), AVG(a_value_15), SUM(a_value_16), SUM(a_value_17), AVG(a_value_18), SUM(a_value_19), SUM(a_value_20), AVG(a_value_21), AVG(a_value_22), AVG(a_value_23)
FROM <tablename>
WHERE quadkey LIKE '03201012212212322_'
GROUP BY quadkey, day ORDER BY day
For prefix-matching LIKE patterns (column LIKE 'xxx%'), the query optimizer internally converts the condition to STARTS_WITH(column, 'xxx'), which can then use the index.
So the likely cause is that the query optimizer is not smart enough to convert an exact-length prefix-matching LIKE pattern such as
column LIKE 'xxx_'
into the combined condition:
(STARTS_WITH(column, 'xxx') AND CHAR_LENGTH(column)=4)
Similarly, a pattern such as
`column LIKE 'abc%def'`
is not optimized into the combined condition:
`(STARTS_WITH(column,'abc') AND ENDS_WITH(column,'def'))`.
You can always work around this by emitting the combined conditions above yourself when generating the SQL.
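(This assumes the LIKE pattern is a string literal in the query rather than a parameter. A LIKE with a parameterized pattern cannot be optimized this way, because the pattern is not known at query compile time.)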
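As a rough sketch of that workaround applied to the query from the question: the pattern '03201012212212322_' is a 17-character prefix followed by exactly one more character, so the matching keys should be those that start with the prefix and are 18 characters long. The select list is shortened here for readability, `<tablename>` is the placeholder from the question, and the sketch assumes quadkey is a STRING column. Whether the plan actually switches to a key-range scan is worth confirming in the query plan.

```sql
-- Hand-rewritten equivalent of: WHERE quadkey LIKE '03201012212212322_'
-- Assumption: quadkey is a STRING column, so CHAR_LENGTH counts characters.
SELECT
  quadkey,
  day,
  SUM(a_value_1) AS sum_value_1  -- remaining aggregates omitted for brevity
FROM <tablename>
WHERE STARTS_WITH(quadkey, '03201012212212322')  -- 17-character prefix
  AND CHAR_LENGTH(quadkey) = 18                  -- prefix plus the one character matched by '_'
GROUP BY quadkey, day
ORDER BY day
```

The same approach should carry over to the 'abc%def' case by combining STARTS_WITH(column, 'abc') with ENDS_WITH(column, 'def').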