moh*_*ith 2 sql postgresql indexing ruby-on-rails query-performance
我有一个 PostgreSQL 字符串数组作为表中的一列。我使用 GIN 方法创建了一个索引。但任何查询都不会使用索引(相反,它们使用过滤器对整个表进行顺序扫描)。我缺少什么?
这是我的迁移:
class CreateDocuments < ActiveRecord::Migration
def up
create_table :documents do |t|
t.string :title
t.string :tags, array: true, default: []
t.timestamps
end
add_index :documents, :tags, using: 'gin'
(1..100000).each do |i|
tags = []
tags << 'even' if (i % 2) == 0
tags << 'odd' if (i % 2) == 1
tags << 'divisible by 3' if (i % 3) == 0
tags << 'divisible by 4' if (i % 4) == 0
tags << 'divisible by 5' if (i % 5) == 0
Document.create(
title: i,
tags: tags
)
end
end
def down
drop_table :documents
end
end
Run Code Online (Sandbox Code Playgroud)
这是我的查询,以及结果行数。
Document.where("'divisible by 5' = ANY (tags)").explain
Document Load (249.8ms) SELECT "documents".* FROM "documents" WHERE ('divisible by 5' = ANY (tags))
D, [2014-03-07T17:09:49.689709 #41937] DEBUG -- : Document Load (249.8ms) SELECT "documents".* FROM "documents" WHERE ('divisible by 5' = ANY (tags))
=> EXPLAIN for: SELECT "documents".* FROM "documents" WHERE ('divisible by 5' = ANY (tags))
QUERY PLAN
-----------------------------------------------------------------
Seq Scan on documents (cost=0.00..3500.00 rows=20057 width=69)
Filter: ('divisible by 5'::text = ANY ((tags)::text[]))
(2 rows)
Document.where("'divisible by 5' = ANY (tags)").length
Document Load (258.0ms) SELECT "documents".* FROM "documents" WHERE ('divisible by 5' = ANY (tags))
D, [2014-03-07T17:09:55.536517 #41937] DEBUG -- : Document Load (258.0ms) SELECT "documents".* FROM "documents" WHERE ('divisible by 5' = ANY (tags))
=> 20000
Run Code Online (Sandbox Code Playgroud)
要使用 GIN 索引,请使用<@(“包含于”)运算符而不是ANY构造。
手册中指出,默认 GIN 索引当前仅支持这些运算符(附加功能随扩展一起提供):
<@
@>
=
&&
Run Code Online (Sandbox Code Playgroud)
所以尝试这个查询:
Document.where("'{divisible by 5}' <@ tags").explain
Run Code Online (Sandbox Code Playgroud)
请注意,左侧array notation也需要位于 中,即使它是单个元素。该运算符<@适用于数组。因此'{divisible by 5}'。