Redshift JSON performance

Question

bur*_*mre 1 json amazon-web-services amazon-redshift

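Redshift provides a set of JSON functions for working with JSON data, but has anyone tested how JSON performs compared with regular table columns? My main reason for using Redshift is its columnar nature, because I want to run complex aggregations over analytics data stored in Redshift.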

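According to the JSON functions page in the Redshift documentation, it seems that Redshift does not store data of the JSON type in a columnar format. (link) However, I know of some Redshift-based analytics services that store their data as JSON. Has anyone benchmarked the JSON format in Redshift? Is it really worth using, even though it gives up Redshift's main benefit?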

Answer 1

jdw*_*yah 8

One data point:

Cluster: 24 x dc1.large. Table: 2.4B rows.

select count(*) from table where json_extract_path_text(what_extra_json, 'is_logged_in') is not null
   count
------------
 1364239102

24.7sec

select count(*) from table where plan is not null
   count
------------
 1370337529

6.1sec
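
If one JSON key is filtered on often, a possible way to avoid parsing the JSON string on every row is to extract that key once into an ordinary column and query the column instead. The following is only a sketch under assumptions: the table names `events` and `events_flat` are hypothetical, while `what_extra_json`, `plan`, and the `is_logged_in` key are taken from the queries above. It has not been benchmarked, so the timing of the second query is not known.

```sql
-- Hypothetical table names: events (source) and events_flat (copy).
-- what_extra_json, plan, and the 'is_logged_in' key come from the queries above.
CREATE TABLE events_flat AS
SELECT
    plan,
    what_extra_json,
    -- Extract the key once, at load time, into an ordinary column.
    json_extract_path_text(what_extra_json, 'is_logged_in') AS is_logged_in
FROM events;

-- Later filters read the plain column instead of parsing JSON per row.
SELECT count(*)
FROM events_flat
WHERE is_logged_in IS NOT NULL;
```

The trade-off is extra storage and a copy step in exchange for column-style scans on the extracted key; keys that are rarely queried can stay in the JSON string.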

Asked:	10 years, 8 months ago
Viewed:	1131 times
Last active:	10 years, 5 months ago