{
"size": 0,
"aggs": {
"categories_agg": {
"terms": {
"field": "categories",
"order": {
"_count": "desc"
}
}
}
}
}
Run Code Online (Sandbox Code Playgroud)
为了获取特定字段的聚合,我使用了上面给出的查询。它工作正常并给出如下结果:
{
"took": 10,
"timed_out": false,
"_shards": {
"total": 5,
"successful": 5,
"failed": 0
},
"hits": {
"total": 77445,
"max_score": 0,
"hits": []
},
"aggregations": {
"categories_agg": {
"doc_count_error_upper_bound": 794,
"sum_other_doc_count": 148316,
"buckets": [
{
"key": "Restaurants",
"doc_count": 25071
},
{
"key": "Shopping",
"doc_count": 11233
},
{
"key": "Food",
"doc_count": 9250
},
{
"key": "Beauty & Spas",
"doc_count": 6583
},
{
"key": "Health & Medical",
"doc_count": 5121
},
{
"key": "Nightlife",
"doc_count": 5088
},
{
"key": "Home Services",
"doc_count": 4785
},
{
"key": "Bars",
"doc_count": 4328
},
{
"key": "Automotive",
"doc_count": 4208
},
{
"key": "Local Services",
"doc_count": 3468
}
]
}
}
}
Run Code Online (Sandbox Code Playgroud)
有没有一种方法可以过滤聚合,以便我可以获得doc_count
每个存储桶的特定范围内的存储桶?
例如,使用范围过滤器来确定doc_count
max25000
和 min5000
应该给我
{
"took": 10,
"timed_out": false,
"_shards": {
"total": 5,
"successful": 5,
"failed": 0
},
"hits": {
"total": 77445,
"max_score": 0,
"hits": []
},
"aggregations": {
"categories_agg": {
"doc_count_error_upper_bound": 794,
"sum_other_doc_count": 148316,
"buckets": [
{
"key": "Shopping",
"doc_count": 11233
},
{
"key": "Food",
"doc_count": 9250
},
{
"key": "Beauty & Spas",
"doc_count": 6583
},
{
"key": "Health & Medical",
"doc_count": 5121
},
{
"key": "Nightlife",
"doc_count": 5088
}
]
}
}
}
Run Code Online (Sandbox Code Playgroud)
小智 6
我通过buckets_selector解决了这个问题。我们可以在脚本中过滤计数。
```
"aggs": {
"categories_agg": {
"terms": {
"field": "cel_num",
"size": 5000,
"min_doc_count":1
},
"aggs": {
"count_bucket_selector": {
"bucket_selector": {
"buckets_path": {
"count": "_count"
},
"script": {
"lang":"expression",
"inline": "count>5000 && count <10000"
}
}
}
}
}
}
```
Run Code Online (Sandbox Code Playgroud)
归档时间: |
|
查看次数: |
3616 次 |
最近记录: |