kao*_*ify 2 mongodb mongodb-query aggregation-framework
我在 MongoDB 中有一个数据集,如下所示:
{ "name": "Tom's", "category": "coffee shop" },
{ "name": "Red Lobster", "category": "restaurant" },
{ "name": "Tom's", "category": "coffee shop" },
{ "name": "Starbucks", "category": "coffee shop" },
{ "name": "Central Park", "category": "park" },
{ "name": "Office", "category": "office" },
{ "name": "Red Lobster", "category": "restaurant" },
{ "name": "Home", "category": "home" },
{ ... } // and so on
Run Code Online (Sandbox Code Playgroud)
如何找到特定类别最常见的值?例如,最常见出现的值coffee shop和restaurant应该分别是 Tom's 和 Red Lobster。
我当前的$aggregate查询似乎只列出了所有数据集中最常见的值:
db.collection.aggregate(
{ "$group": { "_id": { "name": "$name" }, "count": { "$sum":1 } }},
{ "$group": { "_id": "$_id.name", "count": { "$sum": "$count" } }},
{ "$sort": { "count":-1 }}
)
Run Code Online (Sandbox Code Playgroud)
您可以尝试以下查询。
$group在类别和名称上获取每个类别和名称组合的计数。
$sort按类别和计数描述输入文档。
$group在类别上$first选择出现次数最多的文档。
db.collection_name.aggregate([
{
"$group": {
"_id": {
"category": "$category",
"name": "$name"
},
"count": {
"$sum": 1
}
}
},
{
"$sort": {
"_id.category": 1,
"count": -1
}
},
{
"$group": {
"_id": {
"category": "$_id.category"
},
"name": {
"$first": "$_id.name"
},
"count": {
"$first": "$count"
}
}
}
])
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
6793 次 |
| 最近记录: |