如何在 MongoDB 中查找特定类别的最常见值?

kao*_*ify 2 mongodb mongodb-query aggregation-framework

我在 MongoDB 中有一个数据集,如下所示:

{ "name": "Tom's", "category": "coffee shop" },
{ "name": "Red Lobster", "category": "restaurant" },
{ "name": "Tom's", "category": "coffee shop" },
{ "name": "Starbucks", "category": "coffee shop" },
{ "name": "Central Park", "category": "park" },
{ "name": "Office", "category": "office" },
{ "name": "Red Lobster", "category": "restaurant" },
{ "name": "Home", "category": "home" },
{ ... } // and so on
Run Code Online (Sandbox Code Playgroud)

如何找到特定类别最常见的值?例如,最常见出现的值coffee shoprestaurant应该分别是 Tom's 和 Red Lobster。

我当前的$aggregate查询似乎只列出了所有数据集中最常见的值:

db.collection.aggregate(
{ "$group": { "_id": { "name": "$name" }, "count": { "$sum":1 } }}, 
{ "$group": { "_id": "$_id.name", "count": { "$sum": "$count" } }}, 
{ "$sort": { "count":-1 }}
)
Run Code Online (Sandbox Code Playgroud)

use*_*814 5

您可以尝试以下查询。

$group在类别和名称上获取每个类别和名称组合的计数。

$sort按类别和计数描述输入文档。

$group在类别上$first选择出现次数最多的文档。

db.collection_name.aggregate([
  {
    "$group": {
      "_id": {
        "category": "$category",
        "name": "$name"
      },
      "count": {
        "$sum": 1
      }
    }
  },
  {
    "$sort": {
      "_id.category": 1,
      "count": -1
    }
  },
  {
    "$group": {
      "_id": {
        "category": "$_id.category"
      },
      "name": {
        "$first": "$_id.name"
      },
      "count": {
        "$first": "$count"
      }
    }
  }
])
Run Code Online (Sandbox Code Playgroud)