MongoDB中的条件分组

oko*_*oko 11 mongodb mongodb-query aggregation-framework

我在MongoDB中有一系列文档(检查事件),如下所示:

{
    "_id" : ObjectId("5397a78ab87523acb46f56"),
    "inspector_id" : ObjectId("5397997a02b8751dc5a5e8b1"),
    "status" : 'defect',
    "utc_timestamp" : ISODate("2014-06-11T00:49:14.109Z")
}

{
    "_id" : ObjectId("5397a78ab87523acb46f57"),
    "inspector_id" : ObjectId("5397997a02b8751dc5a5e8b2"),
    "status" : 'ok',
    "utc_timestamp" : ISODate("2014-06-11T00:49:14.109Z")
}
Run Code Online (Sandbox Code Playgroud)

我需要得到一个如下所示的结果集:

[
  {
    "date" : "2014-06-11",
    "defect_rate" : '.92' 
  },  
  {
    "date" : "2014-06-11",
    "defect_rate" : '.84' 
  }, 
]
Run Code Online (Sandbox Code Playgroud)

换句话说,我需要每天获得平均缺陷率.这可能吗?

Nei*_*unn 18

聚合框架是您想要的:

db.collection.aggregate([
    { "$group": {
        "_id": {
            "year": { "$year": "$utc_timestamp" },
            "month": { "$month": "$utc_timestamp" },
            "day": { "$dayOfMonth": "$utc_timestamp" },
        },
        "defects": {
            "$sum": { "$cond": [
                { "$eq": [ "$status", "defect" ] },
                1,
                0
            ]}
        },
        "totalCount": { "$sum": 1 }
    }},
    { "$project": {
        "defect_rate": {
            "$cond": [
                { "$eq": [ "$defects", 0 ] },
                0,
                { "$divide": [ "$defects", "$totalCount" ] }
            ]
        }
    }}
])
Run Code Online (Sandbox Code Playgroud)

因此,首先使用日期聚合运算符对当天进行分组,并在给定日期获取项目的totalCount.$cond此处操作符的使用确定"状态"是否实际上是缺陷,并且结果是$sum仅计算"缺陷"值的条件.

一旦每天对这些进行分组,您只需$divide将结果与另一个进行检查,$cond以确保您没有除以零.