mongodb MongoDB 聚合 group

MongoDB中聚合(aggregate)主要用于处理数据(诸如统计平均值,求和等)，并返回计算后的数据结果。有点类似sql语句中的 count(*)。

基本语法为：db.collection.aggregate( [ <stage1>, <stage2>, ... ] )

现在在mycol集合中有以下数据：

{ "_id" : 1, "name" : "tom", "sex" : "男", "score" : 100, "age" : 34 }
{ "_id" : 2, "name" : "jeke", "sex" : "男", "score" : 90, "age" : 24 }
{ "_id" : 3, "name" : "kite", "sex" : "女", "score" : 40, "age" : 36 }
{ "_id" : 4, "name" : "herry", "sex" : "男", "score" : 90, "age" : 56 }
{ "_id" : 5, "name" : "marry", "sex" : "女", "score" : 70, "age" : 18 }
{ "_id" : 6, "name" : "john", "sex" : "男", "score" : 100, "age" : 31 }

1、$sum 计算总和。

　　Sql: select sex,count(*) from mycol group by sex

　　MongoDb: db.mycol.aggregate([{

sex', personCount: {$sum: 1}}}])

　　Sql: select sex,sum(score) totalScore from mycol group by sex

　　MongoDb: db.mycol.aggregate([{

sex', totalScore: {

score'}}}])

2、$avg 计算平均值

　　Sql: select sex,avg(score) avgScore from mycol group by sex

　　Mongodb: db.mycol.aggregate([{

score'}}}])

3、$max 获取集合中所有文档对应值得最大值。

　　Sql: select sex,max(score) maxScore from mycol group by sex

　　Mongodb: db.mycol.aggregate([{

score'}}}])

4、$min 获取集合中所有文档对应值得最小值。

　　Sql: select sex,min(score) minScore from mycol group by sex

　　Mongodb: db.mycol.aggregate([{

score'}}}])

5、$push 把文档中某一列对应的所有数据插入值到一个数组中。

　　Mongodb: db.mycol.aggregate([{

score'}}}])

6、$addToSet 把文档中某一列对应的所有数据插入值到一个数组中,去掉重复的

　　db.mycol.aggregate([{

score'}}}])

7、 $first 根据资源文档的排序获取第一个文档数据。

　　 db.mycol.aggregate([{

name'}}}])

8、 $last 根据资源文档的排序获取最后一个文档数据。

　　 db.mycol.aggregate([{

name'}}}])

9、全部统计 null

　　db.mycol.aggregate([{

push:'$score'}}}])

例子

　　现在在t2集合中有以下数据：

　　{ "country" : "china", "province" : "sh", "userid" : "a" }
　　{ "country" : "china", "province" : "sh", "userid" : "b" }
　　{ "country" : "china", "province" : "sh", "userid" : "a" }
　　{ "country" : "china", "province" : "sh", "userid" : "c" }
　　{ "country" : "china", "province" : "bj", "userid" : "da" }
　　{ "country" : "china", "province" : "bj", "userid" : "fa" }

　　需求是统计出每个country/province下的userid的数量（同一个userid只统计一次）

　　过程如下。

　　首先试着这样来统计：

　　db.t2.aggregate([ {

sum:1}} } ])

　　结果是错误的：