我正在尝试计算出一个组中的多数子组。举个例子,假设我的表如下所示:
+--------------------------------------------------+
| city | car_colour | car_type | qty |
+--------------------------------------------------+
| ------------------------------------------------ |
| manchester | Red | Sports | 7 |
| manchester | Red | 4x4 | 9 |
| manchester | Blue | 4x4 | 8 |
| london | Red | Sports | 2 |
| london | Blue | 4x4 | 3 |
| leeds | Red | Sports | 5 |
| leeds | Blue | Sports | 6 |
| leeds | Blue | 4X4 | 1 |
+--------------------------------------------------+
Run Code Online (Sandbox Code Playgroud)
我试图找到一个纯sql解决方案,以便我可以看到:在每个城市,哪种颜色的汽车数量最多。
我可以:
select city, cars, sum(qty)
from table
group by city, cars
Run Code Online (Sandbox Code Playgroud)
要得到:
+------------+------+----+
| manchester | red | 16 |
| manchester | blue | 8 |
| london | red | 2 |
| london | blue | 3 |
| leeds | red | 5 |
| leeds | blue | 7 |
+------------+------+----+
Run Code Online (Sandbox Code Playgroud)
但无论如何我可以使用子查询来获取结果的最大值,这将返回每个城市的最大颜色,因此结果将显示:
+------------+------+
| manchester | red |
| london | blue |
| leeds | blue |
+------------+------+
Run Code Online (Sandbox Code Playgroud)
我可以在 Python 脚本中执行这些计算,但更喜欢纯 SQL 解决方案。
希望这是有道理的,感谢您提前提供的任何帮助:)
汤米
select distinct p.city, p.car_colour,sq.qty as qty
from ( select t.car_colour,t.city, sum(t.qty) as qty
from table1 t
group by t.car_colour,t.city
)p
join ( select q.city,max(q.qty) qty from
( select t.car_colour,t.city, sum(t.qty) as qty
from table1 t
group by t.car_colour,t.city
)q
group by q.city
)sq
on p.city=sq.city and p.qty=sq.qty
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
25163 次 |
| 最近记录: |