小编Sam*_*Sam的帖子

我是 HIVE 和 SPARK 的新手。

考虑我在 SQL 中有以下查询。

select col1, col2, min(col3), first(col4) from tablename group by col1, col2

因为我不想将 col4 包含在组中，所以我首先采用了（col4）（但我希望显示 col4）

我想在 Hive 中编写相同的查询，但在 Hive 中没有第一个函数。

参考：https : //docs.treasuredata.com/articles/hive-aggregate-functions
我想在Spark SQL 中编写相同的查询（使用数据帧）。同样，在 spark 聚合函数中也没有第一个函数。(* 可用的聚合方法有avg, max, min, sum, count. *)