小编gay*_*hri的帖子

spark scala - 将多个行合并为一个

我有一个数据帧

|--id:string (nullable = true)
|--ddd:struct (nullable = true)
  |-- aaa: string (nullable = true)
  |-- bbb: long(nullable = true)
  |-- ccc: string (nullable = true)
  |-- eee: long(nullable = true)
Run Code Online (Sandbox Code Playgroud)

我有这样的输出

 id     |  ddd
--------------------------
   1    | [hi,1,this,2]
   2    | [hello,6,good,3]
   1    | [hru,2,where,7]
   3    | [in,4,you,1]
   2    | [how,4,to,3]
Run Code Online (Sandbox Code Playgroud)

我希望预期的o/p为:

   id   |  ddd
  --------------------
   1    | [hi,1,this,2],[hru,2,where,7]
   2    | [hello,6,good,3],[how,4,to,3]
   3    | [in,4,you,1]
Run Code Online (Sandbox Code Playgroud)

请帮忙

scala apache-spark apache-spark-1.6

1
推荐指数
1
解决办法
3151
查看次数

Spark scala重命名地图列

我想将重命名key下图作为name,_1rownum,_2status

  root
  |-- id: string (nullable = true)
  |-- info: map (nullable = true)
  |    |-- key: string
  |    |-- value: struct (valueContainsNull = true)
  |    |    |-- _1: long (nullable = false)
  |    |    |-- _2: string (nullable = true)
Run Code Online (Sandbox Code Playgroud)

请帮忙

scala apache-spark

-1
推荐指数
1
解决办法
524
查看次数

标签 统计

apache-spark ×2

scala ×2

apache-spark-1.6 ×1