Subtracting two columns in a Spark DataFrame when one contains nulls

I'm just getting started with Spark, and I have a DataFrame df:

+----------+------------+-----------+
| Column1  | Column2    | Sub       |                          
+----------+------------+-----------+
| 1        | 2          | 1         |                                         
+----------+------------+-----------+
| 4        | null       | null      |                          
+----------+------------+-----------+
| 5        | null       | null      |                          
+----------+------------+-----------+
| 6        | 8          | 2         |                          
+----------+------------+-----------+

When I subtract the two columns, the result is null in every row where one of the columns is null.

df.withColumn("Sub", col(A)-col(B))

The expected output is:

+----------+------------+-----------+
|  Column1 | Column2    | Sub       |                          
+----------+------------+-----------+
| 1        | 2          | 1         |                                           
+----------+------------+-----------+
| 4        | null       | 4         |                          
+----------+------------+-----------+
| 5        | null       | 5         |                          
+----------+------------+-----------+
| …
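
One common way to get a non-null result here is to replace null with 0 before subtracting, using `coalesce`. The following is only a sketch under assumptions: the helper name `subtractNullAsZero` is hypothetical, and since the question writes the operands as the placeholders `A` and `B`, the choice of `Column1 - Column2` below is a guess at the intended order.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{coalesce, col, lit}

object NullSafeSubtract {
  // Hypothetical helper (not from the question): computes left - right,
  // treating a null in either column as 0 so the result is never null.
  def subtractNullAsZero(df: DataFrame, left: String, right: String, out: String): DataFrame =
    df.withColumn(out, coalesce(col(left), lit(0)) - coalesce(col(right), lit(0)))

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("null-safe-subtract")
      .getOrCreate()
    import spark.implicits._

    // Sample rows from the question; None becomes null in Column2.
    val df = Seq[(Int, Option[Int])]((1, Some(2)), (4, None), (5, None), (6, Some(8)))
      .toDF("Column1", "Column2")

    // Assumed operand order: Column1 - Column2.
    subtractNullAsZero(df, "Column1", "Column2", "Sub").show()

    spark.stop()
  }
}
```

With this operand order, the rows where Column2 is null give 4 and 5, as in the expected output. The sign of the other rows depends on which column is subtracted from which, so swap the `left` and `right` arguments if the opposite order is intended.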

scala apache-spark apache-spark-sql

2
votes
2
answers
8348
views
