我通过groupby column1和date在Spark中创建了一个数据框,并计算了数量.
val table = df1.groupBy($"column1",$"date").sum("amount")
Run Code Online (Sandbox Code Playgroud)
Column1 |Date |Amount
A |1-jul |1000
A |1-june |2000
A |1-May |2000
A |1-dec |3000
A |1-Nov |2000
B |1-jul |100
B |1-june |300
B |1-May |400
B |1-dec |300
Run Code Online (Sandbox Code Playgroud)
现在,我想添加新列,表中任意两个日期的数量之间存在差异.