Tags: scala, dataframe, apache-spark, apache-spark-sql
Given:
val df = Seq((1L, "04-04-2015")).toDF("id", "date")
val df2 = df.withColumn("month", from_unixtime(unix_timestamp($"date", "dd/MM/yy"), "MMMMM"))
df2.show()
I get this output:
+---+----------+-----+
| id|      date|month|
+---+----------+-----+
|  1|04-04-2015| null|
+---+----------+-----+
However, I want the output to look like this:
+---+----------+-----+
| id|      date|month|
+---+----------+-----+
|  1|04-04-2015|April|
+---+----------+-----+
How can I do this in Spark SQL using Scala?
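This should do it. The pattern "dd/MM/yy" in the question does not match values such as "04-04-2015", so parsing fails and the column comes back null. Parse with "dd-MM-yyyy" instead, then format the date with "MMMM" to get the full month name: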
val df2 = df.withColumn("month", date_format(to_date($"date", "dd-MM-yyyy"), "MMMM"))
df2.show
+---+----------+-----+
| id|      date|month|
+---+----------+-----+
|  1|04-04-2015|April|
+---+----------+-----+
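For anyone running this outside spark-shell, here is a minimal self-contained sketch of the same approach. It assumes Spark SQL 2.2 or later is on the classpath (the two-argument `to_date` is not available before that). The object name `MonthNameExample`, the helper `withMonthName`, and the local master setting are only illustrative and not part of the original answer.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, date_format, to_date}

// Hypothetical wrapper object, used only to make the snippet runnable on its own.
object MonthNameExample {

  // Adds a "month" column holding the full month name of the "date" column.
  // Assumes "date" is a string column in day-month-year form, e.g. "04-04-2015".
  def withMonthName(df: DataFrame): DataFrame =
    df.withColumn(
      "month",
      // The parse pattern must match the data: dashes and a four-digit year.
      date_format(to_date(col("date"), "dd-MM-yyyy"), "MMMM")
    )

  def main(args: Array[String]): Unit = {
    // Local session for illustration; in spark-shell a session named `spark` already exists.
    val spark = SparkSession.builder()
      .appName("month-name-example")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._ // brings toDF into scope for local Seqs

    val df = Seq((1L, "04-04-2015")).toDF("id", "date")
    withMonthName(df).show()

    spark.stop()
  }
}
```

On older Spark versions without the two-argument `to_date`, keeping the question's `from_unixtime(unix_timestamp(...))` form and only changing the patterns to "dd-MM-yyyy" and "MMMM" should work as well.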
Notes:
Documentation: