小编Jim*_*yan的帖子

如何在数据集中使用java.time.LocalDate(使用java.lang.UnsupportedOperationException失败:找不到编码器)?

  • Spark 2.1.1
  • 斯卡拉2.11.8
  • Java 8
  • Linux Ubuntu 16.04 LTS

我想将我的RDD转换为数据集.对于这一点,我用的implicits方法toDS()是给我下面的错误:

Exception in thread "main" java.lang.UnsupportedOperationException: No Encoder found for java.time.LocalDate
- field (class: "java.time.LocalDate", name: "date")
- root class: "observatory.TemperatureRow"
    at org.apache.spark.sql.catalyst.ScalaReflection$.org$apache$spark$sql$catalyst$ScalaReflection$$serializerFor(ScalaReflection.scala:602)
    at org.apache.spark.sql.catalyst.ScalaReflection$$anonfun$9.apply(ScalaReflection.scala:596)
    at org.apache.spark.sql.catalyst.ScalaReflection$$anonfun$9.apply(ScalaReflection.scala:587)
    at scala.collection.TraversableLike$$anonfun$flatMap$1.apply(TraversableLike.scala:241)
    at scala.collection.TraversableLike$$anonfun$flatMap$1.apply(TraversableLike.scala:241)
    at scala.collection.immutable.List.foreach(List.scala:381)
    at scala.collection.TraversableLike$class.flatMap(TraversableLike.scala:241)
    at scala.collection.immutable.List.flatMap(List.scala:344)
Run Code Online (Sandbox Code Playgroud)

在我的情况下,我必须使用类型java.time.LocalDate,我不能使用java.sql.data.我已经读过我需要将informe Spark变换为Java类型转换为Sql类型,我这个方向,我构建了下面的2个implicits函数:

implicit def toSerialized(t: TemperatureRow): EncodedTemperatureRow = EncodedTemperatureRow(t.date.toString, t.location, t.temperature)
implicit def fromSerialized(t: EncodedTemperatureRow): TemperatureRow = TemperatureRow(LocalDate.parse(t.date), t.location, t.temperature)
Run Code Online (Sandbox Code Playgroud)

下面,我的应用程序的一些代码:

case class Location(lat: Double, lon: …
Run Code Online (Sandbox Code Playgroud)

scala apache-spark apache-spark-sql

11
推荐指数
1
解决办法
6949
查看次数

标签 统计

apache-spark ×1

apache-spark-sql ×1

scala ×1