nas*_*ass 7 scala cassandra apache-spark
我正在尝试使用spark scala在Cassandra数据库中保存数据集,但我在运行代码时遇到异常:链接使用:http://rustyrazorblade.com/2015/01/introduction-to-spark-cassandra/
error:
could not find implicit value for parameter rwf: com.datastax.spark.connector.writer.RowWriterFctory[FoodToUserIndex]
food_index.saveToCassandra("tutorial", "food_to_user_index")
^
Run Code Online (Sandbox Code Playgroud)
.scala
def main(args: Array[String]): Unit = {
val conf = new SparkConf(true)
.set("spark.cassandra.connection.host", "localhost")
.set("spark.executor.memory", "1g")
.set("spark.cassandra.connection.native.port", "9042")
val sc = new SparkContext(conf)
case class FoodToUserIndex(food: String, user: String)
val user_table = sc.cassandraTable[CassandraRow]("tutorial", "user").select("favorite_food","name")
val food_index = user_table.map(r => new FoodToUserIndex(r.getString("favorite_food"), r.getString("name")))
food_index.saveToCassandra("tutorial", "food_to_user_index")}
Run Code Online (Sandbox Code Playgroud)
build.sbt
name := "intro_to_spark"
version := "1.0"
scalaVersion := "2.11.2"
libraryDependencies += "org.apache.spark" %% "spark-core" % "1.2.0"
libraryDependencies += "com.datastax.spark" %% "spark-cassandra-connector" % "1.2.0-rc3"
Run Code Online (Sandbox Code Playgroud)
如果将scala和cassandra连接器的版本更改为2.10,1.1.0它的工作原理.但我需要使用scala 2.11:
scalaVersion := "2.10.4"
libraryDependencies += "org.apache.spark" %% "spark-core" % "1.2.0"
libraryDependencies += "com.datastax.spark" %% "spark-cassandra-connector" % "1.1.0" withSources() withJavadoc()
Run Code Online (Sandbox Code Playgroud)
小智 1
它与“datastax Spark-cassandra-connector”版本有关,而不是 Scala 版本。
到目前为止,版本 1.2.x 缺少自定义类的保存。
尝试“datastax Spark-cassandra-connector”版本 1.1.1 和 Scala 2.11,它应该可以工作
注意:确保 Spark 也针对 Scala 2.11 进行编译。
| 归档时间: |
|
| 查看次数: |
1799 次 |
| 最近记录: |