Jee*_*evs 3 avro apache-spark apache-spark-sql
我无法使用spark-avro库读取spark文件.以下是我采取的步骤:
spark-shell --jars avro/spark-avro_2.10-0.1.jar
执行git自述文件中给出的命令:
import com.databricks.spark.avro._
import org.apache.spark.sql.SQLContext
val sqlContext = new SQLContext(sc)
val episodes = sqlContext.avroFile("episodes.avro")
Run Code Online (Sandbox Code Playgroud)操作sqlContext.avroFile("episodes.avro")失败,并显示以下错误:
scala> val episodes = sqlContext.avroFile("episodes.avro")
java.lang.IncompatibleClassChangeError: class com.databricks.spark.avro.AvroRelation has interface org.apache.spark.sql.sources.TableScan as super class
at java.lang.ClassLoader.defineClass1(Native Method)
at java.lang.ClassLoader.defineClass(ClassLoader.java:760)
at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
at java.net.URLClassLoader.defineClass(URLClassLoader.java:467)
at java.net.URLClassLoader.access$100(URLClassLoader.java:73)
at java.net.URLClassLoader$1.run(URLClassLoader.java:368)
Run Code Online (Sandbox Code Playgroud)我的错.自述文件清楚地说:
Versions
Spark changed how it reads / writes data in 1.4, so please use the correct version of this dedicated for your spark version
1.3 -> 1.0.0
1.4+ -> 1.1.0-SNAPSHOT
Run Code Online (Sandbox Code Playgroud)
我用spark:1.3.1
和spark-avro : 1.1.0
. 当我使用spark-avro时1.0.0
,它起作用了.
归档时间: |
|
查看次数: |
3645 次 |
最近记录: |