我能够在独立模式下运行我的程序。但是当我尝试以集群模式在 Dataproc 中运行时,出现以下错误。PL 有帮助。我的 build.sbt
name := "spark-kafka-streaming"
version := "0.1"
scalaVersion := "2.12.10"
val sparkVersion = "2.4.5"
libraryDependencies += "org.apache.spark" %% "spark-sql" % sparkVersion % "provided"
libraryDependencies += "org.apache.spark" %% "spark-core" % sparkVersion % "provided"
libraryDependencies += "com.google.cloud.bigdataoss" % "gcs-connector" % "hadoop2-2.1.3"
libraryDependencies += "org.apache.spark" %% "spark-sql-kafka-0-10" % sparkVersion
assemblyOption in assembly := (assemblyOption in assembly).value.copy(includeScala = false)
assemblyJarName in assembly := s"${name.value}_${scalaBinaryVersion.value}-${sparkVersion}_${version.value}.jar"
assemblyMergeStrategy in assembly := {
case PathList("org","aopalliance", xs @ _*) => MergeStrategy.last
case PathList("javax", "inject", xs …Run Code Online (Sandbox Code Playgroud)