ClassNotFound exception when trying to use DataflowRunner

Guy*_*ari 5 java dataflow beam maven google-cloud-dataflow

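I am trying to launch a Dataflow job on GCP using Apache Beam 0.6.0. I am building an uber jar with the shade plugin because I could not launch the job with "mvn exec:java". I include this dependency: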

<dependency>
  <groupId>org.apache.beam</groupId>
  <artifactId>beam-runners-google-cloud-dataflow-java</artifactId>
  <version>0.6.0-SNAPSHOT</version>
</dependency>

I get the following exception:

Exception in thread "main" java.lang.IllegalArgumentException: Unknown 'runner' specified 'DataflowRunner', supported pipeline runners [DirectRunner]
    at org.apache.beam.sdk.options.PipelineOptionsFactory.parseObjects(PipelineOptionsFactory.java:1609)
    at org.apache.beam.sdk.options.PipelineOptionsFactory.access$400(PipelineOptionsFactory.java:104)
    at org.apache.beam.sdk.options.PipelineOptionsFactory$Builder.as(PipelineOptionsFactory.java:289)
    at com.disney.dtss.desa.tools.SpannerSinkTest.main(SpannerSinkTest.java:116)
Caused by: java.lang.ClassNotFoundException: DataflowRunner
    at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:331)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
    at java.lang.Class.forName0(Native Method)
    at java.lang.Class.forName(Class.java:264)
    at org.apache.beam.sdk.options.PipelineOptionsFactory.parseObjects(PipelineOptionsFactory.java:1595)

Is there something else I am missing?

小智 5

Try:

mvn compile exec:java -Dexec.mainClass=YourMainClass -Pdataflow-runner

*Note the -Pdataflow-runner added at the end.

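  • If the dependency is defined inside a profile in `pom.xml`, make sure you pass that profile to the `mvn` command. Apache Beam's default WordCount example does this for "DataflowRunner". If you don't care about profiles, just move the dependency definition into the `<dependencies>` section of the pom file. (2 upvotes)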
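
For illustration, a profile of the kind the comment describes might look roughly like the fragment below. This is a sketch, not the asker's actual pom: the profile id `dataflow-runner` is taken from the `-Pdataflow-runner` flag in the answer, and the dependency coordinates are copied from the question. With this layout the Dataflow runner is only on the classpath when the profile is activated, which would explain why only `DirectRunner` is listed as supported.

```xml
<!-- Sketch only: goes inside the <project> element of pom.xml. -->
<profiles>
  <profile>
    <!-- Activated with: mvn ... -Pdataflow-runner -->
    <id>dataflow-runner</id>
    <dependencies>
      <!-- Same coordinates as in the question; adjust the version to the Beam release in use. -->
      <dependency>
        <groupId>org.apache.beam</groupId>
        <artifactId>beam-runners-google-cloud-dataflow-java</artifactId>
        <version>0.6.0-SNAPSHOT</version>
      </dependency>
    </dependencies>
  </profile>
</profiles>
```

The same `-P` flag applies when building a jar (for example `mvn package -Pdataflow-runner`); otherwise the dependencies declared in the profile are left out of the build.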