Sou*_*uad 7 python cron ipython cassandra pyspark
我一直在尝试通过pyspark执行脚本.py但我一直收到此错误:
11:55 $ ./bin/spark-submit --jars spark-cassandra-connector-2.0.0-M2-s_2.11.jar --py-files example.py
Exception in thread "main" java.lang.IllegalArgumentException: Missing application resource.
at org.apache.spark.launcher.CommandBuilderUtils.checkArgument(CommandBuilderUtils.java:241)
at org.apache.spark.launcher.SparkSubmitCommandBuilder.buildSparkSubmitArgs(SparkSubmitCommandBuilder.java:160)
at org.apache.spark.launcher.SparkSubmitCommandBuilder.buildSparkSubmitCommand(SparkSubmitCommandBuilder.java:276)
at org.apache.spark.launcher.SparkSubmitCommandBuilder.buildCommand(SparkSubmitCommandBuilder.java:151)
at org.apache.spark.launcher.Main.main(Main.java:86)
Run Code Online (Sandbox Code Playgroud)
我可以通过这样做轻松执行它:
11:57 $ pyspark --jars spark-cassandra-connector-2.0.0-M2-s_2.11.jar
Run Code Online (Sandbox Code Playgroud)
然后在IPython(交互式shell)中逐块粘贴代码.但我想把脚本放在一个cronjob中,以便它可以自动执行.我需要一个命令来放入cronjob并且spark-submit无法正常工作.有任何想法吗?
小智 7
你需要再次将python文件放在最后。
./bin/spark-submit --jars spark-cassandra-connector-2.0.0-M2-s_2.11.jar --py-files example.py example.py
Run Code Online (Sandbox Code Playgroud)