我正在设置GeoSpark Python,在安装所有先决条件后,我正在运行非常基本的代码示例来测试它。
from pyspark.sql import SparkSession
from geo_pyspark.register import GeoSparkRegistrator
spark = SparkSession.builder.\
getOrCreate()
GeoSparkRegistrator.registerAll(spark)
df = spark.sql("""SELECT st_GeomFromWKT('POINT(6.0 52.0)') as geom""")
df.show()
Run Code Online (Sandbox Code Playgroud)
我尝试使用python3 basic.pyand运行它spark-submit basic.py,两者都给我这个错误:
Traceback (most recent call last):
File "/home/jessica/Downloads/geo_pyspark/basic.py", line 8, in <module>
GeoSparkRegistrator.registerAll(spark)
File "/home/jessica/Downloads/geo_pyspark/geo_pyspark/register/geo_registrator.py", line 22, in registerAll
cls.register(spark)
File "/home/jessica/Downloads/geo_pyspark/geo_pyspark/register/geo_registrator.py", line 27, in register
spark._jvm. \
TypeError: 'JavaPackage' object is not callable
Run Code Online (Sandbox Code Playgroud)
我正在使用 Java 8、Python 3、Apache Spark 2.4,我JAVA_HOME的设置正确,我正在运行 Linux Mint 19。我SPARK_HOME的也设置为:
$ printenv …Run Code Online (Sandbox Code Playgroud)