Related troubleshooting solutions (0)

pyspark: Method isBarrier([]) does not exist

I'm trying to learn Spark with pyspark by following a hello-world level example such as the one below. I get a "Method isBarrier([]) does not exist" error; the full error is included below the code.

from pyspark import SparkContext

if __name__ == '__main__':
    sc = SparkContext('local[6]', 'pySpark_pyCharm')
    rdd = sc.parallelize([1, 2, 3, 4, 5, 6, 7, 8])
    rdd.collect()
    rdd.count()

[Image: screenshot of the full error output]

However, when I start a pyspark session directly from the command line and type in the same code, it works fine:

[Image: screenshot of the same code running in the pyspark shell]

My setup:

  • Windows 10 Pro …
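
Since the same code works in the pyspark shell but fails when run as a script, one possibility worth ruling out is that the script's interpreter imports a pyspark package whose version differs from the Spark installation the shell uses. The sketch below is a diagnostic only, not a confirmed fix. It assumes that SPARK_HOME is set and that the installation has a jars directory containing a file named like spark-core_<scala version>-<spark version>.jar. The helper names are hypothetical.

```python
import os
import re
from importlib import metadata
from pathlib import Path
from typing import Optional


def installed_pyspark_version() -> Optional[str]:
    """Version of the pyspark package visible to this interpreter, or None."""
    try:
        return metadata.version("pyspark")
    except metadata.PackageNotFoundError:
        return None


def spark_home_version(spark_home: Optional[str]) -> Optional[str]:
    """Guess the Spark version from the spark-core jar under <spark_home>/jars.

    Assumes the jar is named spark-core_<scala>-<spark>.jar.
    """
    if not spark_home:
        return None
    jars = Path(spark_home) / "jars"
    if not jars.is_dir():
        return None
    for jar in jars.glob("spark-core_*.jar"):
        match = re.match(r"spark-core_[\d.]+-(\d+\.\d+\.\d+)", jar.name)
        if match:
            return match.group(1)
    return None


def same_minor(a: str, b: str) -> bool:
    """True when both versions share the same major.minor prefix."""
    return a.split(".")[:2] == b.split(".")[:2]


if __name__ == "__main__":
    py_ver = installed_pyspark_version()
    jvm_ver = spark_home_version(os.environ.get("SPARK_HOME"))
    print("pyspark package:", py_ver)
    print("SPARK_HOME jars:", jvm_ver)
    if py_ver and jvm_ver and not same_minor(py_ver, jvm_ver):
        print("Versions differ; the Python side may call JVM methods that do not exist.")
```

Running this with the same interpreter that PyCharm uses, and again from the command line, shows whether the two environments see the same versions.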

python apache-spark pyspark

4
Votes
1
Solution
845
Views

Method showString([class java.lang.Integer, class java.lang.Integer, class java.lang.Boolean]) does not exist in PySpark

Here is the code snippet:

from pyspark import SparkContext
from pyspark.sql.session import SparkSession

sc = SparkContext()
spark = SparkSession(sc)
d = spark.read.format("csv").option("header", True).option("inferSchema", True).load('file.csv')
d.show()

After running it, I get this error:

An error occurred while calling o163.showString. Trace:
py4j.Py4JException: Method showString([class java.lang.Integer, class java.lang.Integer, class java.lang.Boolean]) does not exist

All other methods work fine. I have done a lot of research, but to no avail. Any pointers would be greatly appreciated.
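
When reading a Py4J "does not exist" message like the one above, it can help to separate the method name from the Java argument types the Python side passed, since that is the exact signature the JVM side failed to find. The following is a minimal sketch using only a regular expression over the message text. The function name parse_missing_method is hypothetical, and the pattern assumes the message format shown in this question.

```python
import re
from typing import List, Optional, Tuple

# Matches messages of the form: Method name([class A, class B]) does not exist
PATTERN = re.compile(r"Method (\w+)\(\[(.*?)\]\) does not exist")


def parse_missing_method(message: str) -> Optional[Tuple[str, List[str]]]:
    """Return (method name, list of Java argument types), or None if no match."""
    match = PATTERN.search(message)
    if match is None:
        return None
    name, raw_args = match.group(1), match.group(2)
    # Drop the leading "class " prefix from each argument type.
    args = [
        part.strip().replace("class ", "", 1)
        for part in raw_args.split(",")
        if part.strip()
    ]
    return name, args


if __name__ == "__main__":
    msg = (
        "py4j.Py4JException: Method showString([class java.lang.Integer, "
        "class java.lang.Integer, class java.lang.Boolean]) does not exist"
    )
    print(parse_missing_method(msg))
```

Here the parsed signature has three arguments. If the Spark installation on the JVM side only offers a showString with a different argument list, that would be consistent with the Python package and the Spark installation being different versions, though the question itself does not confirm this.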

java py4j apache-spark apache-spark-sql pyspark

3
Votes
1
Solution
1712
Views

Tag statistics

apache-spark ×2

pyspark ×2

apache-spark-sql ×1

java ×1

py4j ×1

python ×1