Jas*_*uch 3 hadoop python-3.x py4j apache-spark pyspark
I installed Spark, but I'm having trouble loading the pyspark module in ipython. I get the following error:
ModuleNotFoundError Traceback (most recent call last)
<ipython-input-2-49d7c4e178f8> in <module>
----> 1 import pyspark
/opt/spark/python/pyspark/__init__.py in <module>
44
45 from pyspark.conf import SparkConf
---> 46 from pyspark.context import SparkContext
47 from pyspark.rdd import RDD
48 from pyspark.files import SparkFiles
/opt/spark/python/pyspark/context.py in <module>
27 from tempfile import NamedTemporaryFile
28
---> 29 from py4j.protocol import Py4JError
30
31 from pyspark import accumulators
ModuleNotFoundError: No module named 'py4j'
小智 8
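If Spark itself runs fine, you probably need to fix the PYTHONPATH environment variable. Check the name of the py4j file in the $SPARK_HOME/python/lib/ directory. For Spark 2.4.3, the file is py4j-0.10.7-src.zip: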
export PYTHONPATH=$SPARK_HOME/python:$SPARK_HOME/python/lib/py4j-0.10.7-src.zip:$PYTHONPATH
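If you would rather not hard-code the py4j version, the same fix can be applied from inside the ipython session. The following is only a sketch: the helper name add_spark_to_sys_path is made up for illustration, and it assumes the zip under $SPARK_HOME/python/lib/ matches the pattern py4j-*-src.zip, as it does for the 2.4.3 file named above.

```python
import glob
import os
import sys


def add_spark_to_sys_path(spark_home):
    """Put Spark's Python sources and its bundled py4j zip on sys.path.

    Hypothetical helper; mirrors the PYTHONPATH export above.
    """
    python_dir = os.path.join(spark_home, "python")
    # The py4j version differs between Spark releases, so match by pattern
    # instead of hard-coding py4j-0.10.7-src.zip.
    pattern = os.path.join(python_dir, "lib", "py4j-*-src.zip")
    py4j_zips = sorted(glob.glob(pattern))
    if not py4j_zips:
        raise FileNotFoundError("no file matching " + pattern)
    added = [python_dir, py4j_zips[-1]]
    for path in added:
        if path not in sys.path:
            sys.path.insert(0, path)
    return added


# Usage (assumes SPARK_HOME is set in the environment):
#   add_spark_to_sys_path(os.environ["SPARK_HOME"])
#   import pyspark
```

This only affects the current Python process. The export line is still the way to go if you want the setting to apply to every new shell.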