Unable to import pyarrow in pyspark

Gal*_*ses 5 pyspark pyarrow

I am trying to use pyarrow with pyspark, but when I try to run

import pyarrow

I get the following error:

In [1]: import pyarrow
---------------------------------------------------------------------------
ImportError                               Traceback (most recent call last)
<ipython-input-1-f1048abcb32d> in <module>
----> 1 import pyarrow

~/opt/anaconda3/lib/python3.7/site-packages/pyarrow/__init__.py in <module>
     47 import pyarrow.compat as compat
     48
---> 49 from pyarrow.lib import cpu_count, set_cpu_count
     50 from pyarrow.lib import (null, bool_,
     51                          int8, int16, int32, int64,

ImportError: dlopen(/Users/user/opt/anaconda3/lib/python3.7/site-packages/pyarrow/lib.cpython-37m-darwin.so, 2): Library not loaded: @rpath/libboost_filesystem.dylib
  Referenced from: /Users/user/opt/anaconda3/lib/libarrow.15.1.0.dylib
  Reason: image not found

I tried installing pyarrow in a conda environment and downgrading to Python 3.6, but neither worked.

Does anyone have a suggestion for fixing this?

Tho*_*tin 3

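It looks like PyArrow was not installed correctly: the traceback shows that libarrow cannot load libboost_filesystem.dylib. Try removing the old package, then install pyarrow again with the following command: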

    conda install -c conda-forge pyarrow
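After reinstalling, it may help to confirm that the import now works from the same interpreter that pyspark uses. The following is only a minimal sketch: `check_import` is a hypothetical helper name introduced here, not part of pyarrow or pyspark, and it assumes that a broken install surfaces as an `ImportError`, as in the traceback in the question.

```python
import importlib
import sys


def check_import(module_name="pyarrow"):
    """Try to import a module and return (ok, detail).

    On success, detail is the module's __version__ (or "unknown").
    On failure, detail is the ImportError message, which for a broken
    native library includes the "Library not loaded" text.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError as exc:
        return False, str(exc)
    return True, getattr(module, "__version__", "unknown")


if __name__ == "__main__":
    # Print the interpreter path so it is clear which environment was tested.
    print("Python executable:", sys.executable)
    ok, detail = check_import("pyarrow")
    if ok:
        print("pyarrow imported, version:", detail)
    else:
        print("pyarrow import failed:", detail)
```

If the printed interpreter path is not the conda environment where pyarrow was reinstalled, the import is probably being tested against a different installation.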