我们有一个非常大的 CSV 文件,已作为 dask 数据框导入。我举一个小例子来解释这个问题。
import dask.dataframe as dd
df = dd.read_csv("name and path of the file.csv")
df.head()
Run Code Online (Sandbox Code Playgroud)
输出
col1 | col2 | col3 | col4
22 | Nan | 23 | 56
12 | 54 | 22 | 36
48 | Nan | 2 | 45
76 | 32 | 13 | 6
23 | Nan | 43 | 8
67 | 54 | 56 | 64
16 | 32 | 32 | 6
3 | 54 | 64 …Run Code Online (Sandbox Code Playgroud) 我是个新手Power BI,我尝试在其中设置 python 脚本。我已经更改了选项并激活了Python scripting. 之后我收到错误:
DataSource.Error: ADO.NET: Python script error.
Traceback (most recent call last):
File "PythonScriptWrapper.PY", line 2, in <module>
import os, pandas, matplotlib.pyplot
File "C:\Users\mm\AppData\Local\Continuum\anaconda3\lib\site-packages\pandas\__init__.py", line 19, in <module>
"Missing required dependencies {0}".format(missing_dependencies))
ImportError: Missing required dependencies ['numpy']
Details:
DataSourceKind=Python
Run Code Online (Sandbox Code Playgroud)
我检查了conda列表,它包含numpy.
我已经安装numpy并pandas再次收到以下消息
C:\Users\mm\AppData\Local\Continuum\anaconda3>py -m pip install numpy
pip is configured with locations that require TLS/SSL, however the ssl module in Python is not available.
Requirement …Run Code Online (Sandbox Code Playgroud)