cca*_*444 9 python pandas docker dockerfile python-3.6
我正在尝试创建Docker映像。Dockerfile如下:
# Use the official Python 3.6.5 image
FROM python:3.6.5-alpine3.7
# Set the working directory to /app
WORKDIR /app
# Get the
COPY requirements.txt /app
RUN pip3 install --no-cache-dir -r requirements.txt
# Configuring access to Jupyter
RUN mkdir /notebooks
RUN jupyter notebook --no-browser --ip 0.0.0.0 --port 8888 /notebooks
Run Code Online (Sandbox Code Playgroud)
requirements.txt文件为:
jupyter
numpy==1.14.3
pandas==0.23.0rc2
scipy==1.0.1
scikit-learn==0.19.1
pillow==5.1.1
matplotlib==2.2.2
seaborn==0.8.1
Run Code Online (Sandbox Code Playgroud)
docker build -t standard .当docker尝试安装熊猫时,运行命令给我一个错误。错误如下:
Collecting pandas==0.23.0rc2 (from -r requirements.txt (line 3))
Downloading https://files.pythonhosted.org/packages/46/5c/a883712dad8484ef907a2f42992b122acf2bcecbb5c2aa751d1033908502/pandas-0.23.0rc2.tar.gz (12.5MB)
Complete output from command python setup.py egg_info:
/bin/sh: svnversion: not found
/bin/sh: svnversion: not found
non-existing path in 'numpy/distutils': 'site.cfg'
Could not locate executable gfortran
... (loads of other stuff)
Command "python setup.py egg_info" failed with error code 1 in /tmp/pip-install-xb6f6a5o/pandas/
The command '/bin/sh -c pip3 install --no-cache-dir -r requirements.txt' returned a non-zero code: 1
Run Code Online (Sandbox Code Playgroud)
当我尝试安装较低版本的pandas == 0.22.0时,出现此错误:
Step 5/7 : RUN pip3 install --no-cache-dir -r requirements.txt
---> Running in 5810ea896689
Collecting jupyter (from -r requirements.txt (line 1))
Downloading https://files.pythonhosted.org/packages/83/df/0f5dd132200728a86190397e1ea87cd76244e42d39ec5e88efd25b2abd7e/jupyter-1.0.0-py2.py3-none-any.whl
Collecting numpy==1.14.3 (from -r requirements.txt (line 2))
Downloading https://files.pythonhosted.org/packages/b0/2b/497c2bb7c660b2606d4a96e2035e92554429e139c6c71cdff67af66b58d2/numpy-1.14.3.zip (4.9MB)
Collecting pandas==0.22.0 (from -r requirements.txt (line 3))
Downloading https://files.pythonhosted.org/packages/08/01/803834bc8a4e708aedebb133095a88a4dad9f45bbaf5ad777d2bea543c7e/pandas-0.22.0.tar.gz (11.3MB)
Could not find a version that satisfies the requirement Cython (from versions: )
No matching distribution found for Cython
The command '/bin/sh -c pip3 install --no-cache-dir -r requirements.txt' returned a non-zero code: 1
Run Code Online (Sandbox Code Playgroud)
我也尝试在熊猫之前安装Cyphon和setuptools,但No matching distribution found for Cython在pip3 install pandas行中却给出了相同的错误。
我如何安装熊猫。
我意识到这个问题已经得到回答,但我最近在 dockerized 项目中遇到了类似的 numpy 和 pandas 依赖问题。话虽如此,我希望这会对将来的某人有所帮助。
我的解决方案:
正如Aviv Sela指出的那样,Alpine 默认不包含构建工具,需要通过 Dockerfile 添加。因此,请参阅下面我的 Dockerfile,其中包含在 Alpine 上成功安装容器所需的 numpy 和 pandas 所需的构建包。
FROM python:3.6-alpine3.7
RUN apk add --no-cache --update \
python3 python3-dev gcc \
gfortran musl-dev g++ \
libffi-dev openssl-dev \
libxml2 libxml2-dev \
libxslt libxslt-dev \
libjpeg-turbo-dev zlib-dev
RUN pip install --upgrade pip
ADD requirements.txt .
RUN pip install -r requirements.txt
Run Code Online (Sandbox Code Playgroud)
要求.txt
numpy==1.17.1
pandas==0.25.1
Run Code Online (Sandbox Code Playgroud)
编辑:
在升级 pip RUN 命令之前,将以下(下面的代码片段)添加到 Dockerfile。正如Bishwas Mishra在评论中指出的那样,这对于成功安装 pandas 至关重要。
RUN pip install --upgrade cython
Run Code Online (Sandbox Code Playgroud)
默认情况下,Alpine不包含构建工具。安装构建工具并为语言环境创建符号链接:
$ apk add --update curl gcc g++
$ ln -s /usr/include/locale.h /usr/include/xlocale.h
$ pip install numpy
Run Code Online (Sandbox Code Playgroud)
基于https://wired-world.com/?p=100
我现在可以创建 Docker 镜像了。和 pandas之间肯定存在一些版本不兼容的情况FROM python:3.6.5-alpine3.7。我将Python版本更改为FROM python:3,然后它工作正常(还必须将pillow版本降级为5.1.0)。