为什么在子进程中运行mpirun时Python会挂起?

MRo*_*lin 5 python subprocess mpi4py

我有一个使用mpi4py的非常简单的MPI脚本

# mpitest.py
from mpi4py import MPI
import time

comm = MPI.COMM_WORLD
rank = comm.Get_rank()

time.sleep(100)
Run Code Online (Sandbox Code Playgroud)

如果我使用mpirun正常运行,一切正常

$ mpirun --np 4 python mpitest.py  # just fine
Run Code Online (Sandbox Code Playgroud)

但是,如果我使用子进程模块在Python中运行此程序,则一切都会运行,但是我的解释器会变得很慢

>>> import subprocess
>>> proc = subprocess.Popen(['mpirun', '--np', '2', 'python', 'mpitest.py'])
Run Code Online (Sandbox Code Playgroud)

我已经尝试过类似的关键字参数shell=True

环境

我已经使用最新的Linux Miniconda安装了Python,mpi4py和mpich

mrocklin@carbon:~/workspace/play$ conda list | grep mpi
mpi4py                    2.0.0                    py36_2  
mpich2                    1.4.1p1                       0  
Run Code Online (Sandbox Code Playgroud)

https://conda.io/miniconda.html

可重复的步骤

mrocklin@carbon:~/workspace/play$ conda create -n test-mpi python=3.6 mpi4py
Fetching package metadata .........
Solving package specifications: .

Package plan for installation in environment /home/mrocklin/Software/anaconda/envs/test-mpi:

The following NEW packages will be INSTALLED:

    mpi4py:     2.0.0-py36_2 
    mpich2:     1.4.1p1-0    
    openssl:    1.0.2l-0     
    pip:        9.0.1-py36_1 
    python:     3.6.2-0      
    readline:   6.2-2        
    setuptools: 27.2.0-py36_0
    sqlite:     3.13.0-0     
    tk:         8.5.18-0     
    wheel:      0.29.0-py36_0
    xz:         5.2.3-0      
    zlib:       1.2.11-0     

xz-5.2.3-0.tar 100% |################################| Time: 0:00:00   3.79 MB/s
zlib-1.2.11-0. 100% |################################| Time: 0:00:00   5.68 MB/s
#
# To activate this environment, use:
# > source activate test-mpi
#
# To deactivate an active environment, use:
# > source deactivate
#

mrocklin@carbon:~/workspace/play$ source activate test-mpi
(test-mpi) mrocklin@carbon:~/workspace/play$ python
Python 3.6.2 |Continuum Analytics, Inc.| (default, Jul 20 2017, 13:51:32) 
[GCC 4.4.7 20120313 (Red Hat 4.4.7-1)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import subprocess
>>> proc = subprocess.Popen(['mpirun', '--np', '2', 'python', 'mpitest.py'])
Run Code Online (Sandbox Code Playgroud)

MRo*_*lin 2

stdin=subprocess.DEVNULL这可以通过在调用中添加关键字来解决,subprocess.Popen如下所示:

>>> proc = subprocess.Popen(['mpirun', '--np', '2', 'python', 'mpitest.py'], 
                            stdin=subprocess.DEVNULL)
Run Code Online (Sandbox Code Playgroud)

事实证明,它mpirun稍微劫持了标准输入管道,从而导致许多注定要进入该python进程的击键无法到达。