从 pool.map 进程返回多个列表?

DrB*_*wts 8 python threadpool python-multiprocessing

Win 7、x64、Python 2.7.12

在下面的代码中,我将设置一些池进程以通过该multiprocessing.Pool.map()方法进行简单的乘法运算。输出数据收集在List_1.

注意:这是对我的实际代码的精简简化。实际应用中涉及到多个列表,个个都很大。

import multiprocessing
import numpy as np

def createLists(branches):

    firstList = branches[:] * node

    return firstList


def init_process(lNodes):

    global node
    node = lNodes
    print 'Starting', multiprocessing.current_process().name


if __name__ == '__main__':

    mgr = multiprocessing.Manager()
    nodes = mgr.list()
    pool_size = multiprocessing.cpu_count()

    branches = [i for i in range(1, 21)]
    lNodes = 10
    splitBranches = np.array_split(branches, int(len(branches)/pool_size))

    pool = multiprocessing.Pool(processes=pool_size, initializer=init_process, initargs=[lNodes])
    myList_1 = pool.map(createLists, splitBranches)

    pool.close() 
    pool.join()  
Run Code Online (Sandbox Code Playgroud)

我现在添加一个额外的计算createLists()并尝试传回两个列表。

import multiprocessing
import numpy as np

def createLists(branches):

    firstList = branches[:] * node
    secondList = branches[:] * node * 2

    return firstList, secondList


def init_process(lNodes):
    global node
    node = lNodes
    print 'Starting', multiprocessing.current_process().name


if __name__ == '__main__':

    mgr = multiprocessing.Manager()
    nodes = mgr.list()
    pool_size = multiprocessing.cpu_count()

    branches = [i for i in range(1, 21)]
    lNodes = 10
    splitBranches = np.array_split(branches, int(len(branches)/pool_size))

    pool = multiprocessing.Pool(processes=pool_size, initializer=init_process, initargs=[lNodes])
    myList_1, myList_2 = pool.map(createLists, splitBranches)

    pool.close() 
    pool.join() 
Run Code Online (Sandbox Code Playgroud)

这引发了跟随错误和回溯..

Traceback (most recent call last):

  File "<ipython-input-6-ff188034c708>", line 1, in <module>
    runfile('C:/Users/nr16508/Local Documents/Inter Trab Angle/Parallel/scratchpad.py', wdir='C:/Users/nr16508/Local Documents/Inter Trab Angle/Parallel')

  File "C:\Users\nr16508\AppData\Local\Continuum\Anaconda2\lib\site-packages\spyder\utils\site\sitecustomize.py", line 866, in runfile
    execfile(filename, namespace)

  File "C:\Users\nr16508\AppData\Local\Continuum\Anaconda2\lib\site-packages\spyder\utils\site\sitecustomize.py", line 87, in execfile
    exec(compile(scripttext, filename, 'exec'), glob, loc)

  File "C:/Users/nr16508/Local Documents/Inter Trab Angle/Parallel/scratchpad.py", line 36, in <module>
    myList_1, myList_2 = pool.map(createLists, splitBranches)

ValueError: too many values to unpack
Run Code Online (Sandbox Code Playgroud)

当我尝试将两个列表合二为一以传回时,即...

return [firstList, secondList]
......
myList = pool.map(createLists, splitBranches)
Run Code Online (Sandbox Code Playgroud)

...输出变得过于混乱,无法进一步处理。

是否有一种方法可以从池化进程中收集多个列表?

don*_*mus 11

这个问题与多处理或线程池无关。它只是关于如何解压缩列表,这可以用标准zip(*...)习语来完成。

myList_1, myList_2 = zip(*pool.map(createLists, splitBranches))
Run Code Online (Sandbox Code Playgroud)