Why doesn't this implementation of multiprocessing.Pool work?

fac*_*est 5 python multithreading numpy sympy multiprocessing

Here is the code I am using:

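import os
import numpy as np
import sympy as sy
from multiprocessing import Pool

def initFunction(arg1, arg2):
    # funct is a closure created inside initFunction
    def funct(value):
        return arg1 * arg2 * value
    return funct

# Allow this process to run on CPUs 0-7 (Linux only)
os.system("taskset -p 0xff %d" % os.getpid())
pool = Pool(processes=4)
t = np.linspace(0, 1, int(10e3))  # the number of samples must be an integer

a,b,c,d,e,f,g,h = sy.symbols('a,b,c,d,e,f,g,h',commutative=False)

arg1 = sy.Matrix([[a,b],[c,d]])
arg2 = sy.Matrix([[e,f],[g,h]])
myFunct = initFunction(arg1, arg2)

m3 = map(myFunct,t) # this works
m4 = pool.map(myFunct,t) # this does NOT work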

The error I get is:

Traceback (most recent call last):
   File "<stdin>", line 1, in <module>
   File "/usr/lib/python2.7/dist-packages/spyderlib/widgets/externalshell/sitecustomize.py", line 540, in runfile
      execfile(filename, namespace)
   File "/home/justin/Research/mapTest.py", line 46, in <module>
      m4 = pool.map(myFunct,t) 
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 251, in map
      return self.map_async(func, iterable, chunksize).get()
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 558, in get
      raise self._value
cPickle.PicklingError: Can't pickle <type 'function'>: attribute lookup __builtin__.function failed

So what does this error mean, and how can I run this map call with multiprocessing?

dan*_*ano 7

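Objects that you pass between processes when using multiprocessing must be picklable. Functions are pickled by reference (module name plus function name), so they have to be defined at the top level of a module, such as __main__, for the child process to find them again when unpickling. The nested function funct cannot be looked up from __main__ that way, so you get that error. You can achieve what you're trying to do with functools.partial: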

from multiprocessing import Pool
from functools import partial

def funct(arg1, arg2, value):
    return arg1 * arg2 * value


if __name__ == "__main__":
    t = [1,2,3,4]
    arg1 = 4 
    arg2 = 5 

    pool = Pool(processes=4)
    func = partial(funct, arg1, arg2)
    m4 = pool.map(func,t)
    print(m4)

Output:

[20, 40, 60, 80]
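
If you want to check the difference yourself without starting a pool, you can try pickling both kinds of callable directly, since that is what pool.map has to do with the function it is given. This is only a minimal sketch: can_pickle and init_function are hypothetical names introduced for illustration, and plain integers stand in for the sympy matrices from the question.

```python
import pickle
from functools import partial


def funct(arg1, arg2, value):
    # Module-level function: pickle stores only a reference (module + name).
    return arg1 * arg2 * value


def init_function(arg1, arg2):
    # Returns a closure, like initFunction in the question.
    def nested(value):
        return arg1 * arg2 * value
    return nested


def can_pickle(obj):
    """Hypothetical helper: report whether pickle.dumps accepts obj."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError, TypeError):
        # The exact exception type differs between Python versions.
        return False


if __name__ == "__main__":
    closure = init_function(4, 5)
    bound = partial(funct, 4, 5)

    # Both callables compute the same thing in the parent process...
    print(closure(3), bound(3))

    # ...but only one of them can be sent to a worker process.
    print("closure picklable:", can_pickle(closure))
    print("partial picklable:", can_pickle(bound))
```

Run as a script, I would expect the closure to be reported as not picklable and the partial as picklable. The partial works because it only carries a reference to the top-level funct plus the two bound arguments, so the arguments themselves (here integers, in the question sympy matrices) also have to be picklable.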