为什么我的(新手)代码这么慢?

itz*_*tzy 9 python performance profiler

我正在学习python,并且遇到了这个我以前见过的模型模拟的例子.其中一个功能看起来不必要很长,所以我认为尝试提高效率是一种好习惯.我的尝试,虽然需要更少的代码,大约快1/60.是的,我做了60倍.

我的问题是,我哪里出错了?我已经尝试计算函数的各个部分,但没有看到瓶颈在哪里.

这是原始功能.这是一个人们生活在网格上的模型,他们的幸福取决于他们是否与大多数邻居一样.(这是谢林的隔离模型.)所以我们给一个人一个x,y坐标,并通过检查他们每个邻居的种族来确定他们的幸福.

def is_unhappy(self, x, y):

    race = self.agents[(x,y)]
    count_similar = 0
    count_different = 0

    if x > 0 and y > 0 and (x-1, y-1) not in self.empty_houses:
        if self.agents[(x-1, y-1)] == race:
            count_similar += 1
        else:
            count_different += 1
    if y > 0 and (x,y-1) not in self.empty_houses:
        if self.agents[(x,y-1)] == race:
            count_similar += 1
        else:
            count_different += 1
    if x < (self.width-1) and y > 0 and (x+1,y-1) not in self.empty_houses:
        if self.agents[(x+1,y-1)] == race:
            count_similar += 1
        else:
            count_different += 1
    if x > 0 and (x-1,y) not in self.empty_houses:
        if self.agents[(x-1,y)] == race:
            count_similar += 1
        else:
            count_different += 1        
    if x < (self.width-1) and (x+1,y) not in self.empty_houses:
        if self.agents[(x+1,y)] == race:
            count_similar += 1
        else:
            count_different += 1
    if x > 0 and y < (self.height-1) and (x-1,y+1) not in self.empty_houses:
        if self.agents[(x-1,y+1)] == race:
            count_similar += 1
        else:
            count_different += 1        
    if x > 0 and y < (self.height-1) and (x,y+1) not in self.empty_houses:
        if self.agents[(x,y+1)] == race:
            count_similar += 1
        else:
            count_different += 1        
    if x < (self.width-1) and y < (self.height-1) and (x+1,y+1) not in self.empty_houses:
        if self.agents[(x+1,y+1)] == race:
            count_similar += 1
        else:
            count_different += 1

    if (count_similar+count_different) == 0:
        return False
    else:
        return float(count_similar)/(count_similar+count_different) < self.similarity_threshold 
Run Code Online (Sandbox Code Playgroud)

这是我的代码,正如我所说,它的速度慢得多.我想通过创建一个"偏移"列表来添加到每个人的坐标以确定可能的邻居的位置,检查这是否是有效位置,然后检查邻居的竞赛,以避免上面的所有if语句.

def is_unhappy2(self, x, y):
    thisRace = self.agents[(x,y)]
    count_same = 0
    count_other = 0

    for xo, yo in list(itertools.product([-1,0,1],[-1,0,1])):
        if xo==0 and yo==0:
            # do nothing for case of no offset
            next
        else:
            # check if there's a neighbor at the offset of (xo, yo)
            neighbor = tuple(np.add( (x,y), (xo,yo) ))
            if neighbor in self.agents.keys():
                if self.agents[neighbor] == thisRace:
                    count_same += 1
                else:
                    count_other += 1
    if count_same+count_other == 0:
        return False
    else:
        return float(count_same) / (count_same + count_other) < self.similarity threshold
Run Code Online (Sandbox Code Playgroud)

(创建该类的其余代码位于示例来自的站点上.)

以下是时间结果:

%timeit s.is_unhappy2(49,42)
100 loops, best of 3: 5.99 ms per loop

%timeit s.is_unhappy(49,42)
10000 loops, best of 3: 103 µs per loop
Run Code Online (Sandbox Code Playgroud)

我希望有python知识的人可以立即看到我做错了什么而不必深入了解其余代码的细节.你能明白为什么我的代码比原版差得多吗?

Ste*_*ann 8

不要使用np.add,只需使用neighbor = (x+xo, y+yo).这应该会更快(在我的小测试中快10倍).

你也可以...

  • if neighbor in self.agents:没有问.keys()
  • 遗漏了 list
  • 检查xo or yo并没有空的if块
  • 避免在自我代理中双重查找邻居

结果:

for xo, yo in itertools.product([-1,0,1],[-1,0,1]):
    if xo or yo:
        neighbor = self.agents.get((x+xo, y+yo))
        if neighbor is not None:
            if neighbor == thisRace:
                count_same += 1
            else:
                count_other += 1
Run Code Online (Sandbox Code Playgroud)

你可以添加

self.neighbor_deltas = tuple(set(itertools.product([-1,0,1],[-1,0,1])) - {(0, 0)})
Run Code Online (Sandbox Code Playgroud)

到类初始化程序然后你的函数可以只使用那些预先计算的增量:

for xo, yo in self.neighbor_deltas:
    neighbor = self.agents.get((x+xo, y+yo))
    if neighbor is not None:
        if neighbor == thisRace:
            count_same += 1
        else:
            count_other += 1
Run Code Online (Sandbox Code Playgroud)

恭喜决定改进该作者的可笑重复代码,顺便说一下.