And*_*rew 8 python optimization numpy matplotlib scipy
帮助我的代码更快:我的python代码需要生成一个落在边界矩形内的2D点阵.我把一些生成这个点阵的代码(如下所示)拼凑在了一起.但是,这个函数被多次调用,并且已经成为我应用程序中的一个严重瓶颈.
我确信有更快的方法可以做到这一点,可能涉及numpy数组而不是列表.有什么建议可以更快,更优雅地做到这一点吗?
功能描述:我有两个2D矢量,v1和v2.这些矢量定义了一个晶格.在我的例子中,我的矢量定义了一个几乎但不完全是六边形的格子.我想在这个晶格上生成一些边界矩形中的所有2D点的集合.在我的例子中,矩形的一个角是(0,0),其他角是正坐标.
示例:如果我的边界矩形的远角位于(3,3),并且我的点阵向量是:
v1 = (1.2, 0.1)
v2 = (0.2, 1.1)
Run Code Online (Sandbox Code Playgroud)
我希望我的函数返回点:
(1.2, 0.1) #v1
(2.4, 0.2) #2*v1
(0.2, 1.1) #v2
(0.4, 2.2) #2*v2
(1.4, 1.2) #v1 + v2
(2.6, 1.3) #2*v1 + v2
(1.6, 2.3) #v1 + 2*v2
(2.8, 2.4) #2*v1 + 2*v2
Run Code Online (Sandbox Code Playgroud)
我不关心边缘情况; 例如,函数返回(0,0)无关紧要.
我目前正在做的缓慢的方式:
import numpy, pylab
def generate_lattice( #Help me speed up this function, please!
image_shape, lattice_vectors, center_pix='image', edge_buffer=2):
##Preprocessing. Not much of a bottleneck:
if center_pix == 'image':
center_pix = numpy.array(image_shape) // 2
else: ##Express the center pixel in terms of the lattice vectors
center_pix = numpy.array(center_pix) - (numpy.array(image_shape) // 2)
lattice_components = numpy.linalg.solve(
numpy.vstack(lattice_vectors[:2]).T,
center_pix)
lattice_components -= lattice_components // 1
center_pix = (lattice_vectors[0] * lattice_components[0] +
lattice_vectors[1] * lattice_components[1] +
numpy.array(image_shape)//2)
num_vectors = int( ##Estimate how many lattice points we need
max(image_shape) / numpy.sqrt(lattice_vectors[0]**2).sum())
lattice_points = []
lower_bounds = numpy.array((edge_buffer, edge_buffer))
upper_bounds = numpy.array(image_shape) - edge_buffer
##SLOW LOOP HERE. 'num_vectors' is often quite large.
for i in range(-num_vectors, num_vectors):
for j in range(-num_vectors, num_vectors):
lp = i * lattice_vectors[0] + j * lattice_vectors[1] + center_pix
if all(lower_bounds < lp) and all(lp < upper_bounds):
lattice_points.append(lp)
return lattice_points
##Test the function and display the output.
##No optimization needed past this point.
lattice_vectors = [
numpy.array([-40., -1.]),
numpy.array([ 18., -37.])]
image_shape = (1000, 1000)
spots = generate_lattice(image_shape, lattice_vectors)
fig=pylab.figure()
pylab.plot([p[1] for p in spots], [p[0] for p in spots], '.')
pylab.axis('equal')
fig.show()
Run Code Online (Sandbox Code Playgroud)
由于lower_bounds
并且upper_bounds
只是2元素数组,numpy可能不是这里的正确选择.尝试更换
if all(lower_bounds < lp) and all(lp < upper_bounds):
Run Code Online (Sandbox Code Playgroud)
基本的Python东西:
if lower1 < lp and lower2 < lp and lp < upper1 and lp < upper2:
Run Code Online (Sandbox Code Playgroud)
根据timeit,第二种方法要快得多:
>>> timeit.timeit('all(lower < lp)', 'import numpy\nlp=4\nlower = numpy.array((1,5))')
3.7948939800262451
>>> timeit.timeit('lower1 < 4 and lower2 < 4', 'lp = 4\nlower1, lower2 = 1,5')
0.074192047119140625
Run Code Online (Sandbox Code Playgroud)
根据我的经验,只要你不需要处理n-diminsional数据,如果你不需要双精度浮点数,通常使用基本的Python数据类型和构造而不是numpy更快,这有点过载在这种情况下 - 看看这个其他问题.
另一个小改进可能是range(-num_vectors, num_vectors)
只计算一次然后重复使用它.另外,您可能希望使用产品迭代器而不是嵌套for循环 - 尽管我认为这些更改不会对性能产生重大影响.
如果你想对整个事物进行矢量化,请生成一个方形晶格,然后对其进行剪切。然后切掉盒子外面的边缘。
这是我想出的。虽然还有很多可以改进的地方,但这是基本的想法。
def generate_lattice(image_shape, lattice_vectors) :
center_pix = numpy.array(image_shape) // 2
# Get the lower limit on the cell size.
dx_cell = max(abs(lattice_vectors[0][0]), abs(lattice_vectors[1][0]))
dy_cell = max(abs(lattice_vectors[0][1]), abs(lattice_vectors[1][1]))
# Get an over estimate of how many cells across and up.
nx = image_shape[0]//dx_cell
ny = image_shape[1]//dy_cell
# Generate a square lattice, with too many points.
# Here I generate a factor of 4 more points than I need, which ensures
# coverage for highly sheared lattices. If your lattice is not highly
# sheared, than you can generate fewer points.
x_sq = np.arange(-nx, nx, dtype=float)
y_sq = np.arange(-ny, nx, dtype=float)
x_sq.shape = x_sq.shape + (1,)
y_sq.shape = (1,) + y_sq.shape
# Now shear the whole thing using the lattice vectors
x_lattice = lattice_vectors[0][0]*x_sq + lattice_vectors[1][0]*y_sq
y_lattice = lattice_vectors[0][1]*x_sq + lattice_vectors[1][1]*y_sq
# Trim to fit in box.
mask = ((x_lattice < image_shape[0]/2.0)
& (x_lattice > -image_shape[0]/2.0))
mask = mask & ((y_lattice < image_shape[1]/2.0)
& (y_lattice > -image_shape[1]/2.0))
x_lattice = x_lattice[mask]
y_lattice = y_lattice[mask]
# Translate to the centre pix.
x_lattice += center_pix[0]
y_lattice += center_pix[1]
# Make output compatible with original version.
out = np.empty((len(x_lattice), 2), dtype=float)
out[:, 0] = y_lattice
out[:, 1] = x_lattice
return out
Run Code Online (Sandbox Code Playgroud)