保持numpy数组的每一行的n个最大值，其他所有零

Question

保持numpy数组的每一行的n个最大值，其他所有零

我有一个numpy的数据数组，我只需要保留n最高值，而将其他所有内容都清零。

我当前的解决方案：

import numpy as np
np.random.seed(30)

# keep only the n highest values
n = 3

# Simple 2x5 data field for this example, real life application will be exteremely large
data = np.random.random((2,5))
#[[ 0.64414354  0.38074849  0.66304791  0.16365073  0.96260781]
# [ 0.34666184  0.99175099  0.2350579   0.58569427  0.4066901 ]]


# find indices of the n highest values per row
idx = np.argsort(data)[:,-n:]
#[[0 2 4]
# [4 3 1]]


# put those values back in a blank array
data_ = np.zeros(data.shape) # blank slate
for i in xrange(data.shape[0]):
    data_[i,idx[i]] = data[i,idx[i]]

# Each row contains only the 3 highest values per row or the original data
#[[ 0.64414354  0.          0.66304791  0.          0.96260781]
# [ 0.          0.99175099  0.          0.58569427  0.4066901 ]]

Run Code Online (Sandbox Code Playgroud)

在上面的代码中，data_具有n最高的值，其他所有东西都归零。即使data.shape[1]小于，效果也很好n。但是唯一的问题是for loop，这很慢，因为我的实际用例是在非常大的阵列上。

是否有可能摆脱for循环？

Answer 1

DSM*_*DSM 5

您可以按向量化方式对np.argsortnp.argsort 的结果进行两次操作，第一个获得索引顺序，第二个获得排名，然后使用np.where或仅使用乘法将其他所有内容归零：

In [116]: np.argsort(data)
Out[116]: 
array([[3, 1, 0, 2, 4],
       [2, 0, 4, 3, 1]])

In [117]: np.argsort(np.argsort(data))  # these are the ranks
Out[117]: 
array([[2, 1, 3, 0, 4],
       [1, 4, 0, 3, 2]])

In [118]: np.argsort(np.argsort(data)) >= data.shape[1] - 3
Out[118]: 
array([[ True, False,  True, False,  True],
       [False,  True, False,  True,  True]], dtype=bool)

In [119]: data * (np.argsort(np.argsort(data)) >= data.shape[1] - 3)
Out[119]: 
array([[ 0.64414354,  0.        ,  0.66304791,  0.        ,  0.96260781],
       [ 0.        ,  0.99175099,  0.        ,  0.58569427,  0.4066901 ]])

In [120]: np.where(np.argsort(np.argsort(data)) >= data.shape[1]-3, data, 0)
Out[120]: 
array([[ 0.64414354,  0.        ,  0.66304791,  0.        ,  0.96260781],
       [ 0.        ,  0.99175099,  0.        ,  0.58569427,  0.4066901 ]])

Run Code Online (Sandbox Code Playgroud)

归档时间：	8 年，3 月前
查看次数：	277 次
最近记录：	8 年，3 月前