相关疑难解决方法(0)

numpy中的非重复随机数

我的问题是:如何在numpy中生成非重复的随机数？

list = np.random.random_integers(20,size=(10))

Run Code Online (Sandbox Code Playgroud)

random numbers numpy non-repetitive

Aca*_*mia

2015 10-05

56
推荐指数

3
解决办法

5万
查看次数

Why is random.sample faster than numpy's random.choice?

I need a way to sample without replacement a certain array a. I tried two approaches (see MCVE below), using random.sample() and np.random.choice.

I assumed the numpy function would be faster, but it turns out it is not. In my tests random.sample is ~15% faster than np.random.choice.

Is this correct, or am I doing something wrong in my example below? If this is correct, why?

import numpy as np
import random
import time
from contextlib import contextmanager …

Run Code Online (Sandbox Code Playgroud)

python random numpy

Gab*_*iel

lucky-day

13
推荐指数

1
解决办法

5334
查看次数

为什么 numpy.random.choice 不使用算术编码？

如果我评估如下：

numpy.random.choice(2, size=100000, p=[0.01, 0.99])

使用一个均匀分布的随机数float，例如r，并决定是否r < 0.01会浪费许多生成的随机位（熵）。我听说（二手）生成伪随机数的计算成本很高，所以我认为numpy不会这样做，而是会在这种情况下使用算术编码之类的方案。

然而，乍一看似乎确实为它所要求的每个样本choice生成了一个。float此外，快速timeit实验表明，生成均匀浮点数实际上比中的样本n更快。np=[0.01, 0.99]

>>> timeit.timeit(lambda : numpy.random.choice(2, size=100000, p=[0.01, 0.99]), number=1000)
1.74494537999999
>>> timeit.timeit(lambda : numpy.random.random(size=100000), number=1000)
0.8165735180009506

Run Code Online (Sandbox Code Playgroud)

真的会像看起来那样为每个样本choice生成一个吗？在某些情况下（特别是当数据很大且分布不均匀时），float使用某种压缩算法不会显着提高性能吗？如果没有，为什么不呢？sizep

python random performance numpy python-3.x

L.G*_*ger

2020 07-31

2
推荐指数

1
解决办法

329
查看次数