LeM*_*sel 5 string random performance hex go
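I need to generate many random hex strings of a fixed length. I found this solution: How to generate a random string of a fixed length in Go?
I'm doing something like this: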
const letterBytes = "abcdef0123456789"

const (
	letterIdxBits = 6                    // 6 bits to represent a letter index
	letterIdxMask = 1<<letterIdxBits - 1 // All 1-bits, as many as letterIdxBits
	letterIdxMax  = 63 / letterIdxBits   // # of letter indices fitting in 63 bits
)

var src = rand.NewSource(time.Now().UnixNano())

// RandStringBytesMaskImprSrc ...
// Src: /sf/answers/2228262851/
func RandStringBytesMaskImprSrc(n int) string {
	b := make([]byte, n)
	// A src.Int63() generates 63 random bits, enough for letterIdxMax characters!
	for i, cache, remain := n-1, src.Int63(), letterIdxMax; i >= 0; {
		if remain == 0 {
			cache, remain = src.Int63(), letterIdxMax
		}
		if idx := int(cache & letterIdxMask); idx < len(letterBytes) {
			b[i] = letterBytes[idx]
			i--
		}
		cache >>= letterIdxBits
		remain--
	}
	return string(b)
}

var tryArr = make([]string, 10000)
for i := 0; i < 10000; i++ {
	tryArr[i] = RandStringBytesMaskImprSrc(8)
}
But I get this panic:
panic: runtime error: index out of range
goroutine 36 [running]:
math/rand.(*rngSource).Int63(0x11bb1300, 0x8, 0x8)
D:/Applications/Go/src/math/rand/rng.go:231 +0xa0
main.RandStringBytesMaskImprSrc(0x8, 0x11f81be8, 0x8)
main.go:60 +0x5f
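The error seems to be on this line:
for i, cache, remain := n-1, src.Int63(), letterIdxMax; i >= 0;
but I don't know why it occurs.
What is the fastest and simplest way to generate a large number of fixed-length random hex strings in Go?
Benchmark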
package bench

import (
	"encoding/hex"
	"math/rand"
	"testing"
	"time"
)

const letterBytes = "abcdef0123456789"

const (
	letterIdxBits = 4                    // 4 bits to represent a letter index
	letterIdxMask = 1<<letterIdxBits - 1 // All 1-bits, as many as letterIdxBits
	letterIdxMax  = 63 / letterIdxBits   // # of letter indices fitting in 63 bits
)

var src1 = rand.NewSource(time.Now().UnixNano())
var src2 = rand.New(rand.NewSource(time.Now().UnixNano()))

// RandStringBytesMaskImprSrc1 returns a random hexadecimal string of length n,
// built by masking 4-bit indices out of each Int63() value.
func RandStringBytesMaskImprSrc1(n int) string {
	b := make([]byte, n)
	for i, cache, remain := n-1, src1.Int63(), letterIdxMax; i >= 0; {
		if remain == 0 {
			cache, remain = src1.Int63(), letterIdxMax
		}
		if idx := int(cache & letterIdxMask); idx < len(letterBytes) {
			b[i] = letterBytes[idx]
			i--
		}
		cache >>= letterIdxBits
		remain--
	}
	return string(b)
}

// RandStringBytesMaskImprSrc2 returns a random hexadecimal string of length n,
// built by reading random bytes and hex-encoding them.
func RandStringBytesMaskImprSrc2(n int) string {
	b := make([]byte, (n+1)/2) // can be simplified to n/2 if n is always even
	if _, err := src2.Read(b); err != nil {
		panic(err)
	}
	return hex.EncodeToString(b)[:n]
}

func BenchmarkRandStringBytesMaskImprSrc1(b *testing.B) {
	for n := 0; n < b.N; n++ {
		_ = RandStringBytesMaskImprSrc1(8)
	}
}

func BenchmarkRandStringBytesMaskImprSrc2(b *testing.B) {
	for n := 0; n < b.N; n++ {
		_ = RandStringBytesMaskImprSrc2(8)
	}
}
goos: windows
goarch: 386
BenchmarkRandStringBytesMaskImprSrc1-4 20000000 116 ns/op 16 B/op 2 allocs/op
BenchmarkRandStringBytesMaskImprSrc2-4 10000000 231 ns/op 24 B/op 3 allocs/op
PASS
ok command-line-arguments 5.139s
=> icza's RandStringBytesMaskImprSrc solution is the more efficient one: about 116 ns/op versus 231 ns/op in the run above.
A *math/rand.Rand is an io.Reader, so it is simple to read N random bytes and then hex-encode them:
package main

import (
	"encoding/hex"
	"fmt"
	"math/rand"
	"time"
)

var src = rand.New(rand.NewSource(time.Now().UnixNano()))

func main() {
	fmt.Println(RandStringBytesMaskImprSrc(4))
}

// RandStringBytesMaskImprSrc returns a random hexadecimal string of length n.
func RandStringBytesMaskImprSrc(n int) string {
	b := make([]byte, (n+1)/2) // can be simplified to n/2 if n is always even
	if _, err := src.Read(b); err != nil {
		panic(err)
	}
	return hex.EncodeToString(b)[:n]
}
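Actually, the code you posted works. It does contain a mistake (see below), but that mistake does not cause a panic; it only makes performance worse.
The stack trace you posted points to an error inside the math/rand package, which I have not encountered. Please post your complete code together with your Go version and environment (the output of go version and go env).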
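As it turned out, the asker was calling RandStringBytesMaskImprSrc() concurrently from multiple goroutines. RandStringBytesMaskImprSrc() uses a shared rand.Source instance, which is not safe for concurrent use, hence the panic inside the math/rand package. The fix is to create a separate rand.Source for each goroutine and pass it to RandStringBytesMaskImprSrc().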
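The per-goroutine fix can be sketched as follows. This is only an illustration under stated assumptions: randHex and generate are hypothetical names (not from the original post), randHex is the masking function changed to take its rand.Source as a parameter, the constants use 4 index bits for the 16-letter alphabet, and deriving each worker's seed as a base seed plus the worker index is just one simple way to give every goroutine its own source.

```go
package main

import (
	"fmt"
	"math/rand"
	"sync"
	"time"
)

const letterBytes = "abcdef0123456789"

const (
	letterIdxBits = 4                    // 4 bits to represent a letter index
	letterIdxMask = 1<<letterIdxBits - 1 // All 1-bits, as many as letterIdxBits
	letterIdxMax  = 63 / letterIdxBits   // # of letter indices fitting in 63 bits
)

// randHex (hypothetical name) is the masking generator with the source
// passed in, so that no rand.Source is shared between goroutines.
func randHex(src rand.Source, n int) string {
	b := make([]byte, n)
	for i, cache, remain := n-1, src.Int63(), letterIdxMax; i >= 0; {
		if remain == 0 {
			cache, remain = src.Int63(), letterIdxMax
		}
		if idx := int(cache & letterIdxMask); idx < len(letterBytes) {
			b[i] = letterBytes[idx]
			i--
		}
		cache >>= letterIdxBits
		remain--
	}
	return string(b)
}

// generate (hypothetical helper) starts `workers` goroutines. Each one owns
// its own rand.Source and fills a disjoint part of the result slice.
func generate(workers, perWorker, n int) []string {
	out := make([]string, workers*perWorker)
	seed := time.Now().UnixNano()
	var wg sync.WaitGroup
	for w := 0; w < workers; w++ {
		wg.Add(1)
		go func(w int) {
			defer wg.Done()
			src := rand.NewSource(seed + int64(w)) // private to this goroutine
			for i := 0; i < perWorker; i++ {
				out[w*perWorker+i] = randHex(src, n)
			}
		}(w)
	}
	wg.Wait()
	return out
}

func main() {
	res := generate(4, 2500, 8)
	fmt.Println(len(res), res[0])
}
```

Guarding a single shared source with a sync.Mutex would also avoid the race, at the cost of contention between goroutines; separate sources keep the hot loop lock-free.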
The "configuration" constants at the beginning contain a mistake:
const letterBytes = "abcdef0123456789"

const (
	letterIdxBits = 6                    // 6 bits to represent a letter index
	letterIdxMask = 1<<letterIdxBits - 1 // All 1-bits, as many as letterIdxBits
	letterIdxMax  = 63 / letterIdxBits   // # of letter indices fitting in 63 bits
)
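The constant letterIdxBits should hold the number of bits needed to represent a symbol index. Since you are using an alphabet of 16 elements (the length of letterBytes), and 16 combinations fit in 4 bits, only 4 bits are required: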
letterIdxBits = 4 // 4 bits to represent a letter index
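To make the cost of the wrong constant concrete, here is a small sketch of the arithmetic. bitsNeeded and acceptRate are hypothetical helper names introduced only for this illustration. With 6 index bits, only 16 of the 64 possible masked values are valid indices into the 16-letter alphabet, so three out of four candidates are thrown away, and each Int63() call yields fewer candidates as well.

```go
package main

import (
	"fmt"
	"math/bits"
)

// bitsNeeded returns how many bits are required to index an alphabet of the
// given size (size must be at least 1).
func bitsNeeded(alphabetSize int) int {
	return bits.Len(uint(alphabetSize - 1))
}

// acceptRate returns the fraction of idxBits-wide random values that are
// valid indices into an alphabet of the given size.
func acceptRate(alphabetSize, idxBits int) float64 {
	return float64(alphabetSize) / float64(int(1)<<uint(idxBits))
}

func main() {
	const letterBytes = "abcdef0123456789"
	n := len(letterBytes)

	fmt.Println("bits needed:", bitsNeeded(n))
	for _, idxBits := range []int{6, 4} {
		fmt.Printf("letterIdxBits=%d: %d indices per Int63, accept rate %.2f\n",
			idxBits, 63/idxBits, acceptRate(n, idxBits))
	}
}

// Output:
// bits needed: 4
// letterIdxBits=6: 10 indices per Int63, accept rate 0.25
// letterIdxBits=4: 15 indices per Int63, accept rate 1.00
```

So with 6 bits an 8-character string needs on average about 32 candidate values, which means several Int63() calls, whereas with 4 bits a single Int63() call always suffices.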
Testing it:
var tryArr = make([]string, 10)
for i := range tryArr {
	tryArr[i] = RandStringBytesMaskImprSrc(8)
}
fmt.Println(tryArr)
Output (try it on the Go Playground):
[d3e7caa6 a69c9b7d c37a613b 92d5a43b 64059c4a 4f08141b 70130c65 1546daaf fe140fcd 0d714e4d]
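(Note: because the start time on the Go Playground is fixed and the output is cached, you will always see the same "random" strings there. Run it on your own machine to see random results.)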
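The reason a fixed start time gives fixed output is that a math/rand source is fully determined by its seed. A minimal sketch, assuming a hypothetical helper hexFromSeed and an arbitrary seed value of 42:

```go
package main

import (
	"encoding/hex"
	"fmt"
	"math/rand"
)

// hexFromSeed (hypothetical helper) returns n hex characters produced by a
// generator created from the given seed.
func hexFromSeed(seed int64, n int) string {
	r := rand.New(rand.NewSource(seed))
	b := make([]byte, (n+1)/2)
	r.Read(b) // (*rand.Rand).Read always returns a nil error
	return hex.EncodeToString(b)[:n]
}

func main() {
	a := hexFromSeed(42, 8)
	b := hexFromSeed(42, 8)
	fmt.Println(a == b) // prints "true": same seed, same sequence
}
```

Seeding with time.Now().UnixNano() therefore only varies the output when the clock itself differs between runs.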