小编App*_*per的帖子

std::array<float, MyClass::FEATURE_LENGTH> MyClass::normalize(const std::array<float, FEATURE_LENGTH>& arr) {
    std::array<float, MyClass::FEATURE_LENGTH> output{};
    double mod = 0.0;

    for (size_t i = 0; i < arr.size(); ++i) {
        mod += arr[i] * arr[i];
    }

    double mag = std::sqrt(mod);

    if (mag == 0) {
        throw std::logic_error("The input vector is a zero vector");
    }

    for (size_t i = 0; i < arr.size(); ++i) {
        output[i] = arr[i] / mag;
    }

    return output;
}

Run Code Online (Sandbox Code Playgroud)

c++ performance normalization

App*_*per

2019 08-13

5
推荐指数

1
解决办法

3万
查看次数

为什么当与较新的 libstdc++.so 链接时，C++ 可执行文件运行得如此之快？

我有一个项目（此处的代码），我在其中运行基准测试来比较计算点积的不同方法（朴素方法、特征库、SIMD 实现等）的性能。我正在新的 Centos 7.6 VM 上进行测试。我注意到当我使用不同版本的时libstdc++.so.6，我得到的性能明显不同。

当我启动一个新的 Centos 7.6 实例时，默认的 C++ 标准库是libstdc++.so.6.0.19. 当我运行我的基准测试可执行文件（链接到这个版本的libstdc++）时，输出如下：

Naive Implementation, 1000000 iterations: 1448.74 ns average time
Optimized Implementation, 1000000 iterations: 1094.2 ns average time
AVX2 implementation, 1000000 iterations: 1069.57 ns average time
Eigen Implementation, 1000000 iterations: 1027.21 ns average time
AVX & FMA implementation 1, 1000000 iterations: 1028.68 ns average time
AVX & FMA implementation 2, 1000000 iterations: 1021.26 ns average time

Run Code Online (Sandbox Code Playgroud)

如果我下载libstdc++.so.6.0.26并更改符号链接libstdc++.so.6 …

c++ optimization gcc libstdc++

App*_*per

2020 01-04

4
推荐指数

1
解决办法

363
查看次数

如何使用 CMake 3.15 及更高版本查找并链接 CUDA 库？

我在类 Unix 系统上使用 CMake 3.15-rc3。

我需要将我正在构建的程序与多个 CUDA 库链接，包括cublas, cufft, cusolver, curand, nppicc, nppial, nppist, nppidei, nppig, nppitc, npps。

根据我在网上找到的信息，我需要做这样的事情：

add_executable(test benchmark.cpp)\nfind_package(CUDALibs)\ntarget_link_libraries(test CUDA::cudart CUDA::cublas CUDA::cufft CUDA::cusolver CUDA::curand CUDA::nppicc CUDA::nppial CUDA::nppist CUDA::nppidei CUDA::nppig CUDA::nppitc CUDA::npps)\n

Run Code Online (Sandbox Code Playgroud)\n

当我运行时，make出现以下错误：

CMake Warning at CMakeLists.txt:27 (find_package):\n  By not providing "FindCUDALibs.cmake" in CMAKE_MODULE_PATH this project has\n  asked CMake to find a package configuration file provided by "CUDALibs",\n  but CMake …

Run Code Online (Sandbox Code Playgroud)

c++ cuda cmake

App*_*per

2023 07-17

4
推荐指数

1
解决办法

7739
查看次数

如何使用 OpenCV Python 和 GStreamer 后端创建 x264 RTSP 服务器

我的目标是使用 GStreamer 后端使用 OpenCV Python 创建一个 RTSP 服务器。我将 RGB 图像存储为 OpenCV Mat，并且我想创建一个VideoWriter可以写入 RTSP 接收器的图像。输出视频必须采用 x264 编码。

我相信使用 GStreamer 管道并向VideoWriter构造函数提供管道参数，然后将帧推送到 VideoWriter 可以轻松实现这一点，但问题是我没有使用 GStreamer 的经验，我发现它非常令人困惑。

我在 SO 上找到的答案不完整，使用特定的硬件解码器（例如 NVIDIA Jetson），或者过于复杂。我想找到一个适用于 CPU 的更通用的解决方案。

python opencv rtsp gstreamer

App*_*per

lucky-day

4
推荐指数

1
解决办法

1万
查看次数

无法为 ARM 交叉编译 postgresql 12.2

希望从 ARM 源代码交叉编译 postgresql 获得一些帮助。我正在尝试在X86_64 Ubuntu 18.04.4. 我正在使用传递 autoconf 以下参数：

CC=arm-linux-gnueabihf-gcc CXX=arm-linux-gnueabihf-g++ AR=arm-linux-gnueabihf-ar RANLIB=arm-linux-gnueabihf-ranlib ../configure --host=arm-linux-gnueabihf --without-readline --without-zlib

Run Code Online (Sandbox Code Playgroud)

当我使用 postgresql release 使用上述参数运行 configure 时9.6.2，它会成功，并且我能够正确构建库。但是，我想使用最新版本，目前是 V 12.2。

当我使用 V 运行上述命令时12.2，收到以下错误消息：

// a bunch of successfull output from autoconf before error message...
checking for /dev/urandom... configure: error: cannot check for file existence when cross compiling

Run Code Online (Sandbox Code Playgroud)

任何想法如何解决这一问题？这是他们的 autoconf 中的错误，还是我这边做错了什么？

linux postgresql autoconf cross-compiling

App*_*per

2020 05-13

2
推荐指数

1
解决办法

983
查看次数