小编use*_*770的帖子

OpenCL实现的算法比普通循环慢

我是并行计算和OpenCL的新手.我按照OpenCLProgramming指南.在卷积实现部分.

我的main.cpp:

#include <iostream>
#include <sstream>
#include <fstream>
#include <string>
#include <OpenCL/OpenCL.h>

using namespace std;

const unsigned int inputSignalWidth = 8;
const unsigned int inputSignalHeight = 8;

cl_uint inputSignal[inputSignalWidth][inputSignalHeight] =
{
    {3, 1, 1, 4, 8, 2, 1, 3},
    {4, 2, 1, 1, 2, 1, 2, 3},
    {4, 4, 4, 4, 3, 2, 2, 2},
    {9, 8, 3, 8, 9, 0, 0, 0},
    {9, 3, 3, 9, 0, 0, 0, 0},
    {0, 9, 0, 8, 0, 0, 0, 0},
    {3, …
Run Code Online (Sandbox Code Playgroud)

c++ macos gpu opencl amd-processor

1
推荐指数
1
解决办法
225
查看次数

标签 统计

amd-processor ×1

c++ ×1

gpu ×1

macos ×1

opencl ×1