用户定义减少不同大小的矢量

Question

用户定义减少不同大小的矢量

我正在尝试为复杂<float>的向量定义自己的减少,在对OpenMP中的数组减少问题的答案之后.

但是我的向量的大小在编译时并不固定,所以我不确定如何在declare reductionpragma中为向量定义初始化器.也就是说,我不能拥有

initializer( omp_priv=TComplexVector(10,0) )

Run Code Online (Sandbox Code Playgroud)

但是矢量需要初始化器.

如何在运行时传递我需要的向量大小的初始化子句？我到目前为止的内容如下:

typedef std::vector<complex<float>> TCmplxVec;

void ComplexAdd(TCmplxVec & x,TCmplxVec & y){
  for (int i=0;i<x.size();i++) 
  {
      x.real()+= y.real();
      //... same for imaginary part and other operations
  }

}

#pragma omp declare reduction(AddCmplx: TCmplxVec: \
ComplexAdd(&omp_out, &omp_in)) initializer( \
omp_priv={TCmplxVec(**here I want a variable length**,0} )

void DoSomeOperation ()
{
    //TCmplxVec vec is empty and anotherVec not

    //so each thread runs the inner loop serially
  #pragma omp parallel for reduction(AddCmplx: vec) 
  for ( n=0 ; n<10 ; ++n )
    {
      for (m=0; m<=someLength; ++m){
        vec[m] += anotherVec[m+someOffset dependend on n and else];
      }
    }
}

Run Code Online (Sandbox Code Playgroud)

Answer 1

Jon*_*rsi 7

你现在必须挖掘一下才能在线找到它,但是在OpenMP标准的第2.15节中讨论了用户声明的缩减,你会发现"特殊标识符omp_orig也可以出现在initializer子句中,它将指的是要减少的原始变量的存储."

所以你可以使用initializer (omp_priv=TCmplxVec(omp_orig.size(),0)),或者只是initalizer ( omp_priv(omp_orig) )在缩小中初始化向量.

所以下面的工作(请注意,您不需要编写自己的例程;您可以使用std :: transform和std :: plus来添加向量;您也可以使用std :: valarray而不是向量,具体取决于方式你使用它们,它已经定义了运算符+):

#include <complex>
#include <vector>
#include <algorithm>
#include <functional>
#include <iostream>
#include <omp.h>

typedef std::vector< std::complex<float> > TCmplxVec;

#pragma omp declare reduction( + : TCmplxVec : \
        std::transform(omp_in.begin( ),  omp_in.end( ), \
                       omp_out.begin( ), omp_out.begin( ), \
                       std::plus< std::complex<float> >( )) ) \
                       initializer (omp_priv(omp_orig))

int main(int argc, char *argv[]) {

    int size;

    if (argc < 2)
        size = 10;
    else
        size = atoi(argv[1]);

    TCmplxVec result(size,0);

    #pragma omp parallel reduction( + : result )
    {
        int tid=omp_get_thread_num();

        for (int i=0; i<std::min(tid+1,size); i++) 
            result[i] += tid;
    }

    for (int i=0; i<size; i++) 
        std::cout << i << "\t" << result[i] << std::endl;

    return 0;
}

Run Code Online (Sandbox Code Playgroud)

运行这个给出

$ OMP_NUM_THREADS=1 ./reduction 8
0   (0,0)
1   (0,0)
2   (0,0)
3   (0,0)
4   (0,0)
5   (0,0)
6   (0,0)
7   (0,0)

$ OMP_NUM_THREADS=4 ./reduction 8
0   (6,0)
1   (6,0)
2   (5,0)
3   (3,0)
4   (0,0)
5   (0,0)
6   (0,0)
7   (0,0)

$ OMP_NUM_THREADS=8 ./reduction 8
0   (28,0)
1   (28,0)
2   (27,0)
3   (25,0)
4   (22,0)
5   (18,0)
6   (13,0)
7   (7,0)

Run Code Online (Sandbox Code Playgroud)

归档时间：	10 年，9 月前
查看次数：	2637 次
最近记录：	10 年，9 月前