Is "reinterpret_cast" between a hardware SIMD vector pointer and the corresponding type undefined behavior?

constexpr size_t _m256_float_step_sz = sizeof(__m256) / sizeof(float);
alignas(__m256) float stack_store[100 * _m256_float_step_sz ]{};
__m256& hwvec1 = *reinterpret_cast<__m256*>(&stack_store[0 * _m256_float_step_sz]);

using arr_t = float[_m256_float_step_sz];
arr_t& arr1 = *reinterpret_cast<float(*)[_m256_float_step_sz]>(&hwvec1);


Do hwvec1 and arr1 rely on undefined behavior?

Do they violate the strict aliasing rules? [basic.lval]/11

Or is going through the intrinsics the only well-defined way:

__m256 hwvec2 = _mm256_load_ps(&stack_store[0 * _m256_float_step_sz]);
_mm256_store_ps(&stack_store[1 * _m256_float_step_sz], hwvec2);

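For comparison with the cast-based version, here is a minimal sketch of two ways to read the lanes of a __m256 without forming a reference to a different type: an unaligned store intrinsic and std::memcpy. The names lanes_via_store, lanes_via_memcpy, kLanes and the AVX_TARGET macro are hypothetical and not from the question; the macro is only there so the sketch builds with GCC/Clang without -mavx. It assumes an x86-64 CPU with AVX, and that the eight float lanes sit contiguously inside __m256, which holds on mainstream compilers but is not promised by the C++ standard.

```cpp
#include <cstddef>
#include <cstring>
#include <immintrin.h>

// Assumption: GCC/Clang. Lets these functions use AVX even without -mavx.
#if defined(__GNUC__)
#define AVX_TARGET __attribute__((target("avx")))
#else
#define AVX_TARGET
#endif

constexpr std::size_t kLanes = sizeof(__m256) / sizeof(float);  // 8 floats per vector

// Loads one 32-byte-aligned block and writes its lanes to `out`
// using a store intrinsic; no pointer or reference cast is involved.
AVX_TARGET inline void lanes_via_store(const float* aligned_src, float (&out)[kLanes]) {
    const __m256 v = _mm256_load_ps(aligned_src);  // source must be 32-byte aligned
    _mm256_storeu_ps(out, v);                      // destination may have any alignment
}

// Same result, but copies the vector's object representation with memcpy.
AVX_TARGET inline void lanes_via_memcpy(const float* aligned_src, float (&out)[kLanes]) {
    const __m256 v = _mm256_load_ps(aligned_src);
    static_assert(sizeof(v) == sizeof(out), "vector and array sizes must match");
    std::memcpy(out, &v, sizeof v);
}
```

Both helpers take plain float pointers and arrays, so the calling code never names an lvalue of type __m256 that refers to float storage. With optimization enabled, compilers typically lower the memcpy to the same kind of vector move as the store intrinsic.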

godbolt

c++ x86 intrinsics undefined-behavior language-lawyer

san*_*orn

2019-11-18

6 votes, 1 answer, 1080 views
