相关疑难解决方法(0)

#include <limits.h>
#include <stdio.h>
#include <time.h>

int main(void)
{
    unsigned int k, l, j;
    clock_t tstart = clock();
    for (k = 0, j = 0, l = 0; j < UINT_MAX; ++j)
    {
        ++k;
        k = j;     // <-- comment out this line to remove the MOV instruction
        l += j;
    }
    fprintf(stderr, "%d ms\n", (int)((clock() - tstart) * 1000 / CLOCKS_PER_SEC));
    fflush(stderr);
    return (int)(k + j + l);
}

Run Code Online (Sandbox Code Playgroud)

这为循环生成以下汇编代码(随意生成这个你想要的;你显然不需要Visual C++):

LOOP:
    add edi,esi
    mov …

Run Code Online (Sandbox Code Playgroud)

c x86 assembly cpu-registers micro-optimization

Meh*_*dad

2017 05-26

23
推荐指数

2
解决办法

2113
查看次数

使用sse内在函数最快50%缩放(A)RGB32图像

我想在c ++中尽可能快地缩小图像.本文介绍如何有效地将32位rgb图像平均降低50%.它很快,看起来很好.

我尝试使用sse intrinsics修改该方法.无论是否启用SSE,下面的代码都可以使用.但令人惊讶的是,加速可以忽略不计.

任何人都可以看到改进SSE代码的方法.创建变量shuffle1和shuffle2的两条线似乎是两个候选者(使用一些聪明的移位或类似).

/*
 * Calculates the average of two rgb32 pixels.
 */
inline static uint32_t avg(uint32_t a, uint32_t b)
{
    return (((a^b) & 0xfefefefeUL) >> 1) + (a&b);
}

/*
 * Calculates the average of four rgb32 pixels.
 */
inline static uint32_t avg(const uint32_t a[2], const uint32_t b[2])
{
    return avg(avg(a[0], a[1]), avg(b[0], b[1]));
}

/*
 * Calculates the average of two rows of rgb32 pixels.
 */
void average2Rows(const uint32_t* src_row1, const uint32_t* src_row2, uint32_t* …

Run Code Online (Sandbox Code Playgroud)

c++ sse

bgp*_*000

2017 08-03

5
推荐指数

1
解决办法

1420
查看次数