How do I create a high-resolution timer in Linux to measure program performance?

sj7*_*755 33 c linux precision timer high-resolution

I'm trying to compare GPU and CPU performance. For the NVIDIA GPU I've been using the cudaEvent_t types to get very precise timing.

For the CPU I've been using the following code:

// Timers
clock_t start, stop;
float elapsedTime = 0;

// Capture the start time

start = clock();

// Do something here
.......

// Capture the stop time
stop = clock();
// Retrieve time elapsed in milliseconds
elapsedTime = (float)(stop - start) / (float)CLOCKS_PER_SEC * 1000.0f;

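Apparently, that piece of code is only good if you're counting in seconds. Also, the results sometimes come out quite strange.

Does anyone know of a way to create a high-resolution timer in Linux?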

NPE*_*NPE 46

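Check out clock_gettime, which is the POSIX interface to high-resolution timers.

If, after reading the man page, you're still wondering about the difference between CLOCK_REALTIME and CLOCK_MONOTONIC, see "Difference between CLOCK_REALTIME and CLOCK_MONOTONIC?"

For a complete example, see the following page: http://www.guyrutenberg.com/2007/09/22/profiling-code-using-clock_gettime/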
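
To see what resolution each clock actually reports on a given machine, a minimal sketch using clock_getres may help. The helper names clock_resolution_ns and print_clock_resolutions are made up for illustration, and the values printed depend on the kernel and hardware.

```c
#define _POSIX_C_SOURCE 200809L  /* expose clock_getres and the CLOCK_* ids */
#include <stdio.h>
#include <time.h>

/* Hypothetical helper: resolution the system reports for clock `id`,
 * in nanoseconds, or -1 if the clock cannot be queried. */
long clock_resolution_ns(clockid_t id)
{
    struct timespec res;
    if (clock_getres(id, &res) != 0)
        return -1;
    return (long)res.tv_sec * 1000000000L + res.tv_nsec;
}

/* Prints the reported resolution of the clocks mentioned in this thread. */
void print_clock_resolutions(void)
{
    /* Wall-clock time; can jump when the system time is changed. */
    printf("CLOCK_REALTIME:           %ld ns\n",
           clock_resolution_ns(CLOCK_REALTIME));
    /* Not affected by discontinuous jumps in the system time. */
    printf("CLOCK_MONOTONIC:          %ld ns\n",
           clock_resolution_ns(CLOCK_MONOTONIC));
    /* CPU time consumed by the calling process. */
    printf("CLOCK_PROCESS_CPUTIME_ID: %ld ns\n",
           clock_resolution_ns(CLOCK_PROCESS_CPUTIME_ID));
}
```

Calling print_clock_resolutions() from main prints one line per clock. On older glibc versions you may need to link with -lrt.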

#include <iostream>
#include <time.h>
using namespace std;

timespec diff(timespec start, timespec end);

int main()
{
    timespec time1, time2;
    int temp = 0;  // initialized: reading an uninitialized int is undefined behavior
    clock_gettime(CLOCK_PROCESS_CPUTIME_ID, &time1);
    for (int i = 0; i< 242000000; i++)
        temp+=temp;
    clock_gettime(CLOCK_PROCESS_CPUTIME_ID, &time2);
    cout<<diff(time1,time2).tv_sec<<":"<<diff(time1,time2).tv_nsec<<endl;
    return 0;
}

timespec diff(timespec start, timespec end)
{
    timespec temp;
    if ((end.tv_nsec-start.tv_nsec)<0) {
        temp.tv_sec = end.tv_sec-start.tv_sec-1;
        temp.tv_nsec = 1000000000+end.tv_nsec-start.tv_nsec;
    } else {
        temp.tv_sec = end.tv_sec-start.tv_sec;
        temp.tv_nsec = end.tv_nsec-start.tv_nsec;
    }
    return temp;
}

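  • The OP posted in C, but your answer is in C++. Still useful, but not usable on my ZedBoard, which has no C++ library :D To fix it, put struct in front of timespec and remove the cout. (2 upvotes)
  • So the answer explicitly talks about `CLOCK_REALTIME` and `CLOCK_MONOTONIC`, but then the code example ends up using `CLOCK_PROCESS_CPUTIME_ID`? Can someone clean this up? What is that clock? (2 upvotes)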
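
Following up on the two comments above, here is a rough C-only sketch of the same idea, with the clock passed in as a parameter so CLOCK_MONOTONIC and CLOCK_PROCESS_CPUTIME_ID can be compared. The names timespec_diff, time_busy_loop and report_busy_loop are invented for this sketch, and the volatile accumulator is an assumption added so the compiler keeps the loop.

```c
#define _POSIX_C_SOURCE 200809L  /* expose clock_gettime and the CLOCK_* ids */
#include <stdio.h>
#include <time.h>

/* Returns end - start, borrowing one second when the nanosecond
 * field would otherwise go negative. */
struct timespec timespec_diff(struct timespec start, struct timespec end)
{
    struct timespec d;
    if (end.tv_nsec < start.tv_nsec) {
        d.tv_sec = end.tv_sec - start.tv_sec - 1;
        d.tv_nsec = 1000000000L + end.tv_nsec - start.tv_nsec;
    } else {
        d.tv_sec = end.tv_sec - start.tv_sec;
        d.tv_nsec = end.tv_nsec - start.tv_nsec;
    }
    return d;
}

/* Times a simple busy loop with the clock `id` and returns the elapsed time. */
struct timespec time_busy_loop(clockid_t id, long iterations)
{
    struct timespec t1, t2;
    volatile unsigned long sink = 0;  /* volatile keeps the loop from being removed */

    clock_gettime(id, &t1);
    for (long i = 0; i < iterations; i++)
        sink += (unsigned long)i;
    clock_gettime(id, &t2);
    return timespec_diff(t1, t2);
}

/* Prints the loop time as seen by the monotonic clock (elapsed time)
 * and by the process CPU-time clock (CPU time consumed). */
void report_busy_loop(long iterations)
{
    struct timespec wall = time_busy_loop(CLOCK_MONOTONIC, iterations);
    struct timespec cpu = time_busy_loop(CLOCK_PROCESS_CPUTIME_ID, iterations);

    /* %09ld zero-pads the nanoseconds so the output reads as a decimal fraction. */
    printf("monotonic: %lld.%09ld s\n", (long long)wall.tv_sec, wall.tv_nsec);
    printf("cpu time:  %lld.%09ld s\n", (long long)cpu.tv_sec, cpu.tv_nsec);
}
```

As far as the clock choice goes, CLOCK_PROCESS_CPUTIME_ID counts CPU time consumed by the process, while CLOCK_MONOTONIC counts elapsed time. For a single-threaded busy loop the two should be close; they diverge once the code sleeps, blocks on I/O, or runs several threads.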

Ale*_*lex 18

To summarize the information presented so far, these are the two functions a typical application needs.

#include <time.h>

// call this function to start a nanosecond-resolution timer
struct timespec timer_start(){
    struct timespec start_time;
    clock_gettime(CLOCK_PROCESS_CPUTIME_ID, &start_time);
    return start_time;
}

// call this function to end a timer, returning nanoseconds elapsed as a long
long timer_end(struct timespec start_time){
    struct timespec end_time;
    clock_gettime(CLOCK_PROCESS_CPUTIME_ID, &end_time);
    long diffInNanos = (end_time.tv_sec - start_time.tv_sec) * (long)1e9 + (end_time.tv_nsec - start_time.tv_nsec);
    return diffInNanos;
}

Here is an example of using them to time how long it takes to compute the variance of a list of inputs.

struct timespec vartime = timer_start();  // begin a timer called 'vartime'
double variance = var(input, MAXLEN);  // perform the task we want to time
long time_elapsed_nanos = timer_end(vartime);
printf("Variance = %f, Time taken (nanoseconds): %ld\n", variance, time_elapsed_nanos);

  • Aren't you ignoring the `tv_sec` of the `timespec`? Also, why `CLOCK_PROCESS_CPUTIME_ID` rather than `CLOCK_MONOTONIC`? (6 upvotes)
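
On the clock question in the comment above, a small sketch can make the difference visible: it sleeps for a fixed interval and records how far each clock advanced. The helpers now_ns and measure_sleep are hypothetical names, and int64_t is used here on the assumption that long may be only 32 bits on some targets, where a nanosecond count overflows after roughly 2.1 seconds.

```c
#define _POSIX_C_SOURCE 200809L  /* expose clock_gettime and nanosleep */
#include <stdint.h>
#include <time.h>

/* Hypothetical helper: current reading of clock `id` in nanoseconds,
 * combining tv_sec and tv_nsec into one 64-bit value. */
int64_t now_ns(clockid_t id)
{
    struct timespec ts;
    clock_gettime(id, &ts);
    return (int64_t)ts.tv_sec * 1000000000 + ts.tv_nsec;
}

/* Sleeps for `ms` milliseconds and stores how far each clock advanced:
 * *wall_ns from CLOCK_MONOTONIC, *cpu_ns from CLOCK_PROCESS_CPUTIME_ID. */
void measure_sleep(long ms, int64_t *wall_ns, int64_t *cpu_ns)
{
    struct timespec req = { ms / 1000, (ms % 1000) * 1000000L };
    int64_t wall0 = now_ns(CLOCK_MONOTONIC);
    int64_t cpu0 = now_ns(CLOCK_PROCESS_CPUTIME_ID);

    nanosleep(&req, NULL);  /* the process uses almost no CPU while asleep */

    *wall_ns = now_ns(CLOCK_MONOTONIC) - wall0;
    *cpu_ns = now_ns(CLOCK_PROCESS_CPUTIME_ID) - cpu0;
}
```

After measure_sleep(50, &wall, &cpu), wall should be at least 50 ms while cpu stays far smaller, because time spent sleeping is not charged to the process. If the goal is elapsed time, for example to compare against GPU event timings, CLOCK_MONOTONIC is probably the closer match; CLOCK_PROCESS_CPUTIME_ID fits when only the CPU work done by the process should count.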