I want to measure the execution time of a GPU kernel. How can I measure it in NVIDIA CUDA? For example:
__global__ void kernelSample()
{
    // some code here
    // get start time
    // some code here
    // get stop time
    // some code here
}
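To make the question concrete, here is a rough sketch of the two kinds of timing I have in mind. It is not a definitive implementation. It assumes that `clock64()` is acceptable for reading a per-thread cycle counter inside the kernel, and that CUDA events (`cudaEventRecord` / `cudaEventElapsedTime`) are acceptable for timing the whole launch from the host. The `cycles` and `out` buffers, the dummy loop, and the launch configuration are made up for illustration, and error checking is omitted for brevity.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread records the clock cycles spent in a dummy workload.
__global__ void kernelSample(long long *cycles, float *out)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;

    long long start = clock64();      // get start time (SM clock cycles)

    float x = static_cast<float>(i);  // placeholder for "some code here"
    for (int k = 0; k < 1000; ++k)
        x = x * 1.0001f + 0.5f;

    long long stop = clock64();       // get stop time

    out[i] = x;                       // store the result so the work is not optimized away
    cycles[i] = stop - start;         // elapsed cycles for this thread
}

int main()
{
    const int n = 256;                // illustrative thread count (one block)
    long long *dCycles = nullptr;
    float *dOut = nullptr;
    cudaMalloc(&dCycles, n * sizeof(long long));
    cudaMalloc(&dOut, n * sizeof(float));

    // Host-side timing of the whole kernel with CUDA events.
    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start);
    kernelSample<<<1, n>>>(dCycles, dOut);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);       // wait until the kernel and the stop event finish

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);  // elapsed time in milliseconds

    long long hCycles = 0;
    cudaMemcpy(&hCycles, dCycles, sizeof(long long), cudaMemcpyDeviceToHost);
    printf("kernel time: %f ms, thread 0 cycles: %lld\n", ms, hCycles);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(dCycles);
    cudaFree(dOut);
    return 0;
}
```

The in-kernel value is in clock cycles, not seconds, so converting it to time would require the device clock rate. The event-based value covers the whole launch rather than a region inside the kernel.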
cuda gpu gpgpu nvidia