squ*_*rem 5 profiler cuda nvidia instructions
我一直在玩NVIDIA分析器(nvprof),有两个我不明白的特定指标:
inst_inter_thread_communication
Number of inter-thread communication instructions executed by non-predicated threads
inst_misc
Number of miscellaneous instructions executed by non-predicated threads
Run Code Online (Sandbox Code Playgroud)
我只是想知道什么指令是线程间通信指令以及哪些指令属于杂项.
参考:http: //docs.nvidia.com/cuda/profiler-users-guide/#metrics-reference