We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 1f59fbd commit 59ea068Copy full SHA for 59ea068
src/tools/perf/cuda/cuda_kernel.cuh
@@ -54,7 +54,7 @@ public:
54
(completed - m_ctx.completed_iters);
55
m_last_report_time = cur_time;
56
m_ctx.completed_iters = completed;
57
- __threadfence();
+ __threadfence_system();
58
59
m_next_report_iter = ucs_min(completed + (m_report_interval_ns / iter_time),
60
m_max_iters);
0 commit comments