Skip to content

Commit 59ea068

Browse files
committed
PERF: Use __threadfence_system
1 parent 1f59fbd commit 59ea068

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

src/tools/perf/cuda/cuda_kernel.cuh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,7 @@ public:
5454
(completed - m_ctx.completed_iters);
5555
m_last_report_time = cur_time;
5656
m_ctx.completed_iters = completed;
57-
__threadfence();
57+
__threadfence_system();
5858

5959
m_next_report_iter = ucs_min(completed + (m_report_interval_ns / iter_time),
6060
m_max_iters);

0 commit comments

Comments
 (0)