Skip to content

Commit e0c9d3d

Browse files
committed
Use thread local mode in routing cuda graph class
1 parent bca194d commit e0c9d3d

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

cpp/src/routing/cuda_graph.cuh

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,9 @@ namespace detail {
2020
struct cuda_graph_t {
2121
void start_capture(rmm::cuda_stream_view stream)
2222
{
23-
cudaStreamBeginCapture(stream, cudaStreamCaptureModeGlobal);
23+
// Use ThreadLocal mode to allow multi-threaded batch execution
24+
// Global mode blocks other streams from performing operations during capture
25+
cudaStreamBeginCapture(stream, cudaStreamCaptureModeThreadLocal);
2426
capture_started = true;
2527
}
2628

0 commit comments

Comments
 (0)