[Scheduler][colocated batch gen] Is all_reduce necessary in PollBasedBarrier when non-leader GPUs are always noop? #17841
Closed
jiangyukunok started this conversation in General
Replies: 1 comment 1 reply
-
Looking further into the code, I think the SGLANG_ENABLE_COLOCATED_BATCH_GEN mode may not work when dp size > 1. When there are multiple scheduler processes with attn_tp_rank == 0, we can end up holding messages in the pending queue forever, or having messages processed immediately before the unblock signal is received.
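To illustrate the concern (a hypothetical simulation, not actual SGLang code; names like `group_barrier` are invented): with dp size > 1 there are multiple attn_tp_rank == 0 leaders, and if each one's barrier decision depends only on its own local state, the groups can unblock at different times, so one group processes messages while another still holds them pending.

```python
# Hypothetical illustration of the dp_size > 1 concern.
# Each dp group's barrier outcome is assumed to depend only on its own
# leader's local_arrived flag, so groups can disagree.

def group_barrier(leader_arrived: bool) -> bool:
    """Simulated per-group barrier: the group unblocks iff its leader arrived."""
    return leader_arrived

# Two dp groups whose leaders are in different states.
dp_leaders_arrived = [True, False]
unblocked = [group_barrier(a) for a in dp_leaders_arrived]
# Group 0 would process messages immediately while group 1 keeps holding them.
print(unblocked)  # [True, False]
```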
-
Based on https://github.com/sgl-project/sglang/blob/main/python/sglang/srt/managers/scheduler.py#L817
My understanding is that non-leader (attn_tp_rank != 0) GPU processes always have noop=True, so they constantly contribute True to the MIN all-reduce operation: https://github.com/sgl-project/sglang/blob/main/python/sglang/srt/utils/poll_based_barrier.py#L26. This means the all_reduce result is effectively determined solely by the leader GPU's local_arrived value.
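A minimal sketch of why that follows (simulated in plain Python rather than torch.distributed; `min_all_reduce` and `barrier_round` are illustrative names, not SGLang APIs): with booleans reduced as integers under MIN, a fixed True (1) from every non-leader rank can never lower the minimum, so the result equals the leader's flag.

```python
def min_all_reduce(local_values):
    """Simulate an all-reduce with op=MIN: every rank receives the minimum."""
    result = min(local_values)
    return [result] * len(local_values)

def barrier_round(leader_arrived: bool, num_non_leader_ranks: int) -> bool:
    # The leader contributes its real local_arrived flag; non-leader ranks
    # are assumed to always contribute True (i.e. noop=True).
    contributions = [int(leader_arrived)] + [1] * num_non_leader_ranks
    reduced = min_all_reduce(contributions)
    return bool(reduced[0])

# Non-leaders fixed at True cannot change the MIN, so the result
# always equals the leader's flag:
assert barrier_round(leader_arrived=False, num_non_leader_ranks=3) is False
assert barrier_round(leader_arrived=True, num_non_leader_ranks=3) is True
```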
Would it be possible to simplify this by having only the leader GPU track the blocked/unblocked state and removing the synchronization for non-leader GPUs? The broadcast that follows provides synchronization anyway.
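The proposed simplification could be sketched like this (again a pure-Python simulation under the assumptions above; `broadcast` and `simplified_barrier` are hypothetical names): the leader decides from its local flag alone, and the existing broadcast is what actually synchronizes the non-leader ranks.

```python
def broadcast(value, num_ranks):
    """Simulate a broadcast from the leader: every rank receives its value."""
    return [value] * num_ranks

def simplified_barrier(leader_arrived: bool, num_ranks: int):
    # Only the leader tracks the blocked/unblocked state; no all-reduce.
    decision = leader_arrived
    # The subsequent broadcast delivers the decision to all ranks, which is
    # where the non-leader ranks actually synchronize.
    return broadcast(decision, num_ranks)

assert simplified_barrier(True, 4) == [True, True, True, True]
assert simplified_barrier(False, 4) == [False, False, False, False]
```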