Skip to content

Commit 45f95a8

Browse files
committed
update
1 parent f55dd43 commit 45f95a8

File tree

1 file changed

+5
-0
lines changed

1 file changed

+5
-0
lines changed

_posts/2025-04-18-openrlhf-vllm.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -93,3 +93,8 @@ for bundle_indices in [[0, 1], [2, 3]]:
9393
```
9494

9595
[The complete example](https://docs.vllm.ai/en/latest/getting_started/examples/rlhf_colocate.html) walks through initializing Ray with a specified GPU count, creating a placement group to manage resources, and defining both training actors and inference engines. The training actors manage model initialization and weight updates, while the inference engines serve models via vLLM. Weight synchronization is carried out using CUDA IPC or NCCL, ensuring coherence and efficiency throughout the RLHF pipeline.
96+
97+
98+
## Acknowledgements
99+
100+
We would like to express our sincere gratitude to [Kaichao You](https://github.com/youkaichao) from the **vLLM** team and [Jian Hu](https://github.com/hijkzzz) from **OpenRLHF** for their invaluable contributions and close collaboration throughout this integration. Kaichao’s guidance on vLLM’s internals and RLHF API design significantly accelerated our implementation of high-throughput inference, while Jian’s deep insights into distributed RLHF architecture and training optimization were instrumental in refining OpenRLHF’s usability and performance. Their support not only made this work possible, but also laid the foundation for future advancements in efficient, scalable RLHF systems.

0 commit comments

Comments
 (0)