torchtitan/experiments/rl/README.md (+6 -7: 6 additions, 7 deletions)
@@ -1,14 +1,13 @@
-# Run vLLM inference with TorchTitan Qwen3 Model
+# RL Training with TorchTitan and vLLM
 
-This directory contains code to run a single canonical model definition (TorchTitan model definition) with vLLM inference engine (not batch-invariant yet, working in progress). This work is actively developing and only supports inference for now.
-
-This work is inspired by https://github.com/vllm-project/vllm/pull/28685.
+This directory contains code for RL training using TorchTitan model definitions with vLLM inference engine for fast rollout generation.
 
 ## Overview
-The integration consists of two main components:
+The integration consists of the following components:
 
-1. **Model Adapter** (`model/qwen3.py`): A custom model class that extends vLLM's `Qwen3ForCausalLM` to handle TorchTitan checkpoint naming conventions
-2. **Inference Script** (`inference_example.py`): A simple script to register the model and run inference
+1. **vLLM Model Wrapper** (`models/vllm_wrapper.py`): Adapts TorchTitan models for vLLM's inference engine
+2. **RL Training Loop** (`simple_grpo_sum_digits.py`): GRPO-based RL training with Monarch actors
+3. **Inference Script** (`inference_example.py`): Standalone inference using the vLLM engine
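
For readers unfamiliar with the "GRPO-based" training named in the new component list, the core idea is that each rollout is scored relative to the other rollouts sampled for the same prompt. The following is only a minimal sketch of that group-relative advantage step, not code from `simple_grpo_sum_digits.py`. The function name `group_relative_advantages`, the `eps` value, the use of the population standard deviation, and the example rewards are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of rollouts for the same prompt.

    Hypothetical helper: each advantage is (reward - group mean) divided by
    (group population std + eps). eps avoids division by zero when every
    rollout in the group received the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four rollouts of a single prompt (1.0 = correct answer).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Rollouts that score above the group mean get a positive advantage and those below get a negative one; a group whose rewards are all equal yields zero advantages and therefore contributes no learning signal under this sketch.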