Commit 8fbb949
committed
Update base for Update on "[rl] Add CI for numerics test against vllm native inference"
Test cases:
1. Integration tests:
- single GPU, no compile + cudagraph
- multiple GPU (with TP), no compile + cudagraph
- multiple GPU, with compile + cudagraph
- This test runs on A10G (default CI GPU type)
3. Numerics parity test: vLLM native model vs vLLM + TorchTitan wrapper.
- test_weights_match: max_diff <= 1e-5 (exact weight loading)
- test_attention_module: atol=1e-5 (TP=1)
- test_end_to_end_logits: atol=1e-3 (TP=1)
- We would need to run numerics test for only TP=1. This is because we are assuming both torchtitan and vllm will make sure their multi-GPU implementation is on par with single GPU. And we can add more numerics test under parallelism if needed.
- This test runs on H100, and runs FA3 kernel for attention.
[ghstack-poisoned]1 parent e97d04c commit 8fbb949
File tree
0 file changed
+0
-0
lines changed0 file changed
+0
-0
lines changed
0 commit comments