Commit 8fbb949

Update base for Update on "[rl] Add CI for numerics test against vllm native inference"

Test cases:

1. Integration tests:
   - single GPU, no compile + cudagraph
   - multiple GPU (with TP), no compile + cudagraph
   - multiple GPU, with compile + cudagraph
   - This test runs on A10G (default CI GPU type).

3. Numerics parity test: vLLM native model vs vLLM + TorchTitan wrapper.
   - test_weights_match: max_diff <= 1e-5 (exact weight loading)
   - test_attention_module: atol=1e-5 (TP=1)
   - test_end_to_end_logits: atol=1e-3 (TP=1)
   - We only need to run the numerics test for TP=1, because we assume both torchtitan and vllm ensure their multi-GPU implementations are on par with single GPU. We can add more numerics tests under parallelism if needed.
   - This test runs on H100 and uses the FA3 kernel for attention.

[ghstack-poisoned]
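For illustration only, the tolerance checks listed under the numerics parity test could be expressed roughly as below. This is a minimal sketch, not the test code from this change: the helper names (`max_abs_diff`, `within_atol`), the constant names, and the sample values are all hypothetical, and plain Python lists stand in for flattened tensors. Only the three thresholds come from the description above.

```python
# Hypothetical sketch of the tolerance checks; not the actual CI test code.
# Thresholds are taken from the commit description; all names are assumptions.
WEIGHT_MAX_DIFF = 1e-5  # test_weights_match
ATTENTION_ATOL = 1e-5   # test_attention_module (TP=1)
LOGITS_ATOL = 1e-3      # test_end_to_end_logits (TP=1)


def max_abs_diff(actual, expected):
    """Largest element-wise absolute difference between two flat sequences."""
    if len(actual) != len(expected):
        raise ValueError("shape mismatch between actual and expected")
    return max((abs(a - e) for a, e in zip(actual, expected)), default=0.0)


def within_atol(actual, expected, atol):
    """True when every element differs by at most `atol` (absolute only)."""
    return max_abs_diff(actual, expected) <= atol


# Made-up values standing in for logits from the two code paths.
native_logits = [0.1250, -1.5000, 2.0000]   # vLLM native model (illustrative)
wrapper_logits = [0.1254, -1.5003, 2.0001]  # vLLM + TorchTitan wrapper (illustrative)

print(within_atol(wrapper_logits, native_logits, LOGITS_ATOL))
print(within_atol(wrapper_logits, native_logits, ATTENTION_ATOL))
```

In this made-up example the outputs agree within the looser end-to-end tolerance but not within the tighter module-level one, which is the kind of gap the two separate thresholds are presumably meant to accommodate.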

1 parent e97d04c, commit 8fbb949

0 files changed, 0 lines changed
