Refactor vllm_is_ratio tests to avoid combinatorial explosion

138bb26
Merged

Add vLLM importance sampling ratio support for GRPO loss #1088
