Skip to content

v0.4.4: improved vLLM perf with on-device-sampling disable, fix speculation algo, PEFT update for GRPO

Choose a tag to compare

@tengomucho tengomucho released this 12 Jan 10:14

What's Changed

Inference

Training

Other

Full Changelog: v0.4.3...v0.4.4