-
Notifications
You must be signed in to change notification settings - Fork 2.7k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix peft_config type hint in experimental trainers
#5666
opened Apr 27, 2026 by
albertvillanova
Member
Loading…
Fix missing PEFT availability check when passing peft_config to experimental trainers
#5665
opened Apr 27, 2026 by
albertvillanova
Member
Loading…
Fix missing PEFT validation when passing peft_config to core trainers
#5664
opened Apr 27, 2026 by
albertvillanova
Member
Loading…
Align KTO with DPO: Align PEFT handling
#5661
opened Apr 27, 2026 by
albertvillanova
Member
Loading…
Fix token_type_ids requirement for gemma-3 models in GRPOTrainer
#5644
opened Apr 26, 2026 by
robrui
Loading…
3 of 6 tasks
Fix spurious KL gradients for zero-std reward groups in GRPOTrainer
#5640
opened Apr 24, 2026 by
robrui
Loading…
3 of 6 tasks
Align tiny-Glm4MoeForCausalLM with GLM-4.5 reference config
#5638
opened Apr 24, 2026 by
qgallouedec
Member
Loading…
8 tasks
Add Cohere training chat template (#5471)
#5627
opened Apr 22, 2026 by
dschulmeist
Loading…
4 tasks done
feat: Add generation_kwargs support to LogCompletionsCallback and Wea…
#5625
opened Apr 22, 2026 by
LhaseParth2610
Loading…
4 of 8 tasks
Upload testing suite for
DistillationTrainer
#5615
opened Apr 21, 2026 by
cmpatino
Collaborator
Loading…
3 of 8 tasks
experimental: Self-Distillation Zero
#5609
opened Apr 20, 2026 by
LeonEricsson
Collaborator
Loading…
1 of 8 tasks
support prefetch/prefetch_depth for async GRPO for ~5% speedups
#5602
opened Apr 20, 2026 by
winglian
Contributor
Loading…
1 of 8 tasks
fix(distillation): reverse-KL server path NaN on variable completion length
#5594
opened Apr 19, 2026 by
k1064190
Loading…
3 of 8 tasks
Fix nested vocab_size for DistillationTrainer and GOLDTrainer
#5592
opened Apr 19, 2026 by
Beichen-Ma
Loading…
2 of 8 tasks
Add training chat template for Qwen3-2507
#5574
opened Apr 16, 2026 by
SwayamInSync
Contributor
Loading…
refactor: self distillation trainers (sdpo/sdft/...)
#5573
opened Apr 16, 2026 by
LeonEricsson
Collaborator
Loading…
2 of 8 tasks
Fix empty-target self-distillation loss to stay connected to model graph
#5572
opened Apr 16, 2026 by
walawalagoose
Loading…
3 of 8 tasks
Improve BrowserGym examples for latest OpenEnv version
#5568
opened Apr 16, 2026 by
sergiopaniego
Member
Loading…
8 tasks
Set _tokenizer attribute in experimental trainers
#5566
opened Apr 16, 2026 by
albertvillanova
Member
Loading…
Accept processor in
get_training_chat_template
#5560
opened Apr 15, 2026 by
qgallouedec
Member
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.