Actions: huggingface/trl
6,870 workflow runs
grpo_trainer.py): Variational Sequence-Level Soft Policy Optim…
Tests #15506: Commit 406d406 pushed by qgallouedec
grpo_trainer.py): Variational Sequence-Level Soft Policy Optimization (VESPO)
Tests #15505: Pull request #5199 synchronize by qgallouedec
grpo_trainer.py): Variational Sequence-Level Soft Policy Optimization (VESPO)
Tests #15502: Pull request #5199 synchronize by qgallouedec
vllm_mode to "colocate" and add v0→v1 migration gu…
Tests #15496: Commit c0eabc4 pushed by qgallouedec
vllm_mode to "colocate" and add v0→v1 migration guide
Tests #15493: Pull request #5255 synchronize by qgallouedec
vllm_mode to "colocate" and add v0→v1 migration guide
Tests #15492: Pull request #5255 synchronize by qgallouedec
bfd-requeue to bfd_split (#5189)
Tests #15491: Commit 6c0fccd pushed by qgallouedec
bfd-requeue to bfd_split
Tests #15489: Pull request #5189 synchronize by qgallouedec
bfd-requeue to bfd_split
Tests #15487: Pull request #5189 synchronize by qgallouedec
bfd-requeue to bfd_split
Tests #15484: Pull request #5189 synchronize by qgallouedec
bfd-requeue to bfd_split
Tests #15483: Pull request #5189 synchronize by mariosasko
environment_factory
Tests #15480: Pull request #5235 synchronize by sergiopaniego
environment_factory
Tests #15479: Pull request #5235 synchronize by sergiopaniego
environment_factory
Tests #15478: Pull request #5235 synchronize by sergiopaniego
environment_factory
Tests #15477: Pull request #5235 synchronize by sergiopaniego