Update openenv examples to use `environment_factory` by sergiopaniego · Pull Request #5235 · huggingface/trl

sergiopaniego · 2026-03-06T18:44:17Z

What does this PR do?

TODO:

Migrate notebooks
Update TRL-OpenEnv guide
Add multi-env example

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

- catch.py: Format observations as readable text, normalize reward to 0-1, handle incomplete episodes - echo.py: Rename step->echo and MyEchoEnv->EchoToolEnv, wrap in main() - wordle.py: Normalize reward to 0-1, add RichProgressCallback - sudoku.py: Fix cumulative message handling (diff-based), add board validation for move validity, add progress/hints/tried-moves to responses, add LoRA support, tune defaults for memory efficiency - vllm_generation.py: Add </tool_call> stop token for tool calling loop - grpo_trainer.py: Skip tool calls for environments that are done Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…-examples

sergiopaniego and others added 4 commits March 4, 2026 10:12

Update openenv scripts to use environemnt_factory

2463c6b

Updated examples

a240c15

Merge branch 'main' of github.com:huggingface/trl into update-openenv…

6a93c55

…-examples

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update openenv examples to use `environment_factory`#5235

Update openenv examples to use `environment_factory`#5235
sergiopaniego wants to merge 4 commits intomainfrom
update-openenv-examples

sergiopaniego commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sergiopaniego commented Mar 6, 2026

What does this PR do?

Before submitting

Who can review?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant