| orphan: |
|---|
.. currentmodule:: torchrl.envs.llm
The environment layer orchestrates data loading, tool execution, reward computation, and formatting.
.. autosummary::
:toctree: generated/
:template: rl_template.rst
ChatEnv
CountdownEnv
CountdownRewardParser
DatasetChatEnv
GSM8KEnv
make_gsm8k_env
GSM8KPrepareQuestion
GSM8KRewardParser
IFEvalEnv
IfEvalScorer
IFEvalScoreData
MATHEnv
MATHRewardParser
LLMEnv
LLMHashingEnv
make_mlgym
MLGymWrapper