Axolotl is a tool for fine-tuning a wide range of AI models. It supports techniques such as LoRA and QLoRA through YAML configuration files.

The [Atropos plugin for Axolotl](https://github.com/axolotl-ai-cloud/plugin-atropos) integrates Atropos' RL environments into Axolotl's training pipelines.
This lets you use Atropos for reinforcement learning alongside Axolotl's model fine-tuning features.

To use it, follow the README in the [plugin repository](https://github.com/axolotl-ai-cloud/plugin-atropos).
## Atropos' Example Trainer
The Atropos repo contains an example trainer. It is intended primarily as a reference that shows how a trainer and an inference provider can be integrated with Atropos to complete the RL training loop.
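
To make the shape of that loop concrete, here is a minimal, self-contained sketch. It is not the example trainer's code and does not use the Atropos API: `ToyInferenceProvider`, `ToyEnvironment`, `ToyTrainer`, and `run_loop` are hypothetical names invented for illustration, the "model" is just three logits over fixed candidate answers, and the update rule is plain REINFORCE with a group-mean baseline chosen as a stand-in for whatever algorithm a real trainer uses.

```python
import math
import random

# Hypothetical stand-ins only: none of these names come from Atropos.
CANDIDATES = ["4", "5", "22"]  # toy "completions" for the prompt "2 + 2 ="


def softmax(logits):
    """Convert logits to probabilities in a numerically stable way."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class ToyInferenceProvider:
    """Plays the inference-provider role: samples completions from the policy."""

    def __init__(self, logits, rng):
        self.logits = logits  # shared with the trainer, like served weights
        self.rng = rng

    def sample(self, n):
        probs = softmax(self.logits)
        return self.rng.choices(range(len(CANDIDATES)), weights=probs, k=n)


class ToyEnvironment:
    """Plays the environment role: scores each completion."""

    def score(self, indices):
        return [1.0 if CANDIDATES[i] == "4" else 0.0 for i in indices]


class ToyTrainer:
    """Plays the trainer role: REINFORCE update with a group-mean baseline."""

    def __init__(self, logits, lr=0.5):
        self.logits = logits
        self.lr = lr

    def step(self, indices, scores):
        probs = softmax(self.logits)
        baseline = sum(scores) / len(scores)
        grads = [0.0] * len(self.logits)
        for i, s in zip(indices, scores):
            advantage = s - baseline
            for j in range(len(grads)):
                # d log p(i) / d logit_j = 1[i == j] - p_j
                grads[j] += advantage * ((1.0 if i == j else 0.0) - probs[j])
        for j in range(len(self.logits)):
            self.logits[j] += self.lr * grads[j] / len(indices)


def run_loop(steps=200, group_size=8, seed=0):
    """Sample rollouts, score them, update the policy, and repeat."""
    rng = random.Random(seed)
    logits = [0.0, 0.0, 0.0]
    provider = ToyInferenceProvider(logits, rng)
    env = ToyEnvironment()
    trainer = ToyTrainer(logits)
    for _ in range(steps):
        rollouts = provider.sample(group_size)
        scores = env.score(rollouts)
        trainer.step(rollouts, scores)
    return softmax(logits)


if __name__ == "__main__":
    print(run_loop())
```

In this sketch the three roles share one list of logits, so an update by the trainer is immediately visible to the sampler. In a real setup these are separate processes, and the trainer has to push updated weights to the inference provider.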

To use the example trainer, see the [training example guide](example_trainer/README.md).

---
## Testing and Debugging Tools
The trajectory-handler provides several debugging tools to help environment developers test and understand their environments locally without requiring the full distributed infrastructure.