inclusionAI
diff --git a/examples/math/multi-turn/README.md b/examples/multi-turn-math/README.md
rename from examples/math/multi-turn/README.md
rename to examples/multi-turn-math/README.md
diff --git a/examples/math/multi-turn/config.yaml b/examples/multi-turn-math/config.yaml
rename from examples/math/multi-turn/config.yaml
rename to examples/multi-turn-math/config.yaml
diff --git a/examples/math/multi-turn/reward_curve.png b/examples/multi-turn-math/reward_curve.png
rename from examples/math/multi-turn/reward_curve.png
rename to examples/multi-turn-math/reward_curve.png
diff --git a/examples/math/multi-turn/train.py b/examples/multi-turn-math/train.py
rename from examples/math/multi-turn/train.py
rename to examples/multi-turn-math/train.py
diff --git a/README.md b/README.md
Lines changed: 10 additions & 9 deletions
@@ -71,15 +71,16 @@ state-of-the-art 7B and 32B models for mathematical reasoning. Check out our
 
 ## 📚 Examples
 
-| Task                                           | Description                                                                          | Performance                                                                       |
-| ---------------------------------------------- | ------------------------------------------------------------------------------------ | --------------------------------------------------------------------------------- |
-| **[Math](examples/math/)**                     | Mathematical problem solving (SFT, GRPO, or PPO)                                     | TBA                                                                               |
-| **[LoRA Math](examples/lora/)**                | Math Agent Trained With LoRA                                                         | TBA                                                                               |
-| **[VLM Math](examples/vlm/)**                  | CLEVR visual counting tasks                                                          | TBA                                                                               |
-| **[Reasoning](examples/countdown/)**           | Countdown numbers game with custom rewards                                           | [Training Curve](/examples/countdown/countdown_training_curve.png)                |
-| **[Search Agent](examples/search-agent/)**     | An agent with end-to-end reasoning, search, browsing, and summarization capabilities | [ASearcher Repo](https://github.com/inclusionAI/ASearcher)                        |
-| **[Tool-Integrated Reasoning](examples/tir/)** | An agent that can invoke tools during reasoning                                      | [TIR Example](https://github.com/inclusionAI/AReaL/tree/main/examples/tir)        |
-| **[RLHF](examples/alignment/)**                | RLHF for LLM Alignment                                                               | [RLHF Example](https://github.com/inclusionAI/AReaL/tree/main/examples/alignment) |
+| Task                                             | Description                                                                          | Performance                                                                       |
+| ------------------------------------------------ | ------------------------------------------------------------------------------------ | --------------------------------------------------------------------------------- |
+| **[Math](examples/math/)**                       | Mathematical problem solving (SFT, GRPO, or PPO)                                     | TBA                                                                               |
+| **[Multi-Turn Math](examples/multi-turn-math/)** | Iterative mathematical problem solving with self-correction                          | [Training Curve](examples/multi-turn-math/reward_curve.png)                       |
+| **[LoRA Math](examples/lora/)**                  | Math Agent Trained With LoRA                                                         | TBA                                                                               |
+| **[VLM Math](examples/vlm/)**                    | CLEVR visual counting tasks                                                          | TBA                                                                               |
+| **[Reasoning](examples/countdown/)**             | Countdown numbers game with custom rewards                                           | [Training Curve](/examples/countdown/countdown_training_curve.png)                |
+| **[Search Agent](examples/search-agent/)**       | An agent with end-to-end reasoning, search, browsing, and summarization capabilities | [ASearcher Repo](https://github.com/inclusionAI/ASearcher)                        |
+| **[Tool-Integrated Reasoning](examples/tir/)**   | An agent that can invoke tools during reasoning                                      | [TIR Example](https://github.com/inclusionAI/AReaL/tree/main/examples/tir)        |
+| **[RLHF](examples/alignment/)**                  | RLHF for LLM Alignment                                                               | [RLHF Example](https://github.com/inclusionAI/AReaL/tree/main/examples/alignment) |
 
 ## 🔧 Support Matrix