Hi, thanks for maintaining this awesome list!
I would like to recommend adding the following recent RL-based agent paper:
- Title: AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning
- Intro: AutoTIR is a reinforcement learning framework that enables LLMs to autonomously decide whether to invoke a tool, and which one (e.g., a search tool or a code tool), during the reasoning process, rather than following a static tool-use strategy.
- Paper: https://arxiv.org/pdf/2507.21836
- Code: https://github.com/weiyifan1023/AutoTIR
- Base LLM: Qwen2.5-7B-Instruct