Skip to content

Commit 4dd383b

Browse files
committed
add papers and update index
1 parent 648f6cf commit 4dd383b

File tree

2 files changed

+77
-1
lines changed

2 files changed

+77
-1
lines changed

docs/mkdocs/docs/all_about_agents.md

Lines changed: 60 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -155,12 +155,65 @@ Welcome to our comprehensive resource collection for AI agents. This page curate
155155
**P045** - Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents
156156
- [:material-file-document: Paper](https://arxiv.org/abs/2510.06214)
157157

158-
**P046** - In-the-Flow Agentic System Optimization: Effective Planning and Tool Use
158+
**P046** - AgentFlow: In-the-Flow Agentic System Optimization: Effective Planning and Tool Use
159159
- [:material-file-document: Paper](https://arxiv.org/abs/2510.05592) · [:material-github: GitHub](https://github.com/lupantech/AgentFlow)
160160

161161
**P047** - ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems
162162
- [:material-file-document: Paper](https://arxiv.org/abs/2510.05746)
163163

164+
**P048** - Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
165+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.07230)
166+
167+
**P049** - CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
168+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08529)
169+
170+
**P050** - Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
171+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08276)
172+
173+
**P051** - Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
174+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08002)
175+
176+
**P052** - MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
177+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08567)
178+
179+
**P053** - Agent Learning via Early Experience
180+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08558)
181+
182+
**P054** - CaRT: Teaching LLM Agents to Know When They Know Enough
183+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08517)
184+
185+
**P055** - AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents
186+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08511)
187+
188+
**P056** - Opponent Shaping in LLM Agents
189+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08255)
190+
191+
**P057** - NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
192+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.08173)
193+
194+
**P058** - VoiceAgentBench: Are Voice Assistants ready for agentic tasks?
195+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.07978)
196+
197+
**P059** - Self-Improving LLM Agents at Test-Time
198+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.07841)
199+
200+
**P060** - AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
201+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.04206)
202+
203+
**P061** - Adaptive Tool Generation with Models as Tools and Reinforcement Learning
204+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.06825)
205+
206+
**P062** - TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
207+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.06579)
208+
209+
**P063** - A Survey on Agentic Security: Applications, Threats and Defenses
210+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.06445)
211+
212+
**P064** - A Multi-Agent Framework for Stateful Inference-Time Search
213+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.07147)
214+
215+
**P065** - AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning
216+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.06261)
164217

165218

166219
---
@@ -277,6 +330,9 @@ Welcome to our comprehensive resource collection for AI agents. This page curate
277330
**E027** - ACEBench: Who Wins the Match Point in Tool Usage?
278331
- [:material-file-document: Paper](https://arxiv.org/abs/2501.12851)
279332

333+
**E028** - Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation
334+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.07414) · [:material-github: GitHub](https://github.com/Graph-COM/HaystackCraft)
335+
280336

281337
---
282338

@@ -327,6 +383,9 @@ Welcome to our comprehensive resource collection for AI agents. This page curate
327383
**M014** - A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory
328384
- [:material-file-document: Paper](https://www.arxiv.org/abs/2510.02373)
329385

386+
**M015** - ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
387+
- [:material-file-document: Paper](https://arxiv.org/abs/2510.06664)
388+
330389

331390
---
332391

docs/mkdocs/docs/index.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -53,6 +53,23 @@ Explore the complete MiroMind AI ecosystem:
5353
| **MiroTrain** | Complete training recipes and tools | [GitHub](https://github.com/MiroMindAI/MiroTrain) :material-arrow-right: |
5454

5555

56+
## 📝 Recent Updates
57+
58+
!!! success "Latest Changes & Improvements"
59+
60+
**Oct 2025** -
61+
62+
- 📊 Added support for FinSearchComp evaluation benchmark [#51](https://github.com/MiroMindAI/MiroFlow/pull/51)
63+
- 🔍 Added support for XBench-DS (Deep Search) evaluation [#47](https://github.com/MiroMindAI/MiroFlow/pull/47)
64+
- 🧠 Updated o3 hints and summary to more models [#58](https://github.com/MiroMindAI/MiroFlow/pull/58)
65+
- ✨ Added support for GPT-5 integration [#52](https://github.com/MiroMindAI/MiroFlow/pull/52)
66+
- 🔧 Improved tool logs and per-task log storage [#69](https://github.com/MiroMindAI/MiroFlow/pull/69)
67+
- 🤖 Added support for single agent mode [#67](https://github.com/MiroMindAI/MiroFlow/pull/67)
68+
- 📚 Added comprehensive collection of agentic AI research papers [#65](https://github.com/MiroMindAI/MiroFlow/pull/65)
69+
70+
71+
72+
5673

5774
---
5875

0 commit comments

Comments
 (0)