@@ -155,12 +155,65 @@ Welcome to our comprehensive resource collection for AI agents. This page curate
155155** P045** - Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents
156156 - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.06214 )
157157
158- ** P046** - In-the-Flow Agentic System Optimization: Effective Planning and Tool Use
158+ ** P046** - AgentFlow: In-the-Flow Agentic System Optimization: Effective Planning and Tool Use
159159 - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.05592 ) · [ :material-github: GitHub] ( https://github.com/lupantech/AgentFlow )
160160
161161** P047** - ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems
162162 - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.05746 )
163163
164+ ** P048** - Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
165+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.07230 )
166+
167+ ** P049** - CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
168+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08529 )
169+
170+ ** P050** - Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
171+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08276 )
172+
173+ ** P051** - Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
174+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08002 )
175+
176+ ** P052** - MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
177+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08567 )
178+
179+ ** P053** - Agent Learning via Early Experience
180+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08558 )
181+
182+ ** P054** - CaRT: Teaching LLM Agents to Know When They Know Enough
183+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08517 )
184+
185+ ** P055** - AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents
186+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08511 )
187+
188+ ** P056** - Opponent Shaping in LLM Agents
189+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08255 )
190+
191+ ** P057** - NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
192+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.08173 )
193+
194+ ** P058** - VoiceAgentBench: Are Voice Assistants ready for agentic tasks?
195+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.07978 )
196+
197+ ** P059** - Self-Improving LLM Agents at Test-Time
198+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.07841 )
199+
200+ ** P060** - AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
201+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.04206 )
202+
203+ ** P061** - Adaptive Tool Generation with Models as Tools and Reinforcement Learning
204+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.06825 )
205+
206+ ** P062** - TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
207+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.06579 )
208+
209+ ** P063** - A Survey on Agentic Security: Applications, Threats and Defenses
210+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.06445 )
211+
212+ ** P064** - A Multi-Agent Framework for Stateful Inference-Time Search
213+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.07147 )
214+
215+ ** P065** - AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning
216+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.06261 )
164217
165218
166219---
@@ -277,6 +330,9 @@ Welcome to our comprehensive resource collection for AI agents. This page curate
277330** E027** - ACEBench: Who Wins the Match Point in Tool Usage?
278331 - [ :material-file-document: Paper] ( https://arxiv.org/abs/2501.12851 )
279332
333+ ** E028** - Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation
334+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.07414 ) · [ :material-github: GitHub] ( https://github.com/Graph-COM/HaystackCraft )
335+
280336
281337---
282338
@@ -327,6 +383,9 @@ Welcome to our comprehensive resource collection for AI agents. This page curate
327383** M014** - A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory
328384 - [ :material-file-document: Paper] ( https://www.arxiv.org/abs/2510.02373 )
329385
386+ ** M015** - ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
387+ - [ :material-file-document: Paper] ( https://arxiv.org/abs/2510.06664 )
388+
330389
331390---
332391
0 commit comments