Agents

1. Overview

Agents (also known as AI Agents or Autonomous Agents) are AI systems designed to perceive their environment, reason about goals, make decisions, and take actions autonomously to achieve specified objectives. Unlike traditional software that follows predetermined instructions, agents exhibit goal-directed behavior, adaptability, and autonomy in dynamic environments.

The core idea is simple:

The agent perceives its environment through sensors or inputs
It reasons about goals and plans actions using internal models
It acts upon the environment through actuators or outputs
It learns and adapts its behavior based on feedback and experience

Agents are particularly well-suited for problems involving complex decision-making, uncertainty, multi-step reasoning, and human-AI collaboration.

2. Core Concepts

Agent

An autonomous entity that perceives, reasons, and acts to achieve goals. Examples: chatbots, robotic systems, personal assistants, game NPCs, trading algorithms.

Environment

The context in which the agent operates. Can be physical (robotics), digital (software systems), or hybrid (smart homes).

Perception

The agent's ability to gather information about its environment through sensors, APIs, databases, or user inputs.

Reasoning

The cognitive process of analyzing information, planning actions, and making decisions. May involve symbolic logic, neural networks, or hybrid approaches.

Action

The agent's ability to influence its environment through actuators, API calls, commands, or generated outputs.

Goals

Objectives the agent is designed to achieve. Can be explicit (solve a math problem) or implicit (maximize user satisfaction).

Autonomy

The degree to which an agent operates independently without human intervention.

Reactivity vs Deliberation

Reactive agents respond immediately to environmental stimuli. Deliberative agents plan and reason before acting.

Memory

The agent's ability to store and recall past experiences, conversations, or learned knowledge.

3. The Agent Loop

The fundamental cycle of agent operation:

Perceive: Gather information from the environment (sensors, inputs, context)
Reason: Process information, evaluate goals, consider options
Plan: Determine sequence of actions to achieve objectives
Act: Execute chosen action(s) in the environment
Observe: Monitor results and feedback from actions
Learn: Update internal models, strategies, or knowledge
Repeat: Continue the cycle until goals are achieved or terminated

This loop may execute in milliseconds (reactive agents) or over extended periods (deliberative agents).

4. Types of Agents

Simple Reflex Agents

Respond directly to current percepts using condition-action rules. No memory or planning.

Fast and efficient for simple, well-defined tasks
Limited to fully observable environments
Example: Thermostat, basic chatbot patterns

Model-Based Reflex Agents

Maintain internal state representing the world. Handle partially observable environments.

Track environment state over time
More robust than simple reflex agents
Example: Robot navigation with SLAM

Goal-Based Agents

Reason about goals and plan action sequences to achieve them.

Consider future consequences of actions
Can adapt to changing goals
Example: GPS navigation systems, AI planners

Utility-Based Agents

Maximize a utility function representing preferences over outcomes.

Handle conflicting goals and trade-offs
Can optimize for complex objectives
Example: Recommendation systems, resource allocators

Learning Agents

Improve performance over time through experience.

Adapt to new situations without reprogramming
Discover optimal strategies through trial and error
Example: Reinforcement learning agents, adaptive AI assistants

Multi-Agent Systems

Multiple agents interacting, cooperating, or competing.

Distributed problem-solving
Emergent collective behavior
Example: Swarm robotics, multi-agent simulations, collaborative AI teams

5. Simple Example (Intuition)

Personal Email Assistant Agent:

Perception: Reads incoming emails, calendar, task list
Reasoning: Identifies important messages, scheduling requests, actionable items
Planning: Determines which emails need replies, which to archive, which to escalate
Action: Drafts responses, creates calendar events, sets reminders
Learning: Adapts to user preferences over time (writing style, priority rules)

The agent doesn't just filter emails—it understands context, anticipates needs, and takes proactive actions.

Key insight: The agent exhibits goal-directed behavior (manage communications efficiently) without being explicitly programmed for every scenario.

6. Agent Architectures

Symbolic/Logic-Based Agents

Use explicit knowledge representation and logical reasoning.

Pros: Interpretable, verifiable, good for structured domains
Cons: Brittle, difficult to scale, struggles with uncertainty
Example: Expert systems, STRIPS planners, rule-based chatbots

Reactive Agents

No internal model, direct stimulus-response mappings.

Pros: Fast, robust, simple to implement
Cons: Limited reasoning, no planning, purely reactive
Example: Subsumption architecture (Brooks), behavior-based robotics

Deliberative Agents (BDI Architecture)

Belief-Desire-Intention model: agents have beliefs (world model), desires (goals), and intentions (committed plans).

Beliefs: What the agent knows about the world
Desires: What the agent wants to achieve
Intentions: What the agent has committed to doing
Example: Rational agents in complex planning tasks

Hybrid Architectures

Combine reactive and deliberative layers.

Low-level reactive behaviors for immediate responses
High-level deliberative planning for strategic goals
Example: Three-layer architectures (reactive, executive, deliberative)

Embodied Agents

Physical or simulated robots with sensory and motor capabilities.

Grounded in physical interaction with environments
Subject to real-world constraints (physics, time, resources)
Example: Humanoid robots, drones, autonomous vehicles

Conversational Agents

Natural language interfaces for human-AI interaction.

Understand user intent through language
Generate contextual responses
Maintain conversation state and memory
Example: ChatGPT, virtual assistants, customer service bots

7. Modern LLM-Based Agents

Large Language Models as Agent Brains:

LLMs have revolutionized agent design by providing:

Natural language understanding and generation
Reasoning capabilities through prompt engineering
Tool-use through function calling
Memory through context windows and external storage

Key Components of LLM Agents

Reasoning Engine: LLM processes inputs and generates plans

Tools/Actions: Functions the agent can call (search, calculate, execute code, query databases)

Memory Systems:

Short-term: Conversation context window
Long-term: Vector databases, knowledge graphs, external storage

Planning Strategies:

Zero-shot: Direct task execution
Chain-of-Thought: Step-by-step reasoning
ReAct: Reason + Act cycles
Tree-of-Thoughts: Explore multiple reasoning paths

Feedback Loops: Self-reflection, error correction, iterative refinement

Common Patterns

Function Calling: LLM decides which tools to invoke

ReAct (Reason + Act): Interleave reasoning and action steps

Chain-of-Thought: Explicit step-by-step reasoning before action

Self-Ask: Agent asks itself clarifying questions

Reflection: Agent critiques and improves its own outputs

Multi-Agent Collaboration: Specialized agents working together

8. Agent Components and Capabilities

Perception Systems

Natural language inputs (text, speech)
Structured data (APIs, databases, sensors)
Visual inputs (computer vision)
Multimodal fusion

Memory Systems

Working Memory: Current conversation/task context
Episodic Memory: Past interactions and experiences
Semantic Memory: General knowledge and facts
Procedural Memory: Skills and learned behaviors

Tool Use

Web search and information retrieval
Code execution and interpretation
Mathematical computation
File system operations
API interactions
Database queries

Planning and Reasoning

Task decomposition
Sub-goal identification
Constraint satisfaction
Causal reasoning
Common-sense reasoning
Uncertainty handling

Communication

Natural language generation
Explanation and justification
Clarification questions
Status reporting
Multi-turn dialogue

9. Key Research Papers

Foundational Papers

STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving — Fikes & Nilsson (1971) Classic planning system that influenced agent design.
A Formal Theory of Communicative Action and Plan Recognition — Cohen & Perrault (1979) Foundations of rational agents and speech acts.
Intelligence Without Representation — Brooks (1991) Reactive agents and behavior-based robotics.

Multi-Agent Systems

The Beliefs-Desires-Intentions Model of Agency — Rao & Georgeff (1995) BDI architecture for rational agents.
Communicating Agents in a Simulated Environment — Contract Net Protocol Coordination in multi-agent systems.
Emergence of Conventions in Multi-Agent Systems — Shoham & Tennenholtz (1997) How agents develop shared norms.

Modern LLM-Based Agents

ReAct: Synergizing Reasoning and Acting in Language Models — Yao et al. (2022) Interleaving reasoning traces with actions for improved agent performance.
Toolformer: Language Models Can Teach Themselves to Use Tools — Schick et al. (2023) Self-supervised tool use learning in LLMs.
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models — Wei et al. (2022) Foundation for reasoning in LLM agents.
Tree of Thoughts: Deliberate Problem Solving with Large Language Models — Yao et al. (2023) Structured exploration of reasoning paths.
Reflexion: Language Agents with Verbal Reinforcement Learning — Shinn et al. (2023) Agents learning from verbal feedback.
Generative Agents: Interactive Simulacra of Human Behavior — Park et al. (2023) Believable agent behaviors in simulated environments.
AutoGPT and MetaGPT: Open-source autonomous agent frameworks Practical implementations of multi-agent systems.

10. Foundational Books

Artificial Intelligence: A Modern Approach — Russell & Norvig (4th ed.)

Chapters 2-3: Intelligent Agents
Comprehensive coverage of agent types and architectures
Standard textbook for AI agents

Multi-Agent Systems — Wooldridge (2009)

Definitive guide to multi-agent theory
Game theory and coordination mechanisms
Practical agent programming

An Introduction to MultiAgent Systems — Wooldridge (2nd ed.)

Accessible introduction to agent concepts
BDI architectures and agent communication

Programming Multi-Agent Systems in AgentSpeak using Jason — Bordini et al. (2007)

Practical agent programming
BDI agent implementation

Autonomous Agents: Theory and Practice — Jennings & Wooldridge (1998)

Classic collection on agent foundations
Still relevant for understanding agent principles

11. Learning Resources (Free & High Quality)

Courses

Stanford CS224V – Conversational Virtual Assistants Modern LLM-based agents and dialogue systems.
Berkeley CS188 – Introduction to Artificial Intelligence Foundational agent concepts, search, and planning.
MIT 6.034 – Artificial Intelligence Agent architectures and reasoning.
DeepLearning.AI – LangChain for LLM Application Development Building LLM agents with popular frameworks.
DeepLearning.AI – Functions, Tools and Agents with LangChain Advanced agent patterns and tool use.

Tutorials & Guides

LangChain Documentation Comprehensive guide to building LLM agents.
LlamaIndex Agent Guides Multi-modal agent construction.
OpenAI Function Calling Guide Tool use patterns for GPT models.
Anthropic Claude Function Calling Tool use with Claude models.
HuggingFace Transformers Agents Open-source agent frameworks.

Libraries & Frameworks

LangChain — Popular framework for LLM agents with extensive tool ecosystem
LangGraph — Graph-based agent state management and workflows
AutoGPT — Autonomous agent with internet access and tool use
BabyAGI — Simple autonomous agent template
AgentGPT — Browser-based autonomous agent platform
CrewAI — Role-based multi-agent orchestration
Semantic Kernel (Microsoft) — Enterprise agent framework
Haystack — NLP framework with agent capabilities
Rasa — Conversational AI and chatbot framework
OpenAI Swarm — Educational framework for multi-agent orchestration

12. Common Applications

Virtual Assistants: Siri, Alexa, Google Assistant, ChatGPT
Customer Service Bots: Automated support and ticket resolution
Code Assistants: GitHub Copilot, Cursor, Aider
Research Assistants: Literature review, data analysis, report generation
Trading Bots: Algorithmic trading and portfolio management
Game NPCs: Believable characters with adaptive behaviors
Robotic Systems: Autonomous vehicles, drones, industrial robots
Smart Home Systems: Automated control and optimization
Personal Productivity: Email management, scheduling, task automation
Educational Tutors: Personalized learning and adaptive instruction

Agents are especially powerful where autonomous decision-making, adaptive behavior, and complex task execution are required.

13. Practical Advice for Building Agents

Start simple: Begin with single-task agents before multi-agent systems
Define clear goals: Ambiguous objectives lead to unpredictable behavior
Design robust tool interfaces: Validate inputs, handle errors gracefully
Implement memory carefully: Balance context retention with cost
Test with diverse inputs: Agents must handle unexpected user behavior
Monitor and log extensively: Understand agent decision-making processes
Use structured outputs: JSON schemas for reliable tool calling
Implement safety guardrails: Prevent harmful or unintended actions
Human-in-the-loop: Critical decisions should involve human oversight
Iterate on prompts: Agent behavior heavily depends on system instructions

14. Common Pitfalls

Over-autonomy: Agents taking actions beyond intended scope
Hallucinated tool calls: LLMs inventing non-existent functions
Context window limitations: Losing important information in long conversations
Tool reliability: External APIs failing or returning unexpected data
Error cascades: One failed action leading to chain of failures
Prompt injection: Users manipulating agent behavior through clever inputs
Cost explosion: Agents making excessive API calls
Goal misalignment: Agent optimizing for wrong objective
Lack of transparency: Users not understanding why agent behaved a certain way
Determinism vs creativity: Balancing reliable behavior with adaptive responses

15. Agent Design Patterns

ReAct Pattern

Thought → Action → Observation loop

Thought: I need to find recent news about AI agents
Action: search("AI agents news 2024")
Observation: Found 10 articles about LLM-based agents
Thought: I should read the most recent article
Action: read_url("https://...")
Observation: Article discusses tool use in agents

Tool Chain Pattern

Sequential tool execution with data passing

1. search_database(query) → results
2. filter_results(results, criteria) → filtered
3. generate_report(filtered) → final_output

Self-Refinement Pattern

Generate → Critique → Improve cycle

1. Generate initial response
2. Critique response for errors/improvements
3. Regenerate improved version
4. Repeat until quality threshold met

Multi-Agent Collaboration

Specialized agents with distinct roles

- Researcher Agent: Gathers information
- Analyst Agent: Processes and analyzes data
- Writer Agent: Generates final output
- Critic Agent: Reviews and validates quality

Memory-Augmented Pattern

Agents with external memory systems

- Store conversation history in vector DB
- Retrieve relevant past interactions
- Use retrieved context in current reasoning
- Update memory with new information

16. How Agents Connect to Modern AI Systems

RAG Systems: Agents orchestrating retrieval and generation
Multi-Modal AI: Agents processing vision, text, and speech
Reinforcement Learning: Training agents through reward feedback (RLHF)
Tool Learning: Agents discovering and adapting to new tools
Emergent Abilities: Complex behaviors arising from simple agent interactions
Human-AI Collaboration: Agents as co-workers and assistants
Autonomous Systems: Self-driving cars, drones, robotic systems

Agents represent the next evolution of AI systems—from passive prediction models to active, goal-directed participants in complex tasks.

17. Suggested Next Steps (Hands-on Mini Projects)

Each step is intentionally small and self-contained. These can each live in their own folder or repository.

Project 1: Simple Reflex Agent

Goal: Build intuition for perception-action mapping.

Create a text-based agent that responds to keywords
Implement condition-action rules (if-then logic)
No memory or planning—purely reactive
Example: Simple chatbot with predefined responses
Output: Agent that can handle 10 different commands

Project 2: Calculator Agent with Tool Use

Goal: Understand tool-calling fundamentals.

Agent can perform math operations through function calls
Implement 5 tools: add, subtract, multiply, divide, power
Parse user request to identify operation and arguments
Return structured results
No LLM required—use pattern matching or simple NLU

Project 3: ReAct Agent with OpenAI

Goal: Implement the ReAct pattern with an LLM.

System prompt defining Thought-Action-Observation loop
Implement 3 tools: web_search, calculator, get_weather
Parse LLM output to extract tool calls
Feed observations back to LLM
Limit to 5 reasoning steps maximum
Test with multi-step questions

Project 4: Memory-Enabled Conversational Agent

Goal: Add persistent memory to an agent.

Store conversation history in simple database (JSON or SQLite)
Retrieve relevant past interactions based on current query
Inject retrieved memories into LLM context
Track user preferences and facts over time
Test multi-session continuity

Project 5: Multi-Tool Research Agent

Goal: Build an agent that can research topics autonomously.

Tools: web_search, read_url, extract_text, summarize
Agent breaks down research task into subtasks
Gathers information from multiple sources
Synthesizes findings into coherent report
Track citations and sources

Project 6: Multi-Agent System

Goal: Create specialized agents that collaborate.

Define 3 agents: Researcher, Analyst, Writer
Each agent has specific role and capabilities
Agents pass information sequentially or in parallel
Implement agent-to-agent communication protocol
Example task: "Research and write blog post about X"

Project 7: Self-Reflective Agent

Goal: Implement agent that critiques and improves outputs.

Generate initial response to user query
Use second LLM call to critique response
Identify weaknesses or missing information
Regenerate improved response
Compare before/after quality
Implement iterative refinement (max 3 iterations)

Project 8: Goal-Based Planning Agent

Goal: Agent decomposes complex goals into subgoals.

User provides high-level goal (e.g., "Plan a dinner party")
Agent breaks into subtasks (guest list, menu, shopping, timeline)
Tracks completion status of each subtask
Adapts plan based on constraints and feedback
Visualize task dependency graph

Project 9: Safe Agent with Guardrails

Goal: Learn to constrain agent behavior.

Define allowed and forbidden actions
Implement safety checks before execution
Detect prompt injection attempts
Set spending/rate limits on tool use
Log all actions for audit trail
Test with adversarial inputs

Project 10: Autonomous Task Executor

Goal: Build a production-ready autonomous agent.

Agent accepts natural language task descriptions
Plans multi-step execution strategy
Executes with error handling and retries
Provides progress updates to user
Handles failures gracefully
Example: "Analyze this dataset and create visualizations"

Project 11: Agent Evaluation Framework

Goal: Systematically measure agent performance.

Create test suite with diverse tasks
Measure success rate, efficiency (tool calls), cost
Track reasoning quality and accuracy
Compare different prompting strategies
Identify failure modes and edge cases
Visualize agent behavior patterns

Project 12: RAG Agent with Dynamic Retrieval

Goal: Combine retrieval and agent reasoning.

Agent decides when retrieval is needed
Implements multiple retrieval strategies
Synthesizes information from multiple documents
Cites sources in responses
Handles "I don't know" gracefully
Example: Agent for technical documentation

Project 13: Read-and-Reproduce

Goal: Learn by replication.

Pick one paper (ReAct, Reflexion, or Tree-of-Thoughts)
Reproduce simplified version with modern LLMs
Test on benchmark tasks from the paper
Document what worked vs what broke
Write comparison of paper results vs your implementation

Deep agent understanding comes from building systems that perceive, reason, and act autonomously.

Generation Metadata

Created: January 10, 2026 Research Assistant Version: Specialized AI Documentation Agent Primary Sources: 25+ academic papers, 8 books, 12 courses, 15+ technical resources

Key References:

Russell & Norvig, "Artificial Intelligence: A Modern Approach" (4th ed.)
Yao et al., "ReAct: Synergizing Reasoning and Acting in Language Models" (2022)
Wooldridge, "An Introduction to MultiAgent Systems" (2nd ed.)

Research Methodology:

Literature review: Comprehensive survey of agent architectures from symbolic AI (1970s) through modern LLM-based agents (2023-2026)
Source verification: Cross-referenced multiple foundational texts and recent papers
Expert consultation: Reviewed implementations in LangChain, LlamaIndex, and OpenAI documentation

Last Updated: January 10, 2026 Maintainer: Research Assistant Agent

FilesExpand file tree

agents.md

Latest commit

History

agents.md

File metadata and controls

Agents

1. Overview

2. Core Concepts

Agent

Environment

Perception

Reasoning

Action

Goals

Autonomy

Reactivity vs Deliberation

Memory

3. The Agent Loop

4. Types of Agents

Simple Reflex Agents

Model-Based Reflex Agents

Goal-Based Agents

Utility-Based Agents

Learning Agents

Multi-Agent Systems

5. Simple Example (Intuition)

6. Agent Architectures

Symbolic/Logic-Based Agents

Reactive Agents

Deliberative Agents (BDI Architecture)

Hybrid Architectures

Embodied Agents

Conversational Agents

7. Modern LLM-Based Agents

Key Components of LLM Agents

Common Patterns

8. Agent Components and Capabilities

Perception Systems

Memory Systems

Tool Use

Planning and Reasoning

Communication

9. Key Research Papers

Foundational Papers

Multi-Agent Systems

Modern LLM-Based Agents

10. Foundational Books

11. Learning Resources (Free & High Quality)

Courses

Tutorials & Guides

Libraries & Frameworks

12. Common Applications

13. Practical Advice for Building Agents

14. Common Pitfalls

15. Agent Design Patterns

ReAct Pattern

Tool Chain Pattern

Self-Refinement Pattern

Multi-Agent Collaboration

Memory-Augmented Pattern

16. How Agents Connect to Modern AI Systems

17. Suggested Next Steps (Hands-on Mini Projects)

Project 1: Simple Reflex Agent

Project 2: Calculator Agent with Tool Use

Project 3: ReAct Agent with OpenAI

Project 4: Memory-Enabled Conversational Agent

Project 5: Multi-Tool Research Agent

Project 6: Multi-Agent System

Project 7: Self-Reflective Agent

Project 8: Goal-Based Planning Agent

Project 9: Safe Agent with Guardrails

Project 10: Autonomous Task Executor

Project 11: Agent Evaluation Framework

Project 12: RAG Agent with Dynamic Retrieval

Project 13: Read-and-Reproduce

Generation Metadata