victordibia
diff --git a/README.md b/README.md
Lines changed: 11 additions & 10 deletions
diff --git a/examples/agents/software_engineer_agent.py b/examples/agents/software_engineer_agent.py
Lines changed: 30 additions & 0 deletions
@@ -33,30 +33,31 @@ The book is organized across 4 parts, taking you from theory to production:
 | -------- | -------------------------------------------- | --------------------------------------------------------------------------------- | -------------------------------------------------------- |
 | **Ch 1** | Understanding Multi-Agent Systems            | Poet/critic example, references [`yc_analysis/`](examples/workflows/yc_analysis/) | Understand when multi-agent systems are needed           |
 | **Ch 2** | Multi-Agent Patterns                         | -                                                                                 | Master coordination strategies (workflows vs autonomous) |
-| **Ch 3** | UX Design Principles for Multi-Agent Systems | -                                                                                 | Build intuitive agent interfaces                         |
+| **Ch 3** | UX Design Principles for Multi-Agent Systems | -                                                                                 | Principles for building intuitive agent interfaces       |
 
 ### Part II: Building Multi-Agent Systems from Scratch
 
 | Chapter  | Title                                 | Code                                                                                                                                                                                                                                                                                                                                                                                                                                    | Learning Outcome                                                                          |
 | -------- | ------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------- |
-| **Ch 4** | Building Your First Agent             | [`agents/_agent.py`](picoagents/src/picoagents/agents/_agent.py), [`basic-agent.py`](examples/agents/basic-agent.py), [`memory.py`](examples/agents/memory.py), [`middleware.py`](examples/agents/middleware.py) <br> [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/victordibia/designing-multiagent-systems/blob/main/examples/notebooks/01_basic_agent.ipynb) | Build agents with tools, memory, streaming, and middleware                    |
+| **Ch 4** | Building Your First Agent             | [`agents/_agent.py`](picoagents/src/picoagents/agents/_agent.py), [`basic-agent.py`](examples/agents/basic-agent.py), [`memory.py`](examples/agents/memory.py), [`middleware.py`](examples/agents/middleware.py) <br> [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/victordibia/designing-multiagent-systems/blob/main/examples/notebooks/01_basic_agent.ipynb) | Build agents with tools, memory, streaming, and middleware                                |
 | **Ch 5** | Computer Use Agents                   | [`agents/_computer_use/`](picoagents/src/picoagents/agents/_computer_use/), [`computer_use.py`](examples/agents/computer_use.py)                                                                                                                                                                                                                                                                                                        | Build browser automation agents with multimodal reasoning                                 |
-| **Ch 5** | Building Multi-Agent Workflows        | [`workflow/`](picoagents/src/picoagents/workflow/), [`data_visualization/`](examples/workflows/data_visualization/)                                                                                                                                                                                                                                                                                                                     | Build type-safe workflows with streaming observability                                    |
-| **Ch 6** | Autonomous Multi-Agent Orchestration  | [`orchestration/`](picoagents/src/picoagents/orchestration/), [`round-robin.py`](examples/orchestration/round-robin.py), [`ai-driven.py`](examples/orchestration/ai-driven.py), [`plan-based.py`](examples/orchestration/plan-based.py)                                                                                                                                                                                                 | Implement GroupChat, LLM-driven, and plan-based orchestration (Magentic One patterns)     |
-| **Ch 6** | Building Modern Agent UX Applications | [`webui/`](picoagents/src/picoagents/webui/), CLI tools                                                                                                                                                                                                                                                                                                                                                                                 | Build interactive agent applications with web UI, auto-discovery, and real-time streaming |
-| **Ch 6** | Multi-Agent Frameworks                | -                                                                                                                                                                                                                                                                                                                                                                                                                                       | Evaluate and choose the right multi-agent framework                                       |
+| **Ch 6** | Building Multi-Agent Workflows        | [`workflow/`](picoagents/src/picoagents/workflow/), [`workflows/`](examples/workflows/)                                                                                                                                                                                                                                                                                                                                                 | Build type-safe workflows with streaming observability                                    |
+| **Ch 7** | Autonomous Multi-Agent Orchestration  | [`orchestration/`](picoagents/src/picoagents/orchestration/), [`round-robin.py`](examples/orchestration/round-robin.py), [`ai-driven.py`](examples/orchestration/ai-driven.py), [`plan-based.py`](examples/orchestration/plan-based.py)                                                                                                                                                                                                 | Implement GroupChat, LLM-driven, and plan-based orchestration (Magentic One patterns)     |
+| **Ch 8** | Building Modern Agent UX Applications | [`webui/`](picoagents/src/picoagents/webui/), CLI tools                                                                                                                                                                                                                                                                                                                                                                                 | Build interactive agent applications with web UI, auto-discovery, and real-time streaming |
+| **Ch 9** | Multi-Agent Frameworks                | -                                                                                                                                                                                                                                                                                                                                                                                                                                       | Evaluate and choose the right multi-agent framework                                       |
 
 ### Part III: Evaluating and Optimizing Multi-Agent Systems
 
-| Chapter  | Title                          | Code                                                                                                         | Learning Outcome                                          |
-| -------- | ------------------------------ | ------------------------------------------------------------------------------------------------------------ | --------------------------------------------------------- |
-| **Ch 8** | Evaluating Multi-Agent Systems | [`eval/`](picoagents/src/picoagents/eval/), [`agent-evaluation.py`](examples/evaluation/agent-evaluation.py) | Build evaluation frameworks with LLM-as-judge and metrics |
+| Chapter   | Title                          | Code                                                                                                         | Learning Outcome                                          |
+| --------- | ------------------------------ | ------------------------------------------------------------------------------------------------------------ | --------------------------------------------------------- |
+| **Ch 10** | Evaluating Multi-Agent Systems | [`eval/`](picoagents/src/picoagents/eval/), [`agent-evaluation.py`](examples/evaluation/agent-evaluation.py) | Build evaluation frameworks with LLM-as-judge and metrics |
 
 ### Part IV: Real-World Applications
 
 | Chapter   | Title                                     | Code                                              | Learning Outcome                                                                         |
 | --------- | ----------------------------------------- | ------------------------------------------------- | ---------------------------------------------------------------------------------------- |
-| **Ch 13** | Business Questions from Unstructured Data | [`yc_analysis/`](examples/workflows/yc_analysis/) | Production case study: Analyze 5,000+ companies with cost optimization and checkpointing |
+| **Ch 16** | Business Questions from Unstructured Data | [`yc_analysis/`](examples/workflows/yc_analysis/) | Production case study: Analyze 5,000+ companies with cost optimization and checkpointing |
+| **Ch 17** | Software Engineering Agent                | [`swe_agent/`](examples/agents/swe_agent/)        | Build a complete software engineering agent with coding tools and workspace management   |
 
 ## Getting Started
 
 
@@ -23,6 +23,7 @@
 from picoagents.llm import AzureOpenAIChatCompletionClient
 from picoagents.tools import (
     MemoryTool,
+    TaskStatusTool,
     ThinkTool,
     create_coding_tools,
 )
@@ -112,6 +113,32 @@ async def main():
    - Document: what the problem was, your solution, why it works
 3. Update /memories/current_task.md with completion status
 
+## PHASE 5: TASK COMPLETION (CRITICAL - ALWAYS DO THIS)
+Before finishing, ALWAYS call task_status tool to formally evaluate completion:
+
+If ALL requirements are satisfied:
+  task_status(
+    status="complete",
+    rationale="Detailed explanation of how each requirement was met with evidence",
+    requirements_met=["List each requirement satisfied"]
+  )
+
+If unable to complete (blocked, need input, hit limits):
+  task_status(
+    status="incomplete",
+    rationale="Explain the blocker, what was tried, why stopping now",
+    requirements_pending=["List what remains"]
+  )
+
+Example complete rationale:
+"✓ Requirement 1 (4 functions): Created add, subtract, multiply, divide in calculator.py
+ ✓ Requirement 2 (error handling): divide() raises ValueError for zero divisor
+ ✓ Requirement 3 (tests): Created test_calculator.py with 12 tests, all passed
+ ✓ Requirement 4 (documentation): Added comprehensive docstrings to all functions
+All requirements verified and complete."
+
+NEVER finish without calling task_status. This documents WHY you're stopping.
+
 ## MEMORY ORGANIZATION
 - /memories/patterns/: Reusable solutions, code patterns, common bugs
 - /memories/decisions/: Why we chose specific approaches (dated logs)
@@ -122,6 +149,7 @@ async def main():
 - ALWAYS check memory before starting a task
 - ALWAYS test code changes when possible
 - ALWAYS log important decisions
+- ALWAYS call task_status before finishing
 - Use 'think' tool for complex reasoning
 - Keep memory organized and searchable
 - Write clear, concise documentation
@@ -131,13 +159,15 @@ async def main():
 - If a command fails, analyze the error and try alternative approaches
 - Log failures and solutions to help future tasks
 - Don't give up after first failure - iterate
+- If blocked after retries, call task_status with incomplete and explain
 
 Remember: Your memory persists across sessions. Build up knowledge!
 """,
         model_client=client,
         tools=[
             memory_tool,
             ThinkTool(),
+            TaskStatusTool(),
             *create_coding_tools(workspace=workspace, bash_timeout=60),
         ],
         max_iterations=50,  # Allow longer execution for complex tasks
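
The PHASE 5 protocol added to the system prompt above (report `complete` with the requirements met, or `incomplete` with what remains) can be sketched as a small validating function. This is not the picoagents `TaskStatusTool` implementation, whose signature is not shown in this diff. `TaskStatusReport` and `report_task_status` are hypothetical names, and the validation rules are assumptions inferred from the prompt text.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# The two status values named in the PHASE 5 prompt text.
VALID_STATUSES = ("complete", "incomplete")


@dataclass
class TaskStatusReport:
    """Hypothetical record of why an agent stopped (not a picoagents type)."""

    status: str
    rationale: str
    requirements_met: List[str] = field(default_factory=list)
    requirements_pending: List[str] = field(default_factory=list)


def report_task_status(
    status: str,
    rationale: str,
    requirements_met: Optional[List[str]] = None,
    requirements_pending: Optional[List[str]] = None,
) -> TaskStatusReport:
    """Validate and package a completion report.

    Assumed rules (inferred from the prompt, not from the library):
    - status must be "complete" or "incomplete"
    - rationale must be non-empty
    - a complete report has no pending requirements
    - an incomplete report lists at least one pending requirement
    """
    if status not in VALID_STATUSES:
        raise ValueError(f"status must be one of {VALID_STATUSES}, got {status!r}")
    if not rationale.strip():
        raise ValueError("rationale must explain why the agent is stopping")

    met = list(requirements_met or [])
    pending = list(requirements_pending or [])

    if status == "complete" and pending:
        raise ValueError("a complete task cannot have pending requirements")
    if status == "incomplete" and not pending:
        raise ValueError("an incomplete task must list what remains")

    return TaskStatusReport(status, rationale.strip(), met, pending)


if __name__ == "__main__":
    done = report_task_status(
        status="complete",
        rationale="divide() raises ValueError for a zero divisor; all tests passed",
        requirements_met=["4 functions", "error handling", "tests"],
    )
    print(done)

    blocked = report_task_status(
        status="incomplete",
        rationale="bash command timed out on every retry",
        requirements_pending=["tests"],
    )
    print(blocked)
```

Rejecting an `incomplete` report with an empty pending list is a design choice in this sketch: it forces the agent to say what is left to do, not only that it stopped.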