docs: add WHY_SIARE comparison guide

danielbentes · claude · danielbentes · commit b6105583ef90 · 2025-12-17T18:51:00.000+01:00
- Technology positioning matrix comparing SIARE vs alternatives - Detailed comparison with LangChain, DSPy, AutoRAG, Self-RAG, LangGraph - Six mutation types explained with scope and description - Quality-Diversity benefits and when to use SIARE - Updated docs README with new page in navigation Extracted from competitive analysis, focusing on technical capabilities rather than business strategy. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
diff --git a/docs/README.md b/docs/README.md
@@ -10,6 +10,7 @@ Welcome to the SIARE (Self-Improving Agentic RAG Engine) documentation.
 
 | Goal | Guide |
 |------|-------|
+| Understand what makes SIARE unique | [Why SIARE?](WHY_SIARE.md) |
 | Run your first pipeline | [Quick Start](QUICKSTART.md) |
 | Configure SIARE | [Configuration](CONFIGURATION.md) |
 | Fix common issues | [Troubleshooting](TROUBLESHOOTING.md) |
@@ -43,6 +44,7 @@ Welcome to the SIARE (Self-Improving Agentic RAG Engine) documentation.
 ```
 docs/
 ├── Getting Started
+│   ├── WHY_SIARE.md               # What makes SIARE unique
 │   ├── QUICKSTART.md              # 10-minute first run
 │   ├── CONFIGURATION.md           # All settings explained
 │   └── TROUBLESHOOTING.md         # Common issues
diff --git a/docs/WHY_SIARE.md b/docs/WHY_SIARE.md
@@ -0,0 +1,174 @@
+# Why SIARE?
+
+**How SIARE compares to other RAG frameworks and what makes it unique.**
+
+## The Problem with Manual RAG Tuning
+
+Building RAG systems today means endless iteration:
+- Tweak prompts → benchmark → repeat
+- Try different chunking strategies → benchmark → repeat
+- Adjust retrieval parameters → benchmark → repeat
+- Add more agents → debug interactions → repeat
+
+This process is **expensive**, **brittle**, and **never-ending** as your data and requirements change.
+
+## How SIARE is Different
+
+SIARE is the first RAG engine that treats pipeline configuration as **evolvable genetic material**. Instead of manual tuning, SIARE uses AI-driven evolution to automatically discover optimal strategies.
+
+### What SIARE Does That Others Don't
+
+| Capability | Description |
+|------------|-------------|
+| **SOP-as-Genome** | Treats entire multi-agent pipeline configurations as evolvable genetic material with `PromptGenome`, mutation operators, and ancestry tracking |
+| **Quality-Diversity Optimization** | Maintains Pareto frontier across multiple metrics and QD Grid for behavioral diversity—prevents convergence to single local optimum |
+| **Multi-Agent Topology Evolution** | Evolves agent roles, graph structure, tool assignments, and inter-agent communication patterns—not just prompts |
+| **Plugin-Based Prompt Evolution** | Adaptive strategy selection combining TextGrad, EvoPrompt, and MetaPrompt approaches |
+| **GenePool with Ancestry** | Full lineage tracking enabling "breeding" of successful configurations |
+| **Constraint-Aware Evolution** | Safety-first design with `validate_constraints()` before mutations |
+
+---
+
+## Comparison with Alternatives
+
+### RAG Frameworks (LangChain, LlamaIndex, Haystack)
+
+These frameworks provide excellent **building blocks** for RAG but require manual configuration:
+
+- **What they do well**: Modular components, tool integrations, document processing
+- **What they don't do**: Autonomous pipeline evolution, systematic optimization, diversity maintenance
+
+**SIARE's approach**: Use these frameworks as adapters within SIARE, then let evolution optimize the pipeline configuration.
+
+### Prompt Optimization (DSPy)
+
+DSPy pioneered treating prompts as tunable parameters with automated optimization:
+
+- **What it does well**: Bayesian optimization of prompts, few-shot selection
+- **What it doesn't do**: Multi-agent topology evolution, Quality-Diversity optimization, pipeline structure mutation
+
+**SIARE's approach**: DSPy optimizes prompts within a fixed pipeline; SIARE evolves the entire pipeline including topology.
+
+### AutoML for RAG (AutoRAG, RAGSmith)
+
+These tools automate hyperparameter search:
+
+- **What they do well**: Grid search over configurations, benchmark evaluation
+- **What they don't do**: Quality-Diversity (they seek single optimum), multi-agent orchestration, ancestry tracking
+
+**SIARE's approach**: Evolutionary algorithms with QD optimization maintain diverse solutions, not just the single best.
+
+### Self-Improving RAG (Self-RAG, Adaptive-RAG)
+
+Research on runtime adaptation:
+
+- **What they do well**: Dynamic retrieval decisions, self-critique mechanisms
+- **What they don't do**: Design-time evolution, population-based search, pipeline mutation
+
+**SIARE's approach**: Evolves pipelines at design-time; these approaches adapt at runtime. Complementary, not competing.
+
+### Multi-Agent Frameworks (LangGraph, AutoGen, CrewAI)
+
+Multi-agent orchestration platforms:
+
+- **What they do well**: Agent collaboration, stateful workflows, role-based systems
+- **What they don't do**: Autonomous evolution of agent topologies, QD optimization, prompt genome evolution
+
+**SIARE's approach**: Define agents as evolvable roles; let evolution discover optimal team structures.
+
+---
+
+## Technology Positioning Matrix
+
+| Capability | LangChain | DSPy | AutoRAG | Self-RAG | LangGraph | **SIARE** |
+|-----------|-----------|------|---------|----------|-----------|-----------|
+| Multi-Agent Orchestration | ✅ | ❌ | ❌ | ❌ | ✅ | ✅ |
+| Prompt Evolution | ❌ | ✅ | ❌ | ❌ | ❌ | ✅ |
+| Pipeline Evolution | ❌ | ❌ | ✅ | ❌ | ❌ | ✅ |
+| Quality-Diversity | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ |
+| Evolutionary Algorithms | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ |
+| Topology Mutation | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ |
+| Ancestry Tracking | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ |
+| Constraint Validation | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ |
+| SOP Versioning | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ |
+
+---
+
+## SIARE's Six Mutation Types
+
+SIARE evolves pipelines through six mutation operators:
+
+| Mutation | Scope | Description |
+|----------|-------|-------------|
+| `PROMPT_CHANGE` | Prompts | Modify role prompts using TextGrad, EvoPrompt, or MetaPrompt |
+| `PARAM_TWEAK` | Parameters | Adjust temperature, max_tokens, top_k, etc. |
+| `ADD_ROLE` | Topology | Introduce new specialist agent |
+| `REMOVE_ROLE` | Topology | Remove underperforming agent |
+| `REWIRE_GRAPH` | Topology | Change edge connections between agents |
+| `CROSSOVER` | Structure | Combine successful elements from two SOPs |
+
+---
+
+## Quality-Diversity: Why Diversity Matters
+
+Traditional optimization finds **one best solution**. But RAG has multiple valid objectives:
+- High accuracy vs. low latency
+- Comprehensive answers vs. concise answers
+- High recall vs. high precision
+
+**Quality-Diversity optimization** maintains a diverse archive of high-performing solutions along the Pareto frontier. This means:
+
+1. **No local optima trap**: Multiple search directions explored simultaneously
+2. **User choice**: Pick the solution that best fits your specific tradeoffs
+3. **Robustness**: If requirements change, alternative solutions already exist
+4. **Insight**: Understanding the capability frontier reveals fundamental tradeoffs
+
+---
+
+## When to Use SIARE
+
+**SIARE is ideal when:**
+- You need to optimize across multiple metrics (accuracy, cost, latency)
+- Your domain is complex with many possible pipeline configurations
+- Manual tuning has plateaued or is too expensive
+- You want to maintain diverse solutions for different use cases
+- Your requirements evolve and pipelines need to adapt
+
+**SIARE may be overkill when:**
+- You have a simple, well-understood retrieval task
+- A single metric dominates (pure accuracy, no cost/latency concerns)
+- Your pipeline is fixed by external constraints
+- You need a quick prototype without evolution
+
+---
+
+## Getting Started
+
+Ready to stop tuning and start evolving?
+
+```bash
+pip install siare
+
+# Initialize a project
+siare init
+
+# Run evolution
+siare evolve --generations 10 --metric quality
+
+# Query your evolved pipeline
+siare run "How do I reset my password?"
+```
+
+See the [Quick Start Guide](QUICKSTART.md) for detailed setup instructions.
+
+---
+
+## Learn More
+
+- [System Architecture](architecture/SYSTEM_ARCHITECTURE.md) — How SIARE works internally
+- [Custom Extensions](guides/custom-extensions.md) — Add your own metrics and tools
+- [Use Cases](guides/USE_CASES.md) — Domain implementation patterns
+
+---
+
+*SIARE: Stop tuning. Start evolving.*