nshkrdotcom
diff --git a/‎README.md‎
Lines changed: 2 additions & 0 deletions b/‎README.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎docs/20250704_yaml_format_v2/01_complete_schema_reference.md‎
Lines changed: 2 additions & 0 deletions b/‎docs/20250704_yaml_format_v2/01_complete_schema_reference.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎docs/20250704_yaml_format_v2/02_step_types_reference.md‎
Lines changed: 56 additions & 3 deletions b/‎docs/20250704_yaml_format_v2/02_step_types_reference.md‎
Lines changed: 56 additions & 3 deletions
diff --git a/‎docs/20250704_yaml_format_v2/06_advanced_features.md‎
Lines changed: 3 additions & 0 deletions b/‎docs/20250704_yaml_format_v2/06_advanced_features.md‎
Lines changed: 3 additions & 0 deletions
diff --git a/‎docs/20250704_yaml_format_v2/08_best_practices_patterns.md‎
Lines changed: 60 additions & 0 deletions b/‎docs/20250704_yaml_format_v2/08_best_practices_patterns.md‎
Lines changed: 60 additions & 0 deletions
diff --git a/‎docs/20250704_yaml_format_v2/09_migration_guide.md‎
Lines changed: 10 additions & 2 deletions b/‎docs/20250704_yaml_format_v2/09_migration_guide.md‎
Lines changed: 10 additions & 2 deletions
diff --git a/‎docs/20250704_yaml_format_v2/10_quick_reference.md‎
Lines changed: 29 additions & 0 deletions b/‎docs/20250704_yaml_format_v2/10_quick_reference.md‎
Lines changed: 29 additions & 0 deletions
diff --git a/‎examples/model_selection_demo.yaml‎
Lines changed: 126 additions & 0 deletions b/‎examples/model_selection_demo.yaml‎
Lines changed: 126 additions & 0 deletions
@@ -163,6 +163,7 @@ Application.put_env(:pipeline, :test_mode, true)
 
 ### 🤖 AI Integration
 - 🤖 **Multi-AI Integration**: Chain Claude and Gemini APIs together
+- 💰 **Model Selection & Cost Control**: Choose between Claude models (sonnet ~$0.01 vs opus ~$0.26 per query)
 - 🔄 **Flexible Execution Modes**: Mock, Live, and Mixed modes for testing
 - 📋 **YAML Workflow Configuration**: Define complex multi-step workflows
 - 🎯 **Structured Output**: JSON-based responses with proper error handling
@@ -171,6 +172,7 @@ Application.put_env(:pipeline, :test_mode, true)
 
 ### ⚡ Advanced Features
 - **Enhanced Claude Steps**: Smart presets, sessions, extraction, batch processing, robust error handling
+- **Model Selection**: Automatic cost optimization (development=sonnet, production=opus+fallback, analysis=opus)
 - **Genesis Pipeline**: Self-improving AI system that generates other pipelines
 - **Session Management**: Persistent conversations with automatic checkpointing
 - **Fault Tolerance**: Retry mechanisms, circuit breakers, graceful degradation
 
@@ -80,6 +80,8 @@ defaults:
   # Model configuration
   gemini_model: string            # Default Gemini model
   claude_preset: enum             # Default Claude preset
+  claude_model: string            # Default Claude model ("sonnet", "opus", specific version)
+  claude_fallback_model: string   # Default Claude fallback model
 
   # Token configuration
   gemini_token_budget:
 
@@ -103,6 +103,10 @@ Pipeline supports 17+ distinct step types organized into four categories:
     output_format: "json"         # Response format
     verbose: true                 # Detailed logging
     
+    # Model selection
+    model: "sonnet"               # Model choice: "sonnet", "opus", or specific version
+    fallback_model: "sonnet"      # Fallback when primary model overloaded
+    
     # Tool permissions
     allowed_tools: ["Write", "Edit", "Read", "Bash", "Search"]
     disallowed_tools: ["Delete"]  # Explicitly forbidden
@@ -151,6 +155,8 @@ Pipeline supports 17+ distinct step types organized into four categories:
 - Session management for continuity
 - Comprehensive error handling
 - Cost tracking and telemetry
+- Model selection for cost optimization (25x savings: sonnet vs opus)
+- Fallback model support for reliability
 
 ### Claude Smart
 
@@ -177,16 +183,63 @@ Pipeline supports 17+ distinct step types organized into four categories:
 ```
 
 **Available Presets**:
-- `development`: Permissive settings, full tool access, verbose logging
-- `production`: Restricted tools, optimized for safety and performance
-- `analysis`: Read-only tools, optimized for code analysis
+- `development`: Permissive settings, full tool access, verbose logging (uses sonnet - cost-effective)
+- `production`: Restricted tools, optimized for safety and performance (uses opus with sonnet fallback)
+- `analysis`: Read-only tools, optimized for code analysis (uses opus - best capability)
 - `chat`: Simple conversation mode, basic tools
 
 **Key Features**:
 - Automatic configuration based on environment
 - Preset-specific optimizations
 - Simplified configuration
 - Intelligent defaults
+- Built-in model selection for cost optimization
+
+## Model Selection & Cost Control
+
+All Claude step types support model selection for cost optimization and performance tuning:
+
+### Model Options
+
+```yaml
+claude_options:
+  # Simple shortcuts (recommended)
+  model: "sonnet"               # Fast, cost-effective (~$0.01 per query)
+  model: "opus"                 # Highest quality (~$0.26 per query, 25x more expensive)
+  
+  # Specific model versions (for reproducibility)
+  model: "claude-3-5-sonnet-20241022"
+  model: "claude-3-opus-20240229"
+  
+  # Production reliability with fallback
+  model: "opus"
+  fallback_model: "sonnet"      # Falls back when opus overloaded
+```
+
+### Cost Optimization Examples
+
+```yaml
+# Development workflow - cost-effective
+- name: "dev_task"
+  type: "claude_smart"
+  preset: "development"         # Automatically uses sonnet
+  
+# Production workflow - quality + reliability  
+- name: "prod_task"
+  type: "claude_smart"
+  preset: "production"          # Uses opus with sonnet fallback
+  
+# Manual cost control
+- name: "simple_task"
+  type: "claude"
+  claude_options:
+    model: "sonnet"             # 25x cheaper for simple tasks
+    
+- name: "complex_analysis"
+  type: "claude"
+  claude_options:
+    model: "opus"               # Worth the cost for complex work
+```
 
 ### Claude Session
 
 
@@ -726,6 +726,8 @@ Maintain conversation state:
     description: "Developing authentication feature"
   
   claude_options:
+    model: "opus"                     # High-quality for complex sessions
+    fallback_model: "sonnet"          # Fallback for reliability
     max_turns: 20
     allowed_tools: ["Write", "Edit", "Read", "Bash"]
   
@@ -895,6 +897,7 @@ Execute operations concurrently:
 
   task_template:
     claude_options:
+      model: "sonnet"                 # Cost-effective for batch processing
       max_turns: 5
       allowed_tools: ["Read"]
     prompt:
 
@@ -432,6 +432,66 @@ workflow:
 
 ## Performance Optimization
 
+### Model Selection for Cost Optimization
+
+Choose appropriate models for different task complexities:
+
+```yaml
+# GOOD: Cost-optimized workflow
+workflow:
+  name: "smart_code_review"
+  
+  steps:
+    # Simple tasks - use cost-effective model
+    - name: "syntax_check"
+      type: "claude"
+      claude_options:
+        model: "sonnet"         # ~$0.01 per query
+        max_turns: 1
+      prompt:
+        - type: "static"
+          content: "Check for basic syntax errors"
+    
+    # Complex analysis - use high-quality model
+    - name: "security_audit"
+      type: "claude"
+      claude_options:
+        model: "opus"           # ~$0.26 per query (25x more expensive)
+        fallback_model: "sonnet" # Fallback when overloaded
+        max_turns: 5
+      prompt:
+        - type: "static"
+          content: "Perform comprehensive security analysis"
+    
+    # Use smart presets for automatic selection
+    - name: "development_task"
+      type: "claude_smart"
+      preset: "development"     # Automatically uses sonnet
+      
+    - name: "production_task"
+      type: "claude_smart"
+      preset: "production"      # Uses opus + sonnet fallback
+```
+
+### Model Selection Best Practices
+
+1. **Development workflows**: Use `sonnet` for cost-effective iteration
+2. **Production workflows**: Use `opus` with `sonnet` fallback for reliability
+3. **Analysis tasks**: Use `opus` for best capability
+4. **Simple tasks**: Always use `sonnet` to minimize costs
+5. **Batch processing**: Consider cost per query when processing large datasets
+
+```yaml
+# Cost comparison example
+defaults:
+  # Development environment - optimize for cost
+  claude_model: "sonnet"        # $0.01/query × 100 queries = $1.00
+  
+  # Production environment - optimize for quality + reliability
+  claude_model: "opus"          # $0.26/query × 100 queries = $26.00
+  claude_fallback_model: "sonnet"
+```
+
 ### Lazy Loading Strategy
 
 Load resources only when needed:
 
@@ -42,14 +42,22 @@ This guide helps you migrate existing Pipeline YAML v1 configurations to the v2
    type: "claude_robust"   # Enterprise error handling
    ```
 
-3. **Advanced Control Flow**
+3. **Model Selection & Cost Control**
+   ```yaml
+   claude_options:
+     model: "sonnet"         # Cost-effective (~$0.01/query)
+     model: "opus"           # High-quality (~$0.26/query)
+     fallback_model: "sonnet" # Reliability fallback
+   ```
+
+4. **Advanced Control Flow**
    ```yaml
    type: "for_loop"     # Iteration
    type: "while_loop"   # Conditional loops
    type: "switch"       # Multi-branch logic
    ```
 
-4. **Data Operations**
+5. **Data Operations**
    ```yaml
    type: "data_transform"   # JSONPath transformations
    type: "file_ops"        # File manipulation
 
@@ -18,6 +18,8 @@ workflow:
   # Defaults for all steps
   defaults:
     gemini_model: "gemini-2.5-flash"
+    claude_model: "sonnet"              # Model selection: sonnet|opus|specific_version
+    claude_fallback_model: "sonnet"     # Fallback for reliability
     timeout_seconds: 300
 
   # Step definitions
@@ -315,6 +317,33 @@ output_schema:
             maximum: 100
 ```
 
+## Model Selection & Cost Control
+
+```yaml
+# Cost-effective development (sonnet ~$0.01/query)
+- type: "claude"
+  claude_options:
+    model: "sonnet"
+
+# High-quality production (opus ~$0.26/query, 25x more expensive)
+- type: "claude"
+  claude_options:
+    model: "opus"
+    fallback_model: "sonnet"    # Fallback for reliability
+
+# Smart presets with automatic model selection
+- type: "claude_smart"
+  preset: "development"         # Uses sonnet (cost-effective)
+  preset: "production"          # Uses opus + sonnet fallback
+  preset: "analysis"            # Uses opus (best capability)
+
+# Specific model versions for reproducibility
+- type: "claude"
+  claude_options:
+    model: "claude-3-5-sonnet-20241022"
+    fallback_model: "claude-3-opus-20240229"
+```
+
 ## Environment Modes
 
 ```yaml
 
@@ -0,0 +1,126 @@
+workflow:
+  name: "model_selection_demo"
+  description: "Demonstrate Claude model selection for cost optimization"
+  version: "2.0"
+  
+  defaults:
+    claude_preset: "development"
+    output_dir: "./outputs/model_selection_demo"
+    
+  steps:
+    # Example 1: Cost-effective development workflow
+    - name: "simple_code_review"
+      type: "claude_smart"
+      preset: "development"              # Uses sonnet automatically (cost-effective)
+      
+      prompt:
+        - type: "static"
+          content: |
+            Review this simple function for basic issues:
+            
+            ```python
+            def add_numbers(a, b):
+                return a + b
+            ```
+            
+            Just check for basic syntax and style issues.
+            
+      output_to_file: "simple_review.json"
+      
+    # Example 2: High-quality analysis workflow  
+    - name: "complex_architecture_analysis"
+      type: "claude_smart" 
+      preset: "analysis"                 # Uses opus automatically (best capability)
+      
+      prompt:
+        - type: "static"
+          content: |
+            Analyze this complex microservices architecture for:
+            - Security vulnerabilities
+            - Performance bottlenecks  
+            - Scalability issues
+            - Design pattern violations
+            - Technical debt
+            
+            Provide detailed recommendations with specific implementation steps.
+            
+      output_to_file: "architecture_analysis.json"
+      
+    # Example 3: Production workflow with fallback
+    - name: "production_deployment"
+      type: "claude_smart"
+      preset: "production"               # Uses opus with sonnet fallback
+      
+      prompt:
+        - type: "static"
+          content: |
+            Generate a production deployment checklist for a critical banking application.
+            Include security checks, rollback procedures, and monitoring setup.
+            
+      output_to_file: "deployment_checklist.json"
+      
+    # Example 4: Manual model selection - cost optimization
+    - name: "batch_documentation"
+      type: "claude"
+      
+      claude_options:
+        model: "sonnet"                  # Explicit cost-effective choice
+        max_turns: 3
+        allowed_tools: ["Read", "Write"]
+        
+      prompt:
+        - type: "static"
+          content: |
+            Generate basic documentation for these functions.
+            Keep it simple and concise.
+            
+      output_to_file: "basic_docs.md"
+      
+    # Example 5: Manual model selection - high quality  
+    - name: "critical_security_audit"
+      type: "claude"
+      
+      claude_options:
+        model: "opus"                    # Explicit high-quality choice
+        fallback_model: "sonnet"         # Fallback for reliability
+        max_turns: 10
+        allowed_tools: ["Read", "Glob", "Grep"]
+        
+      prompt:
+        - type: "static"
+          content: |
+            Perform a comprehensive security audit of the entire codebase.
+            Look for:
+            - SQL injection vulnerabilities
+            - XSS attack vectors
+            - Authentication bypass issues
+            - Data exposure risks
+            - Cryptographic weaknesses
+            
+            Provide detailed findings with remediation steps.
+            
+      output_to_file: "security_audit.json"
+      
+    # Example 6: Cost comparison demonstration
+    - name: "cost_comparison_summary"
+      type: "gemini"
+      model: "gemini-2.5-flash"
+      
+      prompt:
+        - type: "static"
+          content: |
+            Summarize the cost implications of this pipeline:
+            
+            - simple_code_review: Used sonnet (~$0.01)
+            - complex_architecture_analysis: Used opus (~$0.26)  
+            - production_deployment: Used opus with fallback (~$0.26)
+            - batch_documentation: Used sonnet (~$0.01)
+            - critical_security_audit: Used opus (~$0.26)
+            
+            Total estimated cost: ~$0.80
+            Cost if everything used opus: ~$1.30 (63% more expensive)
+            Cost if everything used sonnet: ~$0.05 (94% cheaper, but lower quality for complex tasks)
+            
+            Explain the cost optimization strategy.
+            
+      output_to_file: "cost_analysis.json"