---
title: "Enterprise-Ready AI Workflows: Formatted Reports + 80% Cost Savings"
published: false
description: How Empathy Framework v3.3.0 gives you professional reports, cost guardrails, and persistent memory for production AI
tags: python, ai, claude, openai, llm
cover_image:
---

# Enterprise-Ready AI Workflows: Formatted Reports + 80% Cost Savings

Just shipped v3.3.0 of Empathy Framework with features I wish existed when I was running AI at scale:

1. **Formatted reports** for every workflow (finally, readable output)
2. **Cost guardrails** so your doc-gen doesn't blow $50 overnight
3. **File export** because 50k-character terminal limits are real

Here's what changed—and why it matters.

## The Problem with AI Workflows

Most AI libraries return raw JSON or unstructured text. Fine for prototypes. Terrible for:

- Reports you need to share with stakeholders
- Outputs you need to audit
- Results that exceed terminal/UI display limits

## The Solution: Formatted Reports for All Workflows

Every workflow in v3.3.0 now includes a `formatted_report` with consistent structure:

```python
from empathy_os.workflows import SecurityAuditWorkflow

workflow = SecurityAuditWorkflow()
result = await workflow.execute(code=your_code)

print(result.final_output["formatted_report"])
```

Output:

```
============================================================
SECURITY AUDIT REPORT
============================================================

Status: NEEDS_ATTENTION
Risk Score: 7.2/10
Vulnerabilities Found: 3

------------------------------------------------------------
CRITICAL FINDINGS
------------------------------------------------------------
- SQL injection in user_query() at line 42
- Hardcoded credentials in config.py
- Missing input validation in API handler

------------------------------------------------------------
RECOMMENDATIONS
------------------------------------------------------------
1. Use parameterized queries
2. Move secrets to environment variables
3. Add input sanitization layer

============================================================
```

This works across all 10 workflows: security-audit, code-review, perf-audit, doc-gen, test-gen, and more.
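
As a sketch of that consistency, here's the same pattern for code review (the `CodeReviewWorkflow` class name is assumed here to follow the `SecurityAuditWorkflow` naming convention; check your installed version for exact names):

```python
from empathy_os.workflows import CodeReviewWorkflow  # assumed name, per the workflow naming pattern

workflow = CodeReviewWorkflow()
result = await workflow.execute(code=your_code)

# Same consistent report structure as the security audit above
print(result.final_output["formatted_report"])
```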

## Enterprise Doc-Gen: Built for Large Projects

The doc-gen workflow got a major upgrade for enterprise use:

```python
from empathy_os.workflows import DocumentGenerationWorkflow

workflow = DocumentGenerationWorkflow(
    export_path="docs/generated",    # Auto-save to disk
    max_cost=5.0,                    # Stop at $5 (prevent runaway costs)
    chunked_generation=True,         # Handle large codebases
    graceful_degradation=True,       # Partial results on errors
)

result = await workflow.execute(
    source_code=your_large_codebase,
    doc_type="api_reference",
    audience="developers"
)

# Full docs saved to disk automatically
print(f"Saved to: {result.final_output['export_path']}")
```

### What's New

| Feature | What It Does |
|---------|--------------|
| **Auto-scaling tokens** | 2,000 tokens/section, scales up to 64k for large projects |
| **Chunked generation** | Generates in chunks of 3 sections to avoid truncation |
| **Cost guardrails** | Stops at a configurable limit ($5 by default) |
| **File export** | Saves the .md file and report to disk automatically |
| **Output chunking** | Splits large reports for terminal display |
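
Output chunking matters because long reports get truncated by terminals and UIs. If you need to split a report yourself, a minimal sketch (the 50k figure mirrors the terminal limit mentioned above; it is not a framework constant):

```python
# Split a long report into terminal-sized pieces (illustrative, not framework API)
report = result.final_output["formatted_report"]

CHUNK_SIZE = 50_000  # characters per chunk; tune for your terminal/UI
chunks = [report[i:i + CHUNK_SIZE] for i in range(0, len(report), CHUNK_SIZE)]

for n, chunk in enumerate(chunks, 1):
    print(f"--- part {n}/{len(chunks)} ---")
    print(chunk)
```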

## Cost Savings: 80-96%

Smart tier routing still saves 80-96% on API costs:

```python
from empathy_llm_toolkit import EmpathyLLM

llm = EmpathyLLM(provider="hybrid", enable_model_routing=True)

# Automatically routes each task type to the right model tier
await llm.interact(user_id="dev", task_type="summarize")     # → Haiku ($0.25/M)
await llm.interact(user_id="dev", task_type="fix_bug")       # → Sonnet ($3/M)
await llm.interact(user_id="dev", task_type="architecture")  # → Opus ($15/M)
```

**Real savings:**
- Without routing: $4.05/complex task
- With routing: $0.83/complex task
- **80% saved**
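
That 80% is straight arithmetic, a quick sanity check:

```python
# Savings from the per-task costs quoted above
without_routing = 4.05  # $/complex task, everything on the top tier
with_routing = 0.83     # $/complex task, tiered routing

savings = (without_routing - with_routing) / without_routing
print(f"{savings:.0%}")  # 80%
```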

## Persistent Memory

Your AI remembers across sessions:

```python
llm = EmpathyLLM(provider="anthropic", memory_enabled=True)

# Preference survives across sessions
response = await llm.interact(
    user_id="dev_123",
    user_input="I prefer Python with type hints"
)
```

Next session—even days later—it remembers.
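
A sketch of what that later session looks like (assumes the same `user_id` and a persistent memory store behind `memory_enabled=True`):

```python
# Days later, in a fresh process: same user_id, no restated preferences
llm = EmpathyLLM(provider="anthropic", memory_enabled=True)

response = await llm.interact(
    user_id="dev_123",
    user_input="Write a function that parses a CSV file"
)
# The response should come back type-hinted, per the stored preference
```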

## Quick Start

```bash
# Install
pip install empathy-framework==3.3.0

# Configure provider
python -m empathy_os.models.cli provider --set anthropic

# See all commands
empathy cheatsheet
```

## What's in v3.3.0

- **Formatted Reports** — Consistent output across all 10 workflows
- **Enterprise Doc-Gen** — Auto-scaling, cost guardrails, file export
- **Output Chunking** — Large reports split for display
- **Smart Router** — Natural language wizard dispatch
- **Memory Graph** — Cross-wizard knowledge sharing

---

*What would you build with enterprise-ready AI workflows that cost 80% less?*