Commit 5d9d9a1

ihwooclaude authored and committed

Add automatic conversation logging and sleep cycle memory extraction v0.6.0

All conversation turns are now auto-logged to SQLite, and high-value turns are instantly extracted to ChromaDB. The sleep cycle batch-processes missed memories using a progressive RL extraction pipeline (heuristic → RL).

New files: conversation_log.py, extraction.py
Modified: bridge.py, sleep_cycle.py, graph_store.py, config.py, server.py

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

1 parent 794c56a commit 5d9d9a1

File tree: 11 files changed (+844 −12 lines)

CHANGELOG.md — 23 additions, 0 deletions

@@ -5,6 +5,29 @@ All notable changes to this project will be documented in this file.
 
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
+## [0.6.0] - 2026-03-03
+
+### Added
+
+- **Automatic conversation logging** — All conversation turns are recorded to a SQLite append-only log (`conversation_log.db`) with WAL mode for concurrent safety
+- **Dual-saving in `auto_search`** — Every turn is logged to SQLite for batch processing, and high-value turns (matching personal/preference/tech/emotion patterns) are instantly extracted to ChromaDB
+- **Memory extraction pipeline** — New `extraction.py` module with three extractors:
+  - `HeuristicMemoryExtractor` — Pattern-matching based (reuses `extract_keywords()` / `classify_category()`)
+  - `RLMemoryExtractor` — MLP bandit for EXTRACT/SKIP binary decisions with imitation learning
+  - `ProgressiveExtraction` — Manages the transition: `heuristic_only` → `rl_assisted` → `rl_primary`
+- **Sleep cycle extraction task** (Task 0) — Batch-processes unprocessed conversation logs to catch memories missed by real-time heuristics, with deduplication (similarity ≥ 0.90)
+- **Sleep cycle log cleanup** (Task 5) — Deletes processed logs older than 30 days (configurable)
+- **Auto category classification** — `memory_save` default category changed to `"auto"`, which auto-classifies content using pattern matching
+- **`extraction_source` metadata** — Tracks how each memory was created: `"heuristic"`, `"rl"`, `"auto"`, or `""` (manual)
+- **6 new extraction config options** in `SleepCycleConfig`: `enable_memory_extraction`, `extraction_max_turns`, `extraction_dedup_threshold`, `extraction_min_info_density`, `extraction_rl_confidence_threshold`, `log_retention_days`
+
+### Changed
+
+- `SleepCycleRunner` now accepts an optional `conversation_log` parameter
+- `SleepCycleReport` includes `extraction` and `log_cleanup_deleted` fields
+- `MemoryNode` includes an `extraction_source` field
+- `GraphMemoryStore.add_memory()` accepts an `extraction_source` parameter
+
 ## [0.3.0] - 2026-03-01
 
 ### Changed
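The 0.6.0 notes describe a progressive handoff between extractors (`heuristic_only` → `rl_assisted` → `rl_primary`) but `extraction.py` itself is not shown in this commit. A minimal, self-contained sketch of such a staged transition, where the stage advances as the RL bandit accumulates decisions — the `assist_after`/`primary_after` thresholds are hypothetical placeholders, not values from the codebase:

```python
def extraction_stage(rl_decisions: int,
                     assist_after: int = 100,
                     primary_after: int = 500) -> str:
    """Return which extractor leads, given how many RL decisions have been seen.

    Hypothetical thresholds: the heuristic runs alone at first, the RL bandit
    assists once it has some history, and eventually takes the lead.
    """
    if rl_decisions >= primary_after:
        return "rl_primary"
    if rl_decisions >= assist_after:
        return "rl_assisted"
    return "heuristic_only"


# A fresh pipeline starts heuristic-only and graduates with experience.
assert extraction_stage(0) == "heuristic_only"
assert extraction_stage(250) == "rl_assisted"
assert extraction_stage(1000) == "rl_primary"
```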

README.md — 13 additions, 9 deletions

@@ -20,6 +20,7 @@ Current AI memory tools have two critical problems:
 | Problem | How we solve it |
 |---------|----------------|
 | **Manual retrieval** — you must ask "do you remember X?" | `auto_search` runs every turn, injecting relevant memories automatically |
+| **Missed memories** — AI decides what to save, so experiences/stories get lost | Every turn is auto-logged; sleep cycle extracts what the AI missed |
 | **Token waste** — entire memory dump inserted into context | Multi-resolution composer selects top-K memories within a token budget |
 
 ## Key Features

@@ -30,8 +31,11 @@ Current AI memory tools have two critical problems:
 - **GraphRAG hybrid retrieval** — Vector similarity + graph traversal, fused and re-ranked by an RL re-ranker
 - **Auto-linking** — New memories automatically link to similar existing ones (similarity ≥ 0.92)
 - **Multi-resolution text** — Full text → summary → entity triples, composed within token budget
+- **Automatic conversation logging** — All turns recorded to SQLite; high-value turns instantly extracted to ChromaDB
+- **Sleep cycle memory extraction** — Batch-processes missed memories from conversation logs using progressive RL extraction
+- **Auto category classification** — `memory_save` auto-classifies content category from patterns
 - **Forgetting pipeline** — Decay-based aging with consolidation, pinning, and immutable protection
-- **Sleep cycle** — Periodic maintenance: dedup, compress, forget, checkpoint
+- **Sleep cycle** — Periodic maintenance: extraction, dedup, compress, forget, checkpoint
 - **Live graph** — Real-time WebSocket visualization of the memory graph
 - **Multilingual** — Korean and English pattern support out of the box

@@ -215,7 +219,7 @@ Open `http://127.0.0.1:8765` in a browser. Requires the `[live]` extra (`pip ins
 | `memory_pin` / `memory_unpin` | Protect memories from forgetting |
 | `memory_stats` | Total count and category breakdown |
 | `memory_visualize` | Generate interactive graph HTML |
-| `sleep_cycle_run` | Trigger maintenance (consolidation + forgetting + checkpoint) |
+| `sleep_cycle_run` | Trigger maintenance (extraction + consolidation + forgetting + checkpoint) |
 | `policy_status` | RL policy state (epsilon, action distribution, updates) |
 | `policy_decide` | Ask the RL policy for a SAVE/SKIP/RETRIEVE decision with reasoning |

@@ -243,7 +247,7 @@ All settings via environment variables:
 ```
 ┌─────────────────────────────────────────────────┐
 │                   MCP Client                    │
-│        (Claude Desktop / Claude Code)           │
+│   (Claude Desktop / Claude Code / OpenClaw)     │
 └────────────────────┬────────────────────────────┘
                      │ stdio (JSON-RPC)
 ┌────────────────────▼────────────────────────────┐

@@ -254,12 +258,12 @@ All settings via environment variables:
 │ RL Policy│ Retrieval│ Storage  │ Maintenance    │
 │          │          │          │                │
 │ Rule-    │ ChromaDB │ Graph    │ Sleep Cycle    │
-│ Based +  │ vector + │ Memory   │ (consolidation,│
-│ MLP      │ Knowledge│ Store    │  forgetting,   │
-│ Bandit   │ Graph    │          │  checkpoints)  │
-│          │(GraphRAG)│          │                │
-│ Re-ranker│          │          │                │
-│ (11d MLP)│          │          │                │
+│ Based +  │ vector + │ Memory   │ (extraction,   │
+│ MLP      │ Knowledge│ Store    │  consolidation,│
+│ Bandit   │ Graph    │          │  forgetting,   │
+│          │(GraphRAG)│          │  checkpoints)  │
+│ Re-ranker│          │ SQLite   │                │
+│ (11d MLP)│          │ Conv Log │ Extraction RL  │
 └──────────┴──────────┴──────────┴────────────────┘
                      ↕ WebSocket (cross-process)
 ┌──────────────────────────────────────────────────┐

pyproject.toml — 1 addition, 1 deletion

@@ -1,6 +1,6 @@
 [project]
 name = "long-term-memory"
-version = "0.5.1"
+version = "0.6.0"
 description = "Long-term memory system for AI assistants — persistent, searchable, self-organizing memory powered by semantic search, knowledge graphs, and reinforcement learning"
 requires-python = ">=3.11,<3.14"
 license = "MIT"

src/aimemory/__init__.py — 1 addition, 1 deletion

@@ -1,3 +1,3 @@
 """Long-Term Memory System for AI assistants."""
 
-__version__ = "0.4.2"
+__version__ = "0.6.0"

src/aimemory/config.py — 8 additions, 0 deletions

@@ -183,6 +183,14 @@ class SleepCycleConfig(BaseModel):
     checkpoint_dir: str = "checkpoints/sleep_cycle"
     report_dir: str = "data/reports/sleep_cycle"
 
+    # Memory extraction from conversation logs
+    enable_memory_extraction: bool = True
+    extraction_max_turns: int = 500
+    extraction_dedup_threshold: float = 0.90
+    extraction_min_info_density: float = 0.1
+    extraction_rl_confidence_threshold: int = 50
+    log_retention_days: int = 30
+
 
 class ComposerConfig(BaseModel):
     """Context composer configuration."""
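A sketch of how the new `SleepCycleConfig` knobs could gate batch extraction. The field names and defaults are taken from the diff; the `should_extract` gate itself is hypothetical (the real logic lives in `extraction.py` and `sleep_cycle.py`, not shown here), and a plain dataclass stands in for the pydantic model:

```python
from dataclasses import dataclass


@dataclass
class ExtractionConfig:
    # Defaults mirror the SleepCycleConfig additions above.
    enable_memory_extraction: bool = True
    extraction_max_turns: int = 500
    extraction_dedup_threshold: float = 0.90
    extraction_min_info_density: float = 0.1
    extraction_rl_confidence_threshold: int = 50
    log_retention_days: int = 30


def should_extract(cfg: ExtractionConfig,
                   info_density: float,
                   best_similarity: float) -> bool:
    """Skip extraction when disabled, too low-information, or a near-duplicate."""
    if not cfg.enable_memory_extraction:
        return False
    if info_density < cfg.extraction_min_info_density:
        return False
    # A match at or above the dedup threshold counts as already stored.
    return best_similarity < cfg.extraction_dedup_threshold


cfg = ExtractionConfig()
assert should_extract(cfg, info_density=0.5, best_similarity=0.3)      # worth keeping
assert not should_extract(cfg, info_density=0.05, best_similarity=0.3)  # too sparse
assert not should_extract(cfg, info_density=0.5, best_similarity=0.95)  # near-duplicate
```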

src/aimemory/mcp/bridge.py — 56 additions, 0 deletions

@@ -5,11 +5,14 @@
 import logging
 import math
 import os
+import uuid as _uuid
+from pathlib import Path
 from typing import Any
 
 from aimemory.config import MCPServerConfig
 from aimemory.live_graph.notify import notify_live_graph
 from aimemory.memory.composer import ContextComposer
+from aimemory.memory.conversation_log import ConversationLog
 from aimemory.memory.graph_store import GraphMemoryStore, ImmutableMemoryError, MemoryNode
 from aimemory.memory.sleep_cycle import SleepCycleRunner
 from aimemory.online.policy import MemoryPolicyAgent, OnlinePolicy, StateEncoder

@@ -173,9 +176,21 @@ def __init__(
             reranker=self._reranker,
         )
 
+        # Conversation log for automatic turn recording
+        log_db_path = Path(self._persist_directory) / "conversation_log.db"
+        self._conversation_log = ConversationLog(log_db_path)
+        self._conversation_id = _uuid.uuid4().hex[:16]
+        self._turn_counter = 0
+
+        # Heuristic extractor for real-time extraction in auto_search
+        from aimemory.memory.extraction import HeuristicMemoryExtractor
+
+        self._heuristic_extractor = HeuristicMemoryExtractor()
+
         self._sleep_runner = SleepCycleRunner(
             store=self._store,
             policy=self._policy,
+            conversation_log=self._conversation_log,
         )
 
         # Track recent policy actions for status reporting

@@ -301,9 +316,50 @@ def auto_search(
         """Search for relevant memories and compose a context string.
 
         Returns dict with context string, memory count, token count, and details.
+        Also performs dual-saving: SQLite log + heuristic instant extraction.
         """
         import random
 
+        # ── Dual saving: record turn + heuristic instant extraction ──
+        try:
+            self._turn_counter += 1
+            # 1. Always log to SQLite (for sleep cycle batch processing)
+            self._conversation_log.append_turn(
+                conversation_id=self._conversation_id,
+                turn_index=self._turn_counter,
+                role="user",
+                content=user_message,
+            )
+
+            # 2. Heuristic instant filter: extract high-value turns to ChromaDB immediately
+            if len(user_message.strip()) > 20:
+                candidate = self._heuristic_extractor.evaluate(user_message, role="user")
+                if candidate.should_extract:
+                    # Dedup check before saving
+                    existing = self._store.search(user_message, top_k=1, track_access=False)
+                    is_dup = (
+                        existing
+                        and existing[0].similarity_score is not None
+                        and existing[0].similarity_score >= 0.90
+                    )
+                    if not is_dup:
+                        content = user_message[:300].strip()
+                        self._store.add_memory(
+                            content=content,
+                            keywords=candidate.keywords,
+                            category=candidate.category,
+                            conversation_id=self._conversation_id,
+                            extraction_source="auto",
+                        )
+                        logger.debug(
+                            "Auto-extracted memory from turn %d (category=%s)",
+                            self._turn_counter,
+                            candidate.category,
+                        )
+        except Exception:
+            # Logging/extraction failure must never block search
+            logger.debug("Conversation logging/extraction failed", exc_info=True)
+
         budget = token_budget or self._token_budget
         effective_top_k = top_k or self._top_k

src/aimemory/mcp/server.py — 19 additions, 1 deletion

@@ -67,7 +67,7 @@ def _get_bridge() -> MemoryBridge:
 async def memory_save(
     content: str,
     keywords: list[str] | None = None,
-    category: str = "fact",
+    category: str = "auto",
     related_ids: list[str] | None = None,
     immutable: bool = False,
     pinned: bool = False,

@@ -79,11 +79,29 @@ async def memory_save(
         keywords: Optional list of keywords. Auto-extracted if not provided.
         category: Memory category. One of: fact, preference,
             experience, emotion, technical, core_principle.
+            Defaults to "auto" which auto-classifies from content.
         related_ids: Optional list of memory IDs to link as related.
         immutable: If True, memory cannot be updated or deleted.
         pinned: If True, memory is protected from the forgetting pipeline.
     """
     try:
+        # Auto-classify category if "auto"
+        if category == "auto":
+            from aimemory.selfplay.memory_agent import classify_category, extract_keywords
+
+            auto_keywords = keywords or extract_keywords(content)
+            raw = classify_category(content, auto_keywords)
+            cat_map = {
+                "general": "fact",
+                "personal": "fact",
+                "technical": "technical",
+                "preference": "preference",
+            }
+            category = cat_map.get(raw, "fact")
+            # Auto-extract keywords if not provided
+            if keywords is None:
+                keywords = auto_keywords
+
         result = _get_bridge().save_memory(
             content=content,
             keywords=keywords,
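One behavior of the mapping above is easy to miss: any classifier label not present in `cat_map` silently becomes `"fact"` via the `.get(raw, "fact")` fallback. A quick demonstration (the `"emotion"` example is illustrative — whether `classify_category()` ever emits that label is not shown in this diff):

```python
# Same mapping as in memory_save's auto-classification branch.
cat_map = {
    "general": "fact",
    "personal": "fact",
    "technical": "technical",
    "preference": "preference",
}

# Mapped labels pass through to their target category.
assert cat_map.get("technical", "fact") == "technical"
assert cat_map.get("preference", "fact") == "preference"
assert cat_map.get("personal", "fact") == "fact"

# Any label outside the map collapses to "fact" via the .get() default.
assert cat_map.get("emotion", "fact") == "fact"
```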
Lines changed: 149 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,149 @@
1+
"""SQLite-based append-only conversation log.
2+
3+
Stores all conversation turns for batch processing during sleep cycles.
4+
Uses WAL mode for concurrent read/write safety.
5+
"""
6+
7+
from __future__ import annotations
8+
9+
import logging
10+
import sqlite3
11+
from datetime import datetime, timedelta
12+
from pathlib import Path
13+
14+
logger = logging.getLogger(__name__)
15+
16+
_SCHEMA = """
17+
CREATE TABLE IF NOT EXISTS conversation_turns (
18+
id INTEGER PRIMARY KEY AUTOINCREMENT,
19+
conversation_id TEXT NOT NULL,
20+
turn_index INTEGER NOT NULL,
21+
role TEXT NOT NULL,
22+
content TEXT NOT NULL,
23+
timestamp TEXT NOT NULL,
24+
processed INTEGER NOT NULL DEFAULT 0
25+
);
26+
27+
CREATE INDEX IF NOT EXISTS idx_conv_id ON conversation_turns(conversation_id);
28+
CREATE INDEX IF NOT EXISTS idx_processed ON conversation_turns(processed);
29+
CREATE INDEX IF NOT EXISTS idx_timestamp ON conversation_turns(timestamp);
30+
"""
31+
32+
33+
class ConversationLog:
34+
"""Append-only conversation log backed by SQLite.
35+
36+
All conversation turns are recorded for later batch processing
37+
by the sleep cycle memory extraction pipeline.
38+
"""
39+
40+
def __init__(self, db_path: str | Path) -> None:
41+
self._db_path = str(db_path)
42+
self._conn = sqlite3.connect(self._db_path, check_same_thread=False)
43+
self._conn.execute("PRAGMA journal_mode=WAL")
44+
self._conn.executescript(_SCHEMA)
45+
self._conn.commit()
46+
47+
def append_turn(
48+
self,
49+
conversation_id: str,
50+
turn_index: int,
51+
role: str,
52+
content: str,
53+
) -> int:
54+
"""Append a conversation turn. Returns the row id."""
55+
now = datetime.now().isoformat()
56+
cursor = self._conn.execute(
57+
"INSERT INTO conversation_turns (conversation_id, turn_index, role, content, timestamp) "
58+
"VALUES (?, ?, ?, ?, ?)",
59+
(conversation_id, turn_index, role, content, now),
60+
)
61+
self._conn.commit()
62+
return cursor.lastrowid or 0
63+
64+
def get_unprocessed_turns(self, limit: int = 500) -> list[dict]:
65+
"""Get unprocessed turns ordered by conversation and turn index.
66+
67+
Returns list of dicts with keys: id, conversation_id, turn_index, role, content, timestamp.
68+
"""
69+
cursor = self._conn.execute(
70+
"SELECT id, conversation_id, turn_index, role, content, timestamp "
71+
"FROM conversation_turns "
72+
"WHERE processed = 0 "
73+
"ORDER BY conversation_id, turn_index "
74+
"LIMIT ?",
75+
(limit,),
76+
)
77+
return [
78+
{
79+
"id": row[0],
80+
"conversation_id": row[1],
81+
"turn_index": row[2],
82+
"role": row[3],
83+
"content": row[4],
84+
"timestamp": row[5],
85+
}
86+
for row in cursor.fetchall()
87+
]
88+
89+
def get_conversation(self, conversation_id: str) -> list[dict]:
90+
"""Get all turns for a specific conversation, ordered by turn index."""
91+
cursor = self._conn.execute(
92+
"SELECT id, conversation_id, turn_index, role, content, timestamp, processed "
93+
"FROM conversation_turns "
94+
"WHERE conversation_id = ? "
95+
"ORDER BY turn_index",
96+
(conversation_id,),
97+
)
98+
return [
99+
{
100+
"id": row[0],
101+
"conversation_id": row[1],
102+
"turn_index": row[2],
103+
"role": row[3],
104+
"content": row[4],
105+
"timestamp": row[5],
106+
"processed": bool(row[6]),
107+
}
108+
for row in cursor.fetchall()
109+
]
110+
111+
def mark_processed(self, turn_ids: list[int]) -> int:
112+
"""Mark turns as processed. Returns number of rows updated."""
113+
if not turn_ids:
114+
return 0
115+
placeholders = ",".join("?" for _ in turn_ids)
116+
cursor = self._conn.execute(
117+
f"UPDATE conversation_turns SET processed = 1 WHERE id IN ({placeholders})",
118+
turn_ids,
119+
)
120+
self._conn.commit()
121+
return cursor.rowcount
122+
123+
def cleanup_old(self, days: int = 30) -> int:
124+
"""Delete processed turns older than `days`. Returns number of rows deleted."""
125+
cutoff = (datetime.now() - timedelta(days=days)).isoformat()
126+
cursor = self._conn.execute(
127+
"DELETE FROM conversation_turns WHERE processed = 1 AND timestamp < ?",
128+
(cutoff,),
129+
)
130+
self._conn.commit()
131+
deleted = cursor.rowcount
132+
if deleted > 0:
133+
logger.info("Cleaned up %d old conversation turns (older than %d days)", deleted, days)
134+
return deleted
135+
136+
def count(self, processed: bool | None = None) -> int:
137+
"""Count turns, optionally filtered by processed status."""
138+
if processed is None:
139+
cursor = self._conn.execute("SELECT COUNT(*) FROM conversation_turns")
140+
else:
141+
cursor = self._conn.execute(
142+
"SELECT COUNT(*) FROM conversation_turns WHERE processed = ?",
143+
(1 if processed else 0,),
144+
)
145+
return cursor.fetchone()[0]
146+
147+
def close(self) -> None:
148+
"""Close the database connection."""
149+
self._conn.close()
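The log's lifecycle (append → fetch unprocessed → mark processed) is plain `sqlite3` underneath, so it can be exercised without the package at all. A self-contained sketch against the same schema, using an in-memory database instead of `conversation_log.db` (the WAL pragma is omitted since it does not apply to `:memory:` databases; the sample content is made up):

```python
import sqlite3
from datetime import datetime

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE conversation_turns (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    conversation_id TEXT NOT NULL,
    turn_index INTEGER NOT NULL,
    role TEXT NOT NULL,
    content TEXT NOT NULL,
    timestamp TEXT NOT NULL,
    processed INTEGER NOT NULL DEFAULT 0
);
""")


def append_turn(conv_id: str, idx: int, role: str, content: str) -> int:
    """Insert one turn, as ConversationLog.append_turn does, returning the row id."""
    cur = conn.execute(
        "INSERT INTO conversation_turns "
        "(conversation_id, turn_index, role, content, timestamp) VALUES (?, ?, ?, ?, ?)",
        (conv_id, idx, role, content, datetime.now().isoformat()),
    )
    conn.commit()
    return cur.lastrowid or 0


# Append one turn; it starts unprocessed.
rid = append_turn("abc123", 1, "user", "I prefer dark roast coffee")
unprocessed = conn.execute(
    "SELECT id FROM conversation_turns WHERE processed = 0"
).fetchall()

# The sleep cycle would extract from it, then mark it processed.
conn.execute("UPDATE conversation_turns SET processed = 1 WHERE id = ?", (rid,))
conn.commit()
remaining = conn.execute(
    "SELECT COUNT(*) FROM conversation_turns WHERE processed = 0"
).fetchone()[0]
```

After marking, no unprocessed rows remain, which is exactly the state `cleanup_old()` relies on before deleting aged rows.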
