docs(07.1): complete LLM-Based KG Builder Agent phase

Siddharth-Khattar · Siddharth-Khattar · commit 01b042883a73 · 2026-02-08T12:38:06.000+01:00
- 2 plans executed (6 commits), 8/8 must-haves verified
- Plan 01: DB schema evolution + Pydantic schemas
- Plan 02: KG Builder agent runner/prompt/factory + pipeline wiring
- Programmatic KG Builder replaced with LLM-based agent (Gemini Pro)
- VERIFICATION.md confirms all artifacts and wiring
diff --git a/.planning/ROADMAP.md b/.planning/ROADMAP.md
@@ -30,7 +30,7 @@
 | 5 | Agent Flow | Real-time visualization, SSE streaming, HITL dialogs | REQ-VIS-001/001a/002, REQ-INF-004 | ✅ COMPLETE |
 | 6 | Domain Agents | Financial, Legal, Strategy, Evidence agents, Entity taxonomy, Hypothesis evaluation | REQ-AGENT-003/004/005/006/007c/007d/007h, REQ-HYPO-002/003 | ✅ COMPLETE |
 | 7 | Knowledge Storage & Domain Agent Enrichment | DB schema, enriched citations, KG Builder, findings storage, KG API | REQ-AGENT-009, REQ-STORE-001/002, REQ-AGENT-003-006 (enrichment) | ✅ COMPLETE |
-| 7.1 | LLM-Based KG Builder Agent | Replace programmatic KG Builder with LLM agent for curated entities + semantic relationships | REQ-AGENT-009 (revised) | ⏳ IN PROGRESS (1/2) |
+| 7.1 | LLM-Based KG Builder Agent | Replace programmatic KG Builder with LLM agent for curated entities + semantic relationships | REQ-AGENT-009 (revised) | ✅ COMPLETE |
 | 7.2 | Knowledge Graph Frontend (D3.js Enhancement) | Improve D3.js graph with Epstein-inspired layout, physics, sidebars, filtering, document excerpts | REQ-VIS-003 | ⏳ NOT_STARTED |
 | 7.3 | Knowledge Graph Frontend (vis-network) — OPTIONAL | Premium vis-network graph visualization (preserved for experimentation) | REQ-VIS-003 (alternative) | ⏳ DEFERRED |
 | 8 | Synthesis Agent & Intelligence Layer | Cross-referencing, hypotheses, contradictions, gaps, timeline, case summary/verdict | REQ-AGENT-008, REQ-HYPO-*, REQ-WOW-*, REQ-VIS-004/005/006, REQ-TASK-001/002 | ⏳ NOT_STARTED |
@@ -681,13 +681,15 @@ Plans:
 
 **Depends on:** Phase 7 (case_findings + raw entities stored, KG API endpoints exist)
 
-**Status:** ⏳ PLANNED
+**Status:** ✅ COMPLETE (2026-02-08) — 2 plans, 6 commits, 8/8 must-haves verified
+
+**Verification:** `.planning/phases/07.1-llm-kg-builder-agent/07.1-VERIFICATION.md` — 8/8 must-haves verified
 
 **Plans:** 2 plans in 2 waves
 
 Plans:
 - [x] 07.1-01-PLAN.md — DB schema evolution (Alembic migration, ORM columns) + Pydantic schemas (LLM output + API response)
-- [ ] 07.1-02-PLAN.md — KG Builder agent (runner, factory, prompt, input assembly, DB write) + pipeline wiring
+- [x] 07.1-02-PLAN.md — KG Builder agent (runner, factory, prompt, input assembly, DB write) + pipeline wiring
 
 **Context / Why This Change:**
 The Phase 7 programmatic KG Builder produces low-quality graphs because:
@@ -1475,5 +1477,6 @@ For 2 developers working simultaneously:
 *Phase 7.1 (vis-network) planned: 2026-02-07 (3 plans in 3 waves — SUPERSEDED by architecture revision 2026-02-08)
 *Architecture revision: 2026-02-08 (Programmatic KG Builder → LLM-based KG Builder Agent; D3.js retained+enhanced; vis-network deferred to 7.3)
 *Phase 7.1 (LLM KG Builder) planned: 2026-02-08 (2 plans in 2 waves)
+*Phase 7.1 (LLM KG Builder) complete: 2026-02-08 (2 plans, 6 commits, 8/8 must-haves verified)
 *Phase 7.2 (D3.js Enhancement) defined: 2026-02-08
 *Phase 7.3 (vis-network, optional) renumbered: 2026-02-08
diff --git a/.planning/phases/07.1-llm-kg-builder-agent/07.1-VERIFICATION.md b/.planning/phases/07.1-llm-kg-builder-agent/07.1-VERIFICATION.md
@@ -0,0 +1,152 @@
+---
+phase: 07.1-llm-kg-builder-agent
+verified: 2026-02-08T12:40:00Z
+status: passed
+score: 8/8 must-haves verified
+must_haves:
+  truths:
+    - "KgEntity table has columns for aliases, description_brief, description_detailed, domains, source_finding_ids"
+    - "KgRelationship table has columns for evidence_excerpt, source_finding_ids, temporal_context, corroboration_count, confidence"
+    - "KgBuilderOutput Pydantic schema validates cleanly with Gemini structured output constraints"
+    - "Existing KG API responses include the new fields when present"
+    - "KG Builder agent receives all case_findings text + domain entity lists + case description as text-only input"
+    - "KG Builder agent produces a KgBuilderOutput with curated entities (integer IDs) and semantic relationships"
+    - "Old kg_entities and kg_relationships for the case are cleared before writing curated data"
+    - "Pipeline replaces programmatic build_knowledge_graph() with LLM-based KG Builder invocation"
+  artifacts:
+    - path: "backend/alembic/versions/e4b2c1a37f90_evolve_kg_schema_for_llm_builder.py"
+      provides: "Alembic migration adding new columns"
+    - path: "backend/app/models/knowledge_graph.py"
+      provides: "Updated ORM models with new Mapped columns"
+    - path: "backend/app/schemas/kg_builder.py"
+      provides: "KgBuilderOutput Pydantic schema for Gemini structured output"
+    - path: "backend/app/schemas/knowledge_graph.py"
+      provides: "Updated EntityResponse and RelationshipResponse"
+    - path: "backend/app/agents/kg_builder.py"
+      provides: "KgBuilderAgentRunner, input assembly, DB write, run_kg_builder"
+    - path: "backend/app/agents/prompts/kg_builder.py"
+      provides: "KG_BUILDER_SYSTEM_PROMPT"
+    - path: "backend/app/agents/factory.py"
+      provides: "create_kg_builder_agent static method"
+    - path: "backend/app/services/pipeline.py"
+      provides: "Stage 7 replaced with LLM KG Builder invocation"
+  key_links:
+    - from: "backend/app/agents/kg_builder.py"
+      to: "backend/app/schemas/kg_builder.py"
+      via: "KgBuilderOutput used as output_type"
+    - from: "backend/app/agents/kg_builder.py"
+      to: "backend/app/models/knowledge_graph.py"
+      via: "writes KgEntity and KgRelationship ORM models"
+    - from: "backend/app/services/pipeline.py"
+      to: "backend/app/agents/kg_builder.py"
+      via: "pipeline calls run_kg_builder()"
+    - from: "backend/app/agents/factory.py"
+      to: "backend/app/agents/prompts/kg_builder.py"
+      via: "lazy import of KG_BUILDER_SYSTEM_PROMPT"
+gaps: []
+---
+
+# Phase 7.1: LLM-Based KG Builder Agent Verification Report
+
+**Phase Goal:** Replace the programmatic KG Builder with an LLM-based agent that reads ALL domain agent outputs holistically and produces a curated knowledge graph with deduplicated high-level entities and semantic relationships -- enabling cross-domain connections impossible with per-finding co-occurrence.
+
+**Verified:** 2026-02-08T12:40:00Z
+**Status:** PASSED
+**Re-verification:** No -- initial verification
+
+## Goal Achievement
+
+### Observable Truths
+
+| # | Truth | Status | Evidence |
+|---|-------|--------|----------|
+| 1 | KgEntity table has columns for aliases, description_brief, description_detailed, domains, source_finding_ids | VERIFIED | Alembic migration `e4b2c1a37f90` adds all 5 columns as nullable JSONB/String/Text. ORM model has matching `Mapped` fields at lines 79-103 of `knowledge_graph.py`. |
+| 2 | KgRelationship table has columns for evidence_excerpt, source_finding_ids, temporal_context, corroboration_count, confidence | VERIFIED | Same Alembic migration adds all 5 columns. ORM model has matching `Mapped` fields at lines 207-233 of `knowledge_graph.py`. corroboration_count defaults to 1, confidence defaults to 0.0. |
+| 3 | KgBuilderOutput Pydantic schema validates cleanly with Gemini structured output constraints | VERIFIED | `kg_builder.py` schema uses `list[MetadataEntry]` (not `dict`), no Union types, no discriminated unions. Integer ID cross-referencing pattern avoids name-matching. KgBuilderEntity has 11 fields, KgBuilderRelationship has 9 fields, KgBuilderOutput wraps both. |
+| 4 | Existing KG API responses include the new fields when present | VERIFIED | `EntityResponse` has aliases, description_brief, description_detailed, domains, source_finding_ids as Optional fields (lines 34-48 of `knowledge_graph.py` schemas). `RelationshipResponse` has evidence_excerpt, source_finding_ids, temporal_context, corroboration_count, confidence as Optional fields (lines 145-162). Both use `model_config = ConfigDict(from_attributes=True)` for ORM auto-mapping. Generated TypeScript types in `api.ts` include all new fields. |
+| 5 | KG Builder agent receives all case_findings text + domain entity lists + case description as text-only input | VERIFIED | `assemble_kg_builder_input()` at line 113 of `agents/kg_builder.py` queries CaseFinding rows (formatted with `[FINDING:{uuid}]` prefixes grouped by agent_type), extracts DomainEntity lists from domain_results (serialized as JSON), and queries Case.description. Returns 3-tuple of (findings_text, entities_json, case_description). `_prepare_content()` at line 55 builds a text-only `types.Content` from these three sources. |
+| 6 | KG Builder agent produces a KgBuilderOutput with curated entities (integer IDs) and semantic relationships | VERIFIED | `KgBuilderAgentRunner` subclasses `DomainAgentRunner[KgBuilderOutput]` (line 30). `_get_output_type()` returns `KgBuilderOutput`. Factory method at line 315 of `factory.py` creates LlmAgent with `output_schema=KgBuilderOutput`, `output_key="kg_builder_result"`, Pro model, high thinking. System prompt (126 lines) instructs 8+1 entity taxonomy, semantic relationship types, integer ID assignment, deduplication rules, evidence grounding requirements. |
+| 7 | Old kg_entities and kg_relationships for the case are cleared before writing curated data | VERIFIED | `write_kg_from_llm_output()` at line 207 of `agents/kg_builder.py` executes `delete(KgRelationship).where(...)` then `delete(KgEntity).where(...)` then `flush()` before inserting new entities. FK-safe order (relationships first). Entity degrees computed after write via `compute_entity_degrees()`. |
+| 8 | Pipeline replaces programmatic build_knowledge_graph() with LLM-based KG Builder invocation | VERIFIED | `pipeline.py` imports `run_kg_builder` from `app.agents.kg_builder` (line 156). Stage 7 (lines 968-1031) calls `run_kg_builder()` wrapped in try/except. Grep confirms `build_knowledge_graph` does NOT appear in pipeline.py. SSE events fire: `emit_agent_started` (line 986), `emit_agent_complete` (line 1003), `emit_agent_error` (line 1025). Failure is non-blocking -- sets `kg_entities_created=0`, continues pipeline. |
+
+**Score:** 8/8 truths verified
+
+### Required Artifacts
+
+| Artifact | Expected | Status | Details |
+|----------|----------|--------|---------|
+| `backend/alembic/versions/e4b2c1a37f90_evolve_kg_schema_for_llm_builder.py` | Migration adding 10 nullable columns | VERIFIED | 154 lines. Adds 5 columns to kg_entities (aliases, description_brief, description_detailed, domains, source_finding_ids) and 5 to kg_relationships (evidence_excerpt, source_finding_ids, temporal_context, corroboration_count, confidence). Proper upgrade/downgrade. |
+| `backend/app/models/knowledge_graph.py` | Updated ORM models | VERIFIED | 245 lines. KgEntity has all 5 new Mapped fields (lines 79-103). KgRelationship has all 5 new Mapped fields (lines 207-233). Docstrings updated to reference "LLM-based KG Builder agent" and "semantic relationship extraction". |
+| `backend/app/schemas/kg_builder.py` | KgBuilderOutput Pydantic schema | VERIFIED | 141 lines. KgBuilderEntity (11 fields including integer id, aliases, description_brief/detailed, domains, source_finding_ids, properties via MetadataEntry). KgBuilderRelationship (9 fields including evidence_excerpt, temporal_context, confidence). KgBuilderOutput wrapper. ABOUTME comments present. |
+| `backend/app/schemas/knowledge_graph.py` | Updated API response schemas | VERIFIED | 221 lines. EntityResponse has 5 new Optional fields (lines 34-48). RelationshipResponse has 5 new Optional fields (lines 145-162). All use `from_attributes=True`. |
+| `backend/app/agents/kg_builder.py` | KgBuilderAgentRunner + run_kg_builder + input assembly + DB write | VERIFIED | 423 lines. KgBuilderAgentRunner subclass (lines 30-105). assemble_kg_builder_input() (lines 113-199). write_kg_from_llm_output() (lines 207-350). run_kg_builder() top-level (lines 358-423). ABOUTME comments present. |
+| `backend/app/agents/prompts/kg_builder.py` | KG_BUILDER_SYSTEM_PROMPT | VERIFIED | 126 lines. 8+1 entity taxonomy table, entity instructions (7 items), relationship instructions (7 items), what-not-to-do section, input/output format. Instructs semantic labels, evidence grounding, deduplication, cross-domain connections. ABOUTME comments present. |
+| `backend/app/agents/factory.py` | create_kg_builder_agent() | VERIFIED | Method at line 315 (33 lines). Uses MODEL_PRO, thinking_level "high", lazy imports KG_BUILDER_SYSTEM_PROMPT and KgBuilderOutput. No generate_content_config (text-only). Follows exact pattern of other factory methods. |
+| `backend/app/services/kg_builder.py` | Old functions marked deprecated | VERIFIED | 435 lines. `extract_entities_from_output` (line 30), `build_relationships_from_findings` (line 110), `deduplicate_entities` (line 201), `build_knowledge_graph` (line 359) all prefixed with "DEPRECATED: Replaced by LLM-based KG Builder agent (Phase 7.1). Kept for reference." `normalize_entity_name()` and `compute_entity_degrees()` preserved for reuse. |
+| `backend/app/services/pipeline.py` | Stage 7 replaced | VERIFIED | 1147 lines. Stage 7 at lines 968-1031 calls `run_kg_builder()`. try/except wraps invocation. SSE lifecycle events (started/complete/error). Strategy output appended to domain_results before KG Builder (lines 975-983). |
+| `packages/types/src/generated/api.ts` | Frontend types include new fields | VERIFIED | TypeScript types contain aliases, description_brief, description_detailed, domains, source_finding_ids (entity), evidence_excerpt, source_finding_ids, temporal_context, corroboration_count (relationship). |
+
+### Key Link Verification
+
+| From | To | Via | Status | Details |
+|------|----|-----|--------|---------|
+| `agents/kg_builder.py` | `schemas/kg_builder.py` | KgBuilderOutput used as output_type | VERIFIED | Line 42: `_get_output_type() returns KgBuilderOutput`. Line 20: `from app.schemas.kg_builder import KgBuilderOutput`. |
+| `agents/kg_builder.py` | `models/knowledge_graph.py` | writes KgEntity/KgRelationship ORM objects | VERIFIED | Line 19: `from app.models.knowledge_graph import KgEntity, KgRelationship`. write_kg_from_llm_output() creates KgEntity instances (line 251) and KgRelationship instances (line 305). |
+| `services/pipeline.py` | `agents/kg_builder.py` | calls run_kg_builder() | VERIFIED | Line 156: `from app.agents.kg_builder import run_kg_builder`. Line 995: `await run_kg_builder(...)`. Old `build_knowledge_graph` NOT referenced (grep returns no matches). |
+| `agents/factory.py` | `agents/prompts/kg_builder.py` | lazy import of KG_BUILDER_SYSTEM_PROMPT | VERIFIED | Line 335: `from app.agents.prompts.kg_builder import KG_BUILDER_SYSTEM_PROMPT`. Used in `_create_llm_agent()` call at line 341. |
+| `schemas/knowledge_graph.py` | `models/knowledge_graph.py` | from_attributes=True reads new ORM columns | VERIFIED | EntityResponse line 68: `model_config = ConfigDict(from_attributes=True)`. RelationshipResponse line 165: same. Field names match ORM column names exactly. |
+| `agents/kg_builder.py` | `services/kg_builder.py` | reuses normalize_entity_name and compute_entity_degrees | VERIFIED | Line 228: `from app.services.kg_builder import compute_entity_degrees, normalize_entity_name`. Used in write_kg_from_llm_output() at line 254 (normalize_entity_name) and line 337 (compute_entity_degrees). |
+| `agents/kg_builder.py` | `agents/domain_agent_runner.py` | subclasses DomainAgentRunner | VERIFIED | Line 30: `class KgBuilderAgentRunner(DomainAgentRunner[KgBuilderOutput])`. DomainAgentRunner.run() accepts `**kwargs` (line 194) enabling findings_text/entities_json/case_description passthrough. |
+| KG API endpoints | new KG data | GraphResponse uses from_attributes | VERIFIED | `api/knowledge_graph.py` line 98: `EntityResponse.model_validate(e)` -- automatically picks up all new ORM columns (aliases, description_brief, etc.) via from_attributes. No API route changes needed. |
+
+### Requirements Coverage
+
+| Requirement | Status | Blocking Issue |
+|-------------|--------|----------------|
+| REQ-AGENT-009 (revised -- LLM-based KG Builder) | SATISFIED | None. Agent reads all findings holistically, produces curated entities with semantic relationships. |
+
+### Anti-Patterns Found
+
+| File | Line | Pattern | Severity | Impact |
+|------|------|---------|----------|--------|
+| None found | -- | -- | -- | No TODO, FIXME, placeholder, or stub patterns detected in any of the 10 artifacts. |
+
+### Human Verification Required
+
+#### 1. Graph Quality Test
+**Test:** Run the full pipeline on a multi-file case (e.g., financial + legal documents) and inspect the resulting knowledge graph via `GET /api/cases/:caseId/graph`.
+**Expected:** ~15-30 curated entities (not 50+ noisy ones), relationships with semantic labels like "CEO of" or "transferred_funds_to" (not "co-occurrence"), entities with populated aliases arrays, relationships with evidence_excerpt fields containing actual quotes.
+**Why human:** Graph quality depends on Gemini Pro's response to the prompt. Structural verification confirms the pipeline exists and is wired correctly, but cannot predict LLM output quality.
+
+#### 2. Cross-Domain Deduplication Test
+**Test:** Upload files that mention the same person across financial and legal domains (e.g., "J. Smith" in financial docs, "John Smith" in legal docs). Check if the KG Builder merges them into one entity with both as aliases.
+**Expected:** Single entity with aliases `["J. Smith", "John Smith"]` and domains `["financial", "legal"]`.
+**Why human:** Deduplication quality depends on the LLM's ability to recognize the same person across different name variants and contexts.
+
+#### 3. No Standalone Timestamp/Amount Nodes Test
+**Test:** Inspect the graph for entities of type that should be metadata (e.g., "$5,000", "2023-01-15", "Model XYZ"). These should NOT be standalone nodes.
+**Expected:** Monetary amounts, timestamps, and physical objects appear as properties on entities/relationships, not as standalone graph nodes.
+**Why human:** The prompt instructs this, but LLM compliance varies.
+
+#### 4. Pipeline Graceful Failure Test
+**Test:** Trigger a KG Builder failure (e.g., by temporarily breaking the Gemini API key) and verify the pipeline continues past Stage 7.
+**Expected:** Error logged, SSE error event emitted, pipeline continues to Stage 7b and final steps, case status becomes READY (not ERROR).
+**Why human:** Requires runtime execution to verify error handling path.
+
+### Gaps Summary
+
+No gaps found. All 8 must-haves are verified at all three levels (exists, substantive, wired):
+
+1. **DB Schema Evolution** -- Alembic migration adds 10 nullable columns, ORM models have matching Mapped fields, API response schemas expose new fields as Optional.
+2. **Pydantic Output Schema** -- KgBuilderOutput with integer ID cross-referencing, Gemini-compatible (no dict types), 3 models totaling 141 lines.
+3. **KG Builder Agent** -- KgBuilderAgentRunner subclass with text-only content preparation, input assembly from case_findings + domain entities + case description, DB write with clear-and-rebuild strategy, lenient parsing, entity degree computation.
+4. **System Prompt** -- 126-line prompt with 8+1 entity taxonomy, semantic relationship instructions, deduplication rules, evidence grounding requirements, anti-patterns ("do NOT create standalone timestamp nodes").
+5. **Factory Method** -- create_kg_builder_agent() in AgentFactory, Pro model, high thinking, text-only (no media resolution).
+6. **Pipeline Wiring** -- Stage 7 calls run_kg_builder() instead of build_knowledge_graph(), wrapped in try/except for graceful failure, SSE lifecycle events for started/complete/error.
+7. **Old Code Preserved** -- 4 programmatic functions marked deprecated, normalize_entity_name() and compute_entity_degrees() reused.
+8. **API Compatibility** -- Existing KG API endpoints return curated data without any route changes (from_attributes auto-maps new ORM columns).
+
+---
+
+_Verified: 2026-02-08T12:40:00Z_
+_Verifier: Claude (gsd-verifier)_