docs: close Feature 084 — update all documentation

davidmatousek · claude · davidmatousek · commit cd9b71aec2b0 · 2026-04-08T11:16:35.000-04:00
Product: PRD INDEX (Delivered), User Stories, OKRs (PM)
Architecture: Tech Stack, ADR-020 MAESTRO classification (Architect)
DevOps: No changes needed (content-only feature)
KB: KB-021 — taxonomy overlay propagation pattern
Delivery: retrospective, metrics, delivery.md
Cleanup: branch deleted, tasks verified, BACKLOG.md regenerated

Co-Authored-By: Claude &lt;noreply@anthropic.com&gt;
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -98,6 +98,12 @@ When invoked as a subagent (via Agent tool), return ONLY:
 - Review `agent-assignments.md` for workload distribution
 
 ## Recent Changes
+- **Feature 084**: MAESTRO Layer Mapping (CSA seven-layer taxonomy for agentic AI)
+  - New `maestro_layer` field in `schemas/finding.yaml` (schema_version 1.1 to 1.2)
+  - Orchestrator Phase 1 keyword classification, finding inheritance, SARIF tags
+  - Downstream propagation: risk-scorer, control-analyzer, threat-report
+  - New shared reference: `.claude/skills/tachi-shared/references/maestro-layers-shared.md`
+  - All 6 example outputs regenerated with MAESTRO layer columns
 - **Feature 086**: Automated Release Tagging via GitHub Actions
   - release-please workflow for version tagging and CHANGELOG generation on merge to main
   - New files: `.github/workflows/release-please.yml`, `release-please-config.json`, `.release-please-manifest.json`
diff --git a/docs/INSTITUTIONAL_KNOWLEDGE.md b/docs/INSTITUTIONAL_KNOWLEDGE.md
@@ -3,9 +3,9 @@
 **Project**: tachi - Automated threat modeling toolkit extending STRIDE with AI-specific threat agents for agentic applications
 **Purpose**: Capture learnings, patterns, and solutions to prevent repeated mistakes
 **Created**: {{PROJECT_START_DATE}}
-**Last Updated**: 2026-04-06
+**Last Updated**: 2026-04-08
 
-**Entry Count**: 20 / 20 (KB System Upgrade triggers at 20 — schedule review)
+**Entry Count**: 21 / 20 (KB System Upgrade triggers at 20 — schedule review)
 **Last Review**: 2026-03-30
 **Status**: ✅ Manual mode (file-based)
 
@@ -410,6 +410,29 @@ Captured during structured delivery retrospective. Smooth sailing — everything
 
 ---
 
+### KB-021: Taxonomy Overlay Features Propagate Smoothly Through Finding IR
+
+**Date**: 2026-04-08
+**Category**: Architecture
+**Source**: Feature 084 retrospective
+**Severity**: Informational
+
+**Problem**: Uncertainty about whether adding a new classification dimension (MAESTRO layers) to the existing pipeline would require significant refactoring of downstream agents and output formats.
+
+**Root Cause**: The finding IR was designed with optional extensible fields, and downstream agents already follow a passive propagation pattern — they include whatever fields exist in the finding without gating on them. This architecture makes taxonomy overlays a natural fit.
+
+**Solution**: Feature 084 added `maestro_layer` as an optional field in the finding schema, classified components in Phase 1 via keyword matching, and let the field flow passively to all downstream outputs. No agent detection logic, scoring formulas, or dispatch rules required changes. All 22 tasks completed in 2 days across 5 waves.
+
+**Result**: Implementation was smoother and faster than the 2-3 day estimate. The keyword-based classification achieved 95.2% accuracy on existing examples. Example regeneration validated the full pipeline end-to-end in a single pass.
+
+**When to Apply**: Any future feature that adds a classification dimension to findings (e.g., compliance framework mapping, kill-chain phase tagging). Follow the same pattern: define taxonomy in shared reference, classify in Phase 1, extend schema with optional field, let downstream agents propagate passively.
+
+**Tags**: #retrospective #architecture #taxonomy #pipeline #maestro
+
+**Quality Score**: 8/10
+
+---
+
 ## Bug Fixes
 
 *No entries yet. Use `/kb-create` to add the first bug fix.*
diff --git a/docs/architecture/00_Tech_Stack/README.md b/docs/architecture/00_Tech_Stack/README.md
diff --git a/docs/architecture/02_ADRs/ADR-020-maestro-layer-classification.md b/docs/architecture/02_ADRs/ADR-020-maestro-layer-classification.md
@@ -0,0 +1,123 @@
+# ADR-020: MAESTRO Layer Classification for Threat Findings
+
+**Status**: Accepted
+**Date**: 2026-04-08
+**Deciders**: Architect
+**Feature**: 084 (MAESTRO Layer Mapping)
+
+---
+
+## Context
+
+tachi's threat model pipeline classifies components by DFD element type (External Entity, Process, Data Store, Data Flow) during Phase 1 (Scope), then dispatches STRIDE and AI threat agents accordingly. However, the pipeline had no awareness of where components sit within an agentic AI architecture stack. This limited the ability to:
+
+1. **Identify architectural concentration risk**: Threats clustered at one layer (e.g., many threats against Foundation Model components) were invisible without manual review.
+2. **Align with industry taxonomy**: Security teams using CSA's MAESTRO framework could not map tachi findings to their existing layer-based security controls.
+3. **Enable layer-based risk aggregation**: The Risk Summary section could not break down findings by architectural layer, making it harder to prioritize remediation across system tiers.
+
+**Constraints**:
+- Classification must not require user-provided layer annotations -- it must be automatic.
+- Classification must be deterministic and auditable (no LLM-based classification of component types).
+- The new field must propagate through the entire pipeline without breaking existing consumers.
+- Classification must handle components that do not fit any layer gracefully (no errors for unclassifiable components).
+
+---
+
+## Decision
+
+We will adopt the **CSA MAESTRO seven-layer taxonomy** as the architectural layer classification framework, implemented via **keyword-based substring matching** during Phase 1 with ordered evaluation (L1 through L7, first match wins).
+
+The implementation adds:
+1. A shared reference file (`maestro-layers-shared.md`) with the canonical keyword-to-layer mapping table.
+2. An optional `maestro_layer` field on the finding IR schema (`schemas/finding.yaml`, schema_version 1.1 to 1.2).
+3. Phase 1 classification: after DFD type assignment, each component is classified by matching its name, description, and DFD type against layer keywords.
+4. Phase 3 inheritance: each finding inherits `maestro_layer` from its target component.
+5. Passive propagation through downstream agents (risk-scorer, control-analyzer, threat-report).
+
+---
+
+## Rationale
+
+**Reasons**:
+1. **Industry alignment**: CSA MAESTRO is the only published seven-layer taxonomy specifically designed for agentic AI architectures. Using it provides a shared vocabulary with the broader security community.
+2. **Keyword matching reuses proven pattern**: The AI dispatch mechanism (Feature 007) already uses case-insensitive keyword matching to classify components. MAESTRO classification follows the same pattern, reducing implementation risk.
+3. **Deterministic and auditable**: Keyword matching produces the same result every run for the same input, unlike LLM-based classification which could vary between invocations. The keyword table is inspectable and versioned.
+4. **Non-breaking schema extension**: The `maestro_layer` field defaults to "Unclassified", making it backward-compatible with existing threat models and consumers that do not use it.
+5. **Shared reference prevents drift**: Following ADR-019 (shared definitions), the keyword table lives in `tachi-shared` as a single source of truth consumed by all agents that need layer awareness.
+
+---
+
+## Alternatives Considered
+
+### Alternative 1: LLM-Based Component Classification
+**Pros**:
+- Could handle ambiguous component names more intelligently
+- No keyword table maintenance
+
+**Cons**:
+- Non-deterministic: same component could be classified differently across runs
+- Not auditable: no inspectable mapping table
+- Adds inference cost and latency to Phase 1
+- Conflicts with tachi's deterministic pipeline design principle
+
+**Why Not Chosen**: Determinism and auditability are critical for security tooling. Keyword matching provides both while handling the vast majority of real-world component names correctly.
+
+### Alternative 2: User-Provided Layer Annotations
+**Pros**:
+- Most accurate classification possible
+- No false positives from keyword matching
+
+**Cons**:
+- Adds mandatory input burden on every threat model run
+- Breaks the current zero-configuration input contract
+- Users unfamiliar with MAESTRO would produce inaccurate annotations
+
+**Why Not Chosen**: Automatic classification with graceful fallback ("Unclassified") delivers value without requiring users to learn a new taxonomy.
+
+### Alternative 3: Custom Layer Taxonomy
+**Pros**:
+- Could be tailored exactly to tachi's needs
+- No dependency on external framework updates
+
+**Cons**:
+- Reinvents an existing standard without adding value
+- No shared vocabulary with the security community
+- Additional maintenance burden to keep custom taxonomy current
+
+**Why Not Chosen**: CSA MAESTRO is well-aligned with tachi's target domain (agentic AI). A custom taxonomy would duplicate effort without meaningful benefit.
+
+---
+
+## Consequences
+
+### Positive
+- Threat models now include architectural layer context for every component and finding
+- Risk Summary includes a "Risk by MAESTRO Layer" breakdown enabling layer-based prioritization
+- SARIF output tags findings with MAESTRO layer for downstream tooling integration
+- Aligns tachi with CSA's industry-standard framework for agentic AI security
+
+### Negative
+- Keyword matching can misclassify components with ambiguous names (e.g., "API Gateway" could be L4 or L7)
+- Evaluation order (L1-L7) is load-bearing -- reordering keywords changes classification
+- Adding new layers or modifying keywords requires regression testing against all example outputs
+
+### Mitigation
+- Evaluation order follows a specificity gradient (most specific first) to minimize ambiguity
+- "Unclassified" default ensures graceful degradation for unrecognized components
+- All 6 example outputs were regenerated and validated after implementation
+- Keyword table changes are documented with a warning about ordering sensitivity
+
+---
+
+## Related Decisions
+
+- [ADR-019](ADR-019-shared-definitions-and-model-field-governance.md): Shared definitions pattern that governs where the MAESTRO keyword table lives
+- [ADR-003](ADR-003-stride-per-element-dispatch.md): STRIDE-per-Element dispatch pattern that MAESTRO classification extends
+
+---
+
+## References
+
+- `.claude/skills/tachi-shared/references/maestro-layers-shared.md` -- Canonical keyword-to-layer mapping
+- `schemas/finding.yaml` -- Finding IR schema with `maestro_layer` field (v1.2)
+- Cloud Security Alliance, "MAESTRO: Multi-Agent Environment Security Toolkit for Reasoning and Orchestration", February 2025
diff --git a/docs/product/02_PRD/INDEX.md b/docs/product/02_PRD/INDEX.md
@@ -1,11 +1,11 @@
 # PRD Index
 
-**Last Updated**: 2026-04-07
+**Last Updated**: 2026-04-08
 
 
 | # | Feature | PM | Architect | Team-Lead | Status | Date |
 |---|---------|----|-----------|-----------| -------|------|
-| 084 | [MAESTRO Layer Mapping](084-maestro-layer-mapping-2026-04-07.md) | ⚠ | ⚠ | ⚠ | Approved | 2026-04-07 |
+| 084 | [MAESTRO Layer Mapping](084-maestro-layer-mapping-2026-04-07.md) | ✓ | ✓ | ✓ | Delivered | 2026-04-08 |
 | 086 | [Automated Release Tagging via GitHub Actions](086-automated-release-tagging-via-github-actions-2026-04-06.md) | ✓ | ✓ | ✓ | Delivered | 2026-04-06 |
 | 066 | [Install Script and Version Tagging](066-install-script-and-version-tagging-2026-04-06.md) | ✓ | ✓ | ✓ | Delivered | 2026-04-06 |
 | 078 | [Agent Context Optimization](078-agent-context-optimization-2026-04-01.md) | ✓ | ✓ | ✓ | Delivered | 2026-04-02 |
diff --git a/docs/product/05_User_Stories/README.md b/docs/product/05_User_Stories/README.md
@@ -1,6 +1,6 @@
 # User Stories - tachi
 
-**Last Updated**: 2026-04-02
+**Last Updated**: 2026-04-08
 **Owner**: Product Manager (product-manager)
 **Status**: Template - Complete after MVP launch
 
@@ -373,3 +373,13 @@ Each PRD should include relevant user stories:
 - **US-078-3** (P1): Model Field Assignment - All 17 tachi agent definitions updated with explicit model fields for optimal delegation routing across agent tiers (Leaf, Report, Methodology)
 - **US-078-4** (P1): Best Practices Documentation Update - Shared _TACHI_AGENT_BEST_PRACTICES.md updated with restructuring patterns, skill reference conventions, and compliance verification for all 17 agents
 - **US-078-5** (P0): Zero Regression Validation - All restructured agents validated against example threat models with equivalent output structure, severity counts, and SARIF compliance preserved
+
+### Feature 084: MAESTRO Layer Mapping
+
+**PRD**: [084-maestro-layer-mapping](../02_PRD/084-maestro-layer-mapping-2026-04-07.md)
+**Delivered**: 2026-04-08 | **PR**: #92 | **Tasks**: 22/22 complete | **Stories**: 4/4 passing
+
+- **US-084-1** (P0): Layer-Tagged Threat Findings - Each finding in STRIDE and AI threat tables includes a MAESTRO Layer column, derived from component classification in Phase 1 with "Unclassified" default for unmatched components
+- **US-084-2** (P0): Phase 1 Component Classification - Orchestrator classifies each component by MAESTRO layer using keyword matching against name, description, and DFD type during Phase 1 (Scope), with dispatch table showing MAESTRO Layer column
+- **US-084-3** (P0): SARIF Layer Tags - SARIF results include `maestro-layer:{layer-name}` in properties.tags array and `maestro-layer` key in properties for security tooling filtering
+- **US-084-4** (P1): Layer-Based Risk Summary - Risk summary in threats.md includes "Risk by MAESTRO Layer" subsection showing finding count and highest severity per layer, omitting layers with zero findings
diff --git a/docs/product/06_OKRs/README.md b/docs/product/06_OKRs/README.md
@@ -1,6 +1,6 @@
 # OKRs (Objectives and Key Results) - tachi
 
-**Last Updated**: 2026-04-02
+**Last Updated**: 2026-04-08
 **Owner**: Product Manager (product-manager)
 **Status**: Template - Complete after MVP launch
 
@@ -164,3 +164,4 @@ OKRs align the team around measurable goals. They answer:
 | 2026-04-01 | F-074: Baseline-Aware Pipeline | [074](../02_PRD/074-baseline-aware-pipeline-2026-03-31.md) | Baseline-aware threat detection pipeline with 4-phase orchestration (carry-forward, isolated discovery, merge/dedup, coverage gate). Correlates findings across runs with stable IDs via SARIF fingerprints. Delta annotations (`[NEW]`, `[UNCHANGED]`, `[UPDATED]`, `[RESOLVED]`) on all outputs. Coverage checklists per STRIDE category. Extended orchestrator, risk-scorer, and control-analyzer agents. New coverage-checklists schema. Updated all output templates and SARIF properties. Unblocks #55 (Security Progression Summary). |
 | 2026-04-02 | F-078: Agent Context Optimization | [078](../02_PRD/078-agent-context-optimization-2026-04-01.md) | Restructured 6 tachi agents (orchestrator, risk-scorer, control-analyzer, report-assembler, threat-report, threat-infographic) from monolithic prompts to lean definitions with on-demand skill references. Created 4 skill directories (tachi-orchestration, tachi-risk-scoring, tachi-report-assembly, tachi-shared) with 25+ granular reference files. Added explicit model fields to all 17 agent definitions. Reduced agent prompt sizes by 40-60%. Zero regression on threat model outputs. |
 | 2026-04-06 | F-086: Automated Release Tagging via GitHub Actions | [086](../02_PRD/086-automated-release-tagging-via-github-actions-2026-04-06.md) | Google release-please GitHub Action for automated version tagging from conventional commits. Deliverables: release-please.yml workflow, release-please-config.json, .release-please-manifest.json (baseline v4.0.0), README Releases section. Eliminates manual `git tag` commands; maintainer controls release timing via Release PR merge. |
+| 2026-04-08 | F-084: MAESTRO Layer Mapping | [084](../02_PRD/084-maestro-layer-mapping-2026-04-07.md) | CSA MAESTRO seven-layer taxonomy overlay for all threat findings. New schema field, shared reference, orchestrator keyword classification in Phase 1, SARIF tags, downstream agent propagation, layer-based risk summary. All 6 example outputs regenerated. 4 user stories delivered. |
diff --git a/docs/product/_backlog/BACKLOG.md b/docs/product/_backlog/BACKLOG.md
@@ -1,6 +1,6 @@
 # Backlog
 
-> Auto-generated from GitHub Issues on 2026-04-07T23:22:23Z.
+> Auto-generated from GitHub Issues on 2026-04-08T15:15:56Z.
 > Source of truth: GitHub Issues with `stage:*` labels.
 > Regenerate: `/aod.status` or `.aod/scripts/bash/backlog-regenerate.sh`
 
@@ -37,15 +37,14 @@
 
 | # | Title | Delivered | Retro | Updated |
 |---|-------|-----------|-------|---------|
-| — | *No items in this stage* | | |
+| #84 | MAESTRO layer mapping — CSA seven-layer taxonomy overlay for threat findings | 2026-04-08 | — | 2026-04-08 |
 
 ## Untracked
 
 > These issues have no `stage:*` label. Add a label to track them in the lifecycle.
 
 | # | Title | State | Updated |
 |---|-------|-------|---------|
-| #84 | MAESTRO layer mapping — CSA seven-layer taxonomy overlay for threat findings | OPEN | 2026-04-07 |
 | #27 | Developer Guide: Automated Threat Modeling for Your Architecture | CLOSED | 2026-03-24 |
 | #18 | Feature: Threat Infographic Agent | CLOSED | 2026-03-23 |
 | #15 | Feature 007: Threat Report Agent & Attack Trees | CLOSED | 2026-03-23 |
diff --git a/specs/084-maestro-layer-mapping/delivery.md b/specs/084-maestro-layer-mapping/delivery.md
@@ -0,0 +1,74 @@
+# Delivery Report — Feature 084: MAESTRO Layer Mapping
+
+**Feature**: 084 — MAESTRO Layer Mapping
+**Branch**: `084-maestro-layer-mapping`
+**PR**: #92 (squash-merged)
+**Delivered**: 2026-04-08
+**Issue**: #84
+
+---
+
+## Delivery Metrics
+
+| Metric | Value |
+|--------|-------|
+| Tasks completed | 22/22 |
+| Execution waves | 5 |
+| Estimated duration | 2-3 days |
+| Actual duration | 2 days (2026-04-07 → 2026-04-08) |
+| Checkpoints passed | P0 ✅, P1 ✅, P2 ✅ |
+
+## Validation Results
+
+| Criterion | Result |
+|-----------|--------|
+| SC-001: Classification rate | 95.2% — PASS (>90%) |
+| SC-003: Non-MAESTRO diffs | Zero — PASS |
+| Architect review | APPROVED_WITH_CONCERNS (1 low — resolved) |
+| Code review | APPROVED (after fixes) |
+| Security scan | Skipped (no code or manifest files changed) |
+
+## Key Deliverables
+
+1. **Shared reference**: `.claude/skills/tachi-shared/references/maestro-layers-shared.md` — canonical MAESTRO taxonomy with keyword mappings
+2. **Schema extension**: `schemas/finding.yaml` — new `maestro_layer` field, schema_version 1.1 → 1.2
+3. **Orchestrator classification**: Phase 1 MAESTRO keyword matching + finding inheritance
+4. **SARIF tags**: `maestro-layer:{layer}` in `properties.tags[]`
+5. **Downstream propagation**: risk-scorer, control-analyzer, threat-report all propagate MAESTRO layer
+6. **Example outputs**: All 6 example architectures regenerated with MAESTRO layer columns
+7. **Output schema template**: `templates/tachi/output-schemas/threats.md` updated
+
+## User Stories Delivered
+
+| Story | Priority | Status |
+|-------|----------|--------|
+| US-1: Layer-Tagged Threat Findings | P0 | ✅ Delivered |
+| US-2: Phase 1 Component Classification | P0 | ✅ Delivered |
+| US-3: SARIF Layer Tags | P0 | ✅ Delivered |
+| US-4: Layer-Based Risk Summary | P1 | ✅ Delivered |
+
+## Surprise Log
+
+**Smoother than expected** — Implementation went faster than anticipated. The taxonomy overlay pattern (optional field + passive propagation) proved to be a natural fit for the existing finding IR architecture. Keyword-based classification was deterministic and required no model changes. Downstream agents needed only column additions, not logic changes.
+
+## Lessons Learned
+
+**KB-021**: Taxonomy overlay features propagate smoothly through the finding IR. The optional-field + passive-propagation pattern makes adding new classification dimensions (MAESTRO layers, compliance mappings, kill-chain phases) straightforward. Future taxonomy features should follow the same approach: shared reference → Phase 1 classification → schema extension → passive propagation.
+
+## Documentation Updates
+
+| Domain | Agent | Files Updated |
+|--------|-------|---------------|
+| Product | PM | PRD INDEX, User Stories, OKRs |
+| Architecture | Architect | Tech Stack, ADR-020 (new), CLAUDE.md, system design (pre-existing) |
+| DevOps | DevOps | No changes needed (content-only feature) |
+| KB | — | INSTITUTIONAL_KNOWLEDGE.md (KB-021) |
+
+## Triad Sign-offs
+
+| Role | Artifact | Status | Date |
+|------|----------|--------|------|
+| PM | spec.md | APPROVED | 2026-04-07 |
+| PM | tasks.md | APPROVED | 2026-04-07 |
+| Architect | tasks.md | APPROVED_WITH_CONCERNS | 2026-04-07 |
+| Team Lead | tasks.md | APPROVED | 2026-04-07 |