Skip to content

Feature/7 living doc copilot#9

Draft
miroslavpojer wants to merge 35 commits into
masterfrom
feature/7-living-doc-copilot
Draft

Feature/7 living doc copilot#9
miroslavpojer wants to merge 35 commits into
masterfrom
feature/7-living-doc-copilot

Conversation

@miroslavpojer
Copy link
Copy Markdown
Contributor

Summary

Closes #7.

This PR delivers the full Living Documentation agent suite — two new custom VS Code agents (@living-doc-copilot and @living-doc-bdd-copilot) backed by 11 skills, automation scripts, eval suites, a shared glossary, and user-facing guides.


Changes

Agents

  • @living-doc-copilot — Requirements-layer agent. Creates, updates, heals, and plans User Story / Feature / Functionality entities in the living doc catalog. Storage-format agnostic (prompts for a Storage Profile on first use).
  • @living-doc-bdd-copilot — Automation-layer agent. Explores web apps via MCP Playwright, generates PageObjects, produces Gherkin scenarios and step definitions, and maintains the BDD suite through RE-SCAN / HEALING / REMOVE modes.
  • Eval suites added for both agents (evals.json + trigger-eval.json).

Skills (living-doc domain)

Skill Purpose
living-doc-create-user-story Create well-formed User Stories with business-level ACs
living-doc-create-feature Document system surfaces (screens, endpoints, modules) as Feature entities
living-doc-create-functionality Define atomic, testable behaviors with Functionality-level ACs
living-doc-update Add ACs, change status, deprecate existing entities
living-doc-gap-finder Audit catalog completeness; surface orphan tests and untested ACs
living-doc-impact-analysis Trace which entities are affected by a code or contract change
living-doc-scenario-creator Generate Gherkin scenario skeletons from User Story ACs
living-doc-pageobject-scan Discover/maintain PageObjects from a live webapp or test codebase
gherkin-scenario Write BDD scenarios in business language
gherkin-step Implement step definitions (behave/Cucumber Java/TS/Scala)
gherkin-living-doc-sync Keep Gherkin scenarios and AC link headers in sync with the catalog

Each skill ships with evals.json, trigger-eval.json, fixture files, and a scripts/ directory.

Automation scripts

Python helpers bundled with each skill:

  • next_id.py — auto-increment entity IDs (create-* skills)
  • compute_gaps.py — diff catalog vs test coverage (gap-finder)
  • trace_impact.py — reverse-trace changed modules to living-doc entities (impact-analysis)
  • validate_entity.py — schema-check entity YAML/JSON files (update)
  • scan_ac_links.py — lint .feature files for AC link header compliance (gherkin-living-doc-sync)
  • manifest_diff.py — detect PageObject drift between scan runs (pageobject-scan)
  • coverage_report.py — map scenarios to AC coverage (scenario-creator)

Shared reference

  • skills/references/living-doc-glossary.md — canonical term definitions shared across all living-doc skills.

Documentation

  • docs/guides/living-doc-copilot.md — usage guide for @living-doc-copilot
  • docs/guides/living-doc-bdd-copilot.md — usage guide for @living-doc-bdd-copilot
  • docs/guides/token-saving.md — guide for the token-saving skill
  • docs/testing/agent-testing.md — how to run agent evals
  • docs/testing/skill-testing.md — moved from docs/ into the testing/ subfolder
  • Updated README.md, CONTRIBUTING.md, docs/README.md, and docs/getting-started.md

CI

  • Minor fix to .github/workflows/check_pr_release_notes.yml (added title parameter).

Release Notes

  • @living-doc-copilot agent — maintains the living documentation catalog (User Stories, Features, Functionalities) directly from VS Code chat.
  • @living-doc-bdd-copilot agent — crawls web apps with MCP Playwright and generates PageObjects, Gherkin scenarios, and step definitions tied to living doc ACs.
  • 11 new living-doc skills covering the full requirements-to-BDD pipeline: living-doc-create-user-story, living-doc-create-feature, living-doc-create-functionality, living-doc-update, living-doc-gap-finder, living-doc-impact-analysis, living-doc-scenario-creator, living-doc-pageobject-scan, gherkin-scenario, gherkin-step, gherkin-living-doc-sync.
  • Shared living-doc glossary (skills/references/living-doc-glossary.md).
  • Automation scripts (Python) bundled with each skill for ID management, gap detection, impact tracing, entity validation, AC link linting, PageObject diff, and scenario coverage reporting.
  • Eval suites (evals.json + trigger-eval.json) for all new skills and agents.
  • User guides under docs/guides/ and testing documentation under docs/testing/.
  • docs/testing/skill-testing.md moved from docs/ to docs/testing/.
  • README.md, CONTRIBUTING.md, docs/README.md, and docs/getting-started.md updated to reflect the new skill catalog.
  • check_pr_release_notes.yml — added missing title parameter to release notes presence check.

…mpact tracing, and entity validation

- Implemented `next_id.py` for auto-assigning IDs to user stories, features, functionalities, and acceptance criteria.
- Created `compute_gaps.py` to analyze a catalog snapshot and identify gaps in documentation and testing coverage.
- Developed `trace_impact.py` to trace the impact of code changes on features, functionalities, and user stories based on a catalog.
- Added `validate_entity.py` to validate living doc entities against a canonical schema, ensuring required fields and referential integrity.
- Implemented `scan_ac_links.py` to check for missing or malformed AC link headers in Gherkin scenarios.
- Validates AC ID format and checks for duplicates within the same feature file.

docs: create gherkin-scenario skill documentation

- Added `SKILL.md` for `gherkin-scenario` detailing standards for writing BDD scenarios in Gherkin.
- Covers traceability requirements, language use, and anti-pattern avoidance.

docs: create gherkin-step skill documentation

- Added `SKILL.md` for `gherkin-step` outlining best practices for implementing Gherkin step definitions.
- Emphasizes keeping steps thin, encapsulating selectors, and sharing state correctly.

docs: create living-doc-pageobject-scan skill documentation

- Added `SKILL.md` for `living-doc-pageobject-scan` detailing how to generate and maintain PageObject classes.
- Describes modes for creating and maintaining PageObjects, including selector preferences and output artifacts.

feat: add manifest diff script for PageObject validation

- Implemented `manifest_diff.py` to compare the manifest against PageObject files on disk.
- Identifies stale manifest entries and undocumented PageObjects.

docs: create living-doc-scenario-creator skill documentation

- Added `SKILL.md` for `living-doc-scenario-creator` detailing the process of generating BDD scenarios from User Stories.
- Includes workflow steps for mapping acceptance criteria to scenarios and identifying missing steps.

feat: add coverage report script for AC tracking

- Implemented `coverage_report.py` to generate a report on AC coverage by Gherkin scenarios.
- Scans feature files for AC links and compares them against User Stories to identify gaps.
- Updated Gherkin step definitions to remove Java and Scala references, focusing on Python behave and TypeScript Cucumber.
- Enhanced Living Doc PageObject scan to include TypeScript examples and clarified output artifact locations.
- Revised Living Doc Scenario Creator to improve missing step handling and stub generation, ensuring better integration with PageObjects.
- Created a comprehensive implementation roadmap for the Agentic Engineering Toolkit, detailing progress, file layout, and validation checklists.
…n skills with new trigger phrases and improved descriptions
- Introduced trigger evaluations for Gherkin living doc sync to identify sync-related queries.
- Added evaluation scenarios for Gherkin scenario creation, focusing on writing and reviewing BDD scenarios.
- Implemented evaluations for Gherkin step definitions, emphasizing the distinction between Gherkin text and step binding code.
- Created evaluations for living doc page object scanning, including bootstrap and maintain modes for PageObject generation.
- Established evaluations for living doc scenario creation, generating BDD scenarios from user stories and handling coverage reports.
…nality

- Updated SKILL.md for living-doc-create-functionality to clarify Functionality naming and acceptance criteria elicitation.
- Enhanced living-doc-create-user-story to streamline narrative elicitation and improve AC generation process.
- Revised living-doc-gap-finder to normalize script output and report gaps more effectively.
- Improved living-doc-impact-analysis to flag missing coverage and provide a re-test checklist.
- Enhanced living-doc-pageobject-scan to better handle fragile selectors and update PageObjects.
- Updated living-doc-scenario-creator to ensure accurate scenario generation and coverage reporting.
- Refined living-doc-update to maintain AC ID stability and improve documentation practices.
- Added gap-report.json to track documentation coverage and identify gaps in User Stories and Functionalities.
@miroslavpojer miroslavpojer self-assigned this May 26, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add living-doc skills and @living-doc-copilot agent

1 participant