feat(prompts): harden SDD planning, clarification, and review workflows by ryderstorm · Pull Request #50 · liatrio-labs/spec-driven-workflow

ryderstorm · 2026-03-17T15:44:27Z

Summary

This branch significantly strengthens the prompt-driven workflow across planning, spec generation, implementation evidence, and validation. The net effect is a more opinionated and reviewable SDD loop that blocks weak planning earlier, asks fewer unnecessary questions during spec generation, and produces artifacts that are easier for humans to inspect.

What changed

1. Add a mandatory planning-audit checkpoint in SDD-2

insert a required audit between task generation and implementation handoff
require human-approved remediation before planning defects can be corrected
allow existing spec/task work to resume through audit/remediation without regenerating the plan
add compact Gateboard-first reporting, re-audit deltas, and a reset/restart recommendation when planning debt is too high

2. Make planning audit decisions more reliable

require explicit repository-standards discovery before audit decisions are made
add a required standards evidence table so audit conclusions are grounded in actual repo guidance files instead of inferred behavior
add a chain-of-verification pass before handing work to SDD-3
clarify failure handling when standards files or other planning evidence are missing
tighten proof-artifact and requirement-to-test expectations during task planning

3. Improve SDD-1 spec generation and clarification behavior

replace the old always-generate-a-questions-file behavior with a clarification sufficiency check
allow SDD-1 to proceed directly to spec generation when the request is already clear enough
still force a questions file when ambiguity is material, requirements would be guessed, or current best practices leave meaningful approach choices open
require generated questions files to include recommended answers and rationale so users get guided choices instead of a flat questionnaire
add example guidance showing the intended recommendation style

4. Ground specs in current external guidance, not just repo patterns

require latest-standards research for technologies that materially affect the spec
prefer recent official or primary-source guidance when available
route repo-pattern-vs-modern-guidance conflicts back through the clarification flow when the right direction is unclear
require technical considerations in the spec to incorporate relevant current best practices or call out intentional deviations

5. Improve task and proof artifact review UX

change the SDD-2 Relevant Files section from a bullet list to a markdown table for easier scanability
require SDD-3 proof artifacts to lead with human-readable context instead of starting with raw command output
require descriptive proof titles, summary-first layout, per-artifact "what it proves / why it matters" framing, and short reviewer conclusions
require screenshot paths to be shown above inline embedded images so proof docs support faster visual review
add an example proof document shape that demonstrates the expected format

6. Strengthen validation semantics in SDD-4

refine file-integrity handling so unmapped core/source drift is blocker-level, while supporting-file linkage gaps are documented without automatically failing validation
tighten evidence-quality expectations so validation looks at proof usability, not just proof existence
explicitly check for descriptive proof titles, front-loaded context, inline screenshot presentation, and explanation before raw evidence
add clearer examples of non-blocking reviewability issues in validation output

7. Update docs to match the new workflow

update README.md to describe the audit-first planning flow, conditional questions files, current-standards grounding, and generated artifact set
extend docs/faq.md with planning-audit/remediation behavior so the docs tell the same story as the prompts

Why this matters

Before this branch, the workflow was easier to move through quickly, but it also allowed weak planning handoffs, over-eager question generation, and proof artifacts that were technically present but slow for humans to validate. These changes push the workflow toward:

better planning discipline before implementation starts
fewer unnecessary clarification loops
more grounded technical direction for modern stacks
faster human review of both plans and proof artifacts
clearer validation outcomes when evidence quality is weak

Files changed

README.md
docs/faq.md
prompts/SDD-1-generate-spec.md
prompts/SDD-2-generate-task-list-from-spec.md
prompts/SDD-3-manage-tasks.md
prompts/SDD-4-validate-spec-implementation.md

Validation

This is a prompt/docs-only branch. Validation focused on prompt/doc linting and consistency checks.

pre-commit run --files "prompts/SDD-1-generate-spec.md" "README.md"
pre-commit run --files "prompts/SDD-3-manage-tasks.md" "prompts/SDD-4-validate-spec-implementation.md"
pre-commit run --files "prompts/SDD-2-generate-task-list-from-spec.md"

Reviewer notes

this PR intentionally spans multiple prompts because the changes are workflow-coupled: planning audit behavior affects implementation handoff, proof artifacts affect validation, and spec clarification changes affect downstream planning quality
there are no application/runtime code changes in this branch; the impact is entirely in workflow behavior and generated markdown artifacts

PaulHLiatrio · 2026-03-17T16:22:41Z

prompts/SDD-1-generate-spec.md

+## Step 4: Clarification Sufficiency Check

-Ask clarifying questions to gather sufficient detail. Focus on understanding the "what" and "why" rather than the "how."
+Assess whether you already have enough aligned context to write a high-quality spec without inventing requirements. Always err on the side of caution, but do not force a questions file when the available information is already sufficient.


I wonder if we're leaning towards giving it too much freedom in this instance here; I reflect on when my AI ignored all my skills because it felt it could do it better and even called itself "overconfident".

I would almost prefer that it ask some clarifying questions about approach but mark what it thinks we want, and that could potentially introduce other methods we weren't aware of to the development process.

I can't count the amount of times I was on client and SDD asked me some questions, and I actually had to pull up with the team to discuss the approach because I hadn't considered the other options.

Adding on to this -- that being said maybe that can be up to the writer to include that in their prompt when generating the SDD. This could be the default, but if you want to hear different design choices you can ask for that.

This is a great callout, and your example is an important use case.

In my practice with these changes, i've seen it ask questions when i've given it very little to start with, and avoid it when things are good enough already. I've been pleased with the results so far, it's giving the right level of interaction. Let me know if you think it needs some tweaking after you've run it through a few times.

prompts/SDD-1-generate-spec.md

PaulHLiatrio · 2026-03-17T16:29:06Z

prompts/SDD-1-generate-spec.md

-Follow this format exactly when you create the questions file.
+Follow this format exactly when you create a questions file.
+
+Each question MUST include recommended answer guidance for the user. Recommendations should reduce ambiguity, explain tradeoffs, and bias toward the option that best supports a clear, reviewable, junior-friendly spec.


I'd almost want it to provide some reading material as well to speed up the decision making process if you're unfamiliar with the material.

interesting idea! i worry that leans too much into using SDD to just build anything, tho - the user should have some familiarity with what they're trying to build. SDD is designed for inputs that are already groomed, at least a little.

Yeah it was mostly the "junior-friendly spec" portion that brought this thought to me. In my mind part of Juniors job is learning more about what's being done, so helping with that process would be a part of making it "junior-friendly"

prompts/SDD-3-manage-tasks.md

Enforce planning audit gates between task generation and implementation so required planning defects are blocked before execution begins. Add audit-aware auto-selection and planning-resume behavior so existing specs with tasks can continue through audit/remediation without unwanted task regeneration. Co-Authored-By: GPT-5 Codex <no-reply@example.com> Co-authored-by: Cursor <cursoragent@cursor.com>

Clarify that SDD-2 now includes a mandatory planning audit checkpoint with human-approved remediation before implementation handoff. Update README workflow and artifact outputs, and extend FAQ with planning-quality and remediation behavior details. Co-Authored-By: GPT-5 Codex <no-reply@example.com> Co-authored-by: Cursor <cursoragent@cursor.com>

Improve SDD-2 audit readability by defaulting to compact Gateboard and exception-only reporting, with delta-focused re-audit output for faster human review cycles. Add a deterministic reset recommendation gate that triggers restart vs remediation decisions when planning debt exceeds defined thresholds. Co-Authored-By: GPT-5 Codex <no-reply@example.com> Co-authored-by: Cursor <cursoragent@cursor.com>

Add explicit standards-discovery and evidence checkpoints so repository guidance files are consistently read before audit decisions. Introduce a required chain-of-verification step before implementation handoff and define failure handling for missing standards artifacts. Keep audit output compact with a required standards evidence table and minimal pass-case reporting.

Introduce tiered Gate D rules to distinguish blocker-level core file drift from non-blocking supporting-file linkage gaps. Clarify file integrity expectations across gates, detailed checks, red flags, and report examples to improve validation consistency and traceability. Co-Authored-By: Codex <no-reply@example.com>

Clarify spec-generation review flow by nesting the cross-domain guard under Step 6 and replacing undefined terminology with context-aware wording. Improve validation guidance with explicit evidence quality and artifact sufficiency checks, and add a direct GATE D linkage clarification reference. Co-Authored-By: Codex <no-reply@example.com>

Allow SDD-1 to skip questions files when context is already sufficient by requiring an explicit clarification sufficiency check before spec generation. Add recommended-answer guidance to generated questions files so users get direction and tradeoff notes when clarification is needed. Co-Authored-By: OpenCode <no-reply@example.com>

Require SDD-1 to research current best practices for material technologies so specs are informed by modern guidance as well as repository patterns. Route technology-driven ambiguity through the questions file flow, including best-practice context and user confirmation when current guidance leaves meaningful implementation choices open. Co-Authored-By: OpenCode <no-reply@example.com>

Require SDD-3 proof files to lead with reviewer-friendly summaries, per-artifact context, and inline screenshots so humans can validate work faster. Teach SDD-4 to treat sparse proof formatting as an evidence-quality issue by checking for descriptive titles, front-loaded context, and inline image presentation. Co-Authored-By: OpenCode <no-reply@example.com>

Update SDD-2 so generated task files present the Relevant Files section as a markdown table instead of a bullet list. This makes planned file coverage easier to scan during task review without changing the underlying workflow. Co-Authored-By: OpenCode <no-reply@example.com>

Replace the non-dictionary Gateboard label in the SDD-2 prompt so the repository spell-check gate passes cleanly. This keeps the audit instructions readable while aligning with CI lint rules. Co-Authored-By: AI Assistant <no-reply@example.com>

prompts/SDD-1-generate-spec.md

leehopper · 2026-03-18T02:26:34Z

prompts/SDD-2-generate-task-list-from-spec.md

+Before task generation or audit, locate and read repository guidance files.
+
+Required search targets (if present):
+


I know AGENTS.md is becoming the standard, but is it worth still including CLAUDE.md for now as well?

good call! this section is a little too strict, too. It should start with something like

Examples of required search targets (if present):

FWIW mine still found CLAUDE.md and some other files like .gitlab-ci.yml that weren't explicitly listed! It noted that AGENTS.md was not in the repository, but didn't fail because of that

ryderstorm changed the title ~~feat(prompts): strengthen SDD planning and review workflows~~ feat(prompts): harden SDD planning, clarification, and review workflows Mar 17, 2026

ryderstorm force-pushed the feat/add-spec-review-checkpoint branch from 6012d97 to 3b0cda6 Compare March 17, 2026 15:48

ryderstorm marked this pull request as ready for review March 17, 2026 15:49

ryderstorm self-assigned this Mar 17, 2026

PaulHLiatrio reviewed Mar 17, 2026

View reviewed changes

ryderstorm force-pushed the feat/add-spec-review-checkpoint branch from 3b0cda6 to 147db0c Compare March 17, 2026 17:44

ryderstorm and others added 10 commits March 17, 2026 16:13

ryderstorm force-pushed the feat/add-spec-review-checkpoint branch from 147db0c to 024cd5d Compare March 17, 2026 20:14

docs(prompts): rename Gateboard references

13b6cf0

Replace the non-dictionary Gateboard label in the SDD-2 prompt so the repository spell-check gate passes cleanly. This keeps the audit instructions readable while aligning with CI lint rules. Co-Authored-By: AI Assistant <no-reply@example.com>

leehopper reviewed Mar 18, 2026

View reviewed changes

prompts/SDD-1-generate-spec.md Show resolved Hide resolved

leehopper reviewed Mar 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(prompts): harden SDD planning, clarification, and review workflows#50

feat(prompts): harden SDD planning, clarification, and review workflows#50
ryderstorm wants to merge 11 commits intomainfrom
feat/add-spec-review-checkpoint

ryderstorm commented Mar 17, 2026 •

edited

Loading

Uh oh!

PaulHLiatrio Mar 17, 2026

Uh oh!

PaulHLiatrio Mar 17, 2026

Uh oh!

ryderstorm Mar 17, 2026

Uh oh!

Uh oh!

Uh oh!

PaulHLiatrio Mar 17, 2026

Uh oh!

ryderstorm Mar 17, 2026

Uh oh!

PaulHLiatrio Mar 17, 2026

Uh oh!

Uh oh!

Uh oh!

leehopper Mar 18, 2026

Uh oh!

ryderstorm Mar 18, 2026

Uh oh!

maddiehawkins Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		Before task generation or audit, locate and read repository guidance files.

		Required search targets (if present):

Conversation

ryderstorm commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

1. Add a mandatory planning-audit checkpoint in SDD-2

2. Make planning audit decisions more reliable

3. Improve SDD-1 spec generation and clarification behavior

4. Ground specs in current external guidance, not just repo patterns

5. Improve task and proof artifact review UX

6. Strengthen validation semantics in SDD-4

7. Update docs to match the new workflow

Why this matters

Files changed

Validation

Reviewer notes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ryderstorm commented Mar 17, 2026 •

edited

Loading