auto-triage bot improvements by FredKSchott · Pull Request #15513 · withastro/astro

FredKSchott · 2026-02-14T06:28:48Z

Lots of small improvements as we finally start to get some broken / incomplete triage runs back from the auto-triage bot.

Summary

New: Docker sandbox for issue triage: Run the LLM (OpenCode server) inside an isolated Docker container during triage workflows so untrusted reproduction code never has access to secrets. Adds a Dockerfile.sandbox and a GHCR build workflow, and updates the triage workflow to use --sandbox. Moves the compiler clone into .compiler/ (gitignored) so it's accessible inside the container's bind mount.
New: Add a verify step to the triage pipeline that checks whether reported behavior is intentional before attempting a fix. Fixes cases where the bot took the reporter's expected behavior as truth, even when the reporter was confused or incorrect about it.
New: Make diagnose and fix skills aware of the withastro/compiler repo (cloned as a sibling in CI). Fixes cases where an issue traced back to the compiler and the bot tried to work around it in our astro codebase instead of pointing to the compiler as responsible.
New: Add a feasibility check to the fix skill for browser/runtime compatibility. Hopefully fixes issues where the bot suggests code that wouldn't run on modern browsers.
Fix: For some reason the reproduction instructions were gone (or never there?), so we hadn't been downloading repos/StackBlitz projects and were probably spending quite a lot of time trying to figure out the bug without a reproduction. Kind of surprised by the reproduction success rate given this, but I guess everyone is including enough detail in the issue for the LLM to go off of.
Chore: Ensure all skills explicitly read report.md before appending to it
Chore: Simplify the diagnose skill's review step
Chore: Refactor issue-triage.ts into composable helper functions.
Chore: Ignore triage folder from eslint
Chore: Tidy up AGENTS.md and simplify the project layout section
Chore: Bump @flue/cli to 0.0.20 and @flue/client to 0.0.12

Testing

No good way to test CI locally, so will need to test a bit post-merge.

The withastro/compiler repo may be cloned as a sibling directory. Instructs the diagnose skill to check it when stack traces point to compiler behavior, and the fix skill to document proposed compiler changes in report.md.

Extracts shouldRetriage, selectTriageLabels, fetchIssue, and runTriagePipeline from the monolithic triage function. Inlines schemas next to their call sites, adds early returns for non-reproducible and intended-behavior cases, and validates the issue response with valibot.
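The early returns described above can be sketched roughly as follows. This is only an illustration: the result shapes, the `TriageSteps` interface, and the outcome strings are hypothetical, and the steps are synchronous for brevity, whereas the real skills presumably run asynchronously.

```typescript
// Hypothetical result shapes; the real schemas in issue-triage.ts are not shown here.
type ReproduceResult = { reproducible: boolean };
type VerifyResult = { verdict: "bug" | "intended-behavior" };
type TriageOutcome = "not-reproducible" | "intended-behavior" | "fix-attempted";

// Hypothetical container for the individual skills, injected so the flow is testable.
interface TriageSteps {
  reproduce(): ReproduceResult;
  diagnose(): void;
  verify(): VerifyResult;
  fix(): void;
}

// Runs the steps in order and returns early when later steps would be wasted effort.
function runTriagePipeline(steps: TriageSteps): TriageOutcome {
  const reproduction = steps.reproduce();
  if (!reproduction.reproducible) return "not-reproducible";

  steps.diagnose();

  // Verify runs after diagnose; an "intended" verdict skips the fix step entirely.
  const verification = steps.verify();
  if (verification.verdict === "intended-behavior") return "intended-behavior";

  steps.fix();
  return "fix-attempted";
}
```

Injecting the steps keeps the control flow separate from the LLM calls, so the skip logic can be checked without running any skill.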

changeset-bot · 2026-02-14T06:28:54Z

⚠️ No Changeset found

Latest commit: d5bcddb

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

* fix(markdoc): sync custom attributes between tags and nodes with shared names

In Markdoc, `table` exists as both a tag (`{% table %}`) and a node (the inner table structure). When users configure custom attributes on `nodes.table` or `tags.table`, the AST propagates those attributes to both the tag and node, but validation only checks the schema for each type independently. This caused "Invalid attribute" errors when attributes were declared on only one side.

Add `syncTagNodeAttributes()` to automatically merge attribute declarations between tags and nodes that share the same name after config setup, so users can define attributes on either side.

Fixes #14220

* chore: clarify why explicit types are needed on builtinTags/builtinNodes
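A minimal sketch of the merge idea behind `syncTagNodeAttributes()`, assuming simplified config shapes. The `Schema` and `Config` types and the conflict rule (each side keeps its own declaration when both define the same attribute) are assumptions for illustration, not the actual implementation.

```typescript
// Hypothetical, simplified shapes; real Markdoc schemas carry more fields.
type AttributeMap = Record<string, unknown>;
type Schema = { attributes?: AttributeMap };
type Config = { tags?: Record<string, Schema>; nodes?: Record<string, Schema> };

// For every name declared as both a tag and a node, give both sides the union
// of the two attribute maps so validation passes on either side.
function syncTagNodeAttributes(config: Config): void {
  const tags = config.tags ?? {};
  const nodes = config.nodes ?? {};
  for (const name of Object.keys(tags)) {
    const tag = tags[name];
    const node = nodes[name];
    if (!node) continue; // name exists only as a tag, nothing to sync

    const tagAttrs = tag.attributes ?? {};
    const nodeAttrs = node.attributes ?? {};
    // Assumption: on a conflict, each side's own declaration wins.
    tag.attributes = { ...nodeAttrs, ...tagAttrs };
    node.attributes = { ...tagAttrs, ...nodeAttrs };
  }
}
```

With this in place, an attribute declared only on `tags.table` also becomes valid on `nodes.table`, and vice versa.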

…m sandbox

The sandbox subagents had no access to issue data because the orchestrator fetched it but never passed it through. This caused the reproduce skill to attempt gh CLI calls, which fail without a token in the sandbox.

- Extract IssueDetails valibot schema and type in issue-triage.ts
- Fetch additional fields (author, labels, state, authorAssociation, etc.)
- Pass issueDetails as args to reproduce, diagnose, verify, and fix skills
- Add issueDetails prerequisite to diagnose, verify, and fix skill docs
- Replace gh CLI commands in sandbox-run skills with curl/git alternatives
- Use author_association field for maintainer detection instead of gh api
- Remove gh CLI from sandbox Dockerfile (not usable without token)
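The `author_association` approach might look something like the sketch below. GitHub includes this field in the issue payload, so no authenticated `gh api` call is needed. Which values the triage bot actually treats as "maintainer" is not stated here, so the set below and the `isMaintainer` helper name are assumptions.

```typescript
// Values GitHub documents for `author_association` on issues and comments.
type AuthorAssociation =
  | "COLLABORATOR"
  | "CONTRIBUTOR"
  | "FIRST_TIMER"
  | "FIRST_TIME_CONTRIBUTOR"
  | "MANNEQUIN"
  | "MEMBER"
  | "NONE"
  | "OWNER";

// Assumption: these three associations are the ones treated as maintainers.
const MAINTAINER_ASSOCIATIONS: ReadonlySet<AuthorAssociation> = new Set<AuthorAssociation>([
  "OWNER",
  "MEMBER",
  "COLLABORATOR",
]);

// Hypothetical helper: decides maintainer status from already-fetched issue data.
function isMaintainer(authorAssociation: AuthorAssociation): boolean {
  return MAINTAINER_ASSOCIATIONS.has(authorAssociation);
}
```

Because the decision only needs data the orchestrator already fetched, it can run inside the sandbox without any token.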

…to comment skill

The comment skill now receives available priority labels and selects one as part of rendering the comment. This makes the priority judgment visible in the posted comment (answering 'how bad is it?') and lets the downstream label selector simply extract it rather than deciding independently.

- Extract fetchRepoLabels helper with valibot validation
- Pass priorityLabels to comment skill, packageLabels to label selector
- Simplify selectTriageLabels to extract priority from comment + pick packages

- Add assert helper for runtime invariant checks - Validate flue.args with valibot instead of unsafe type cast - Add v.nonEmpty() to label result schema, remove dead null guard - Assert fetched label arrays are non-empty before proceeding
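An assert helper for runtime invariants is commonly written with a TypeScript assertion signature, roughly as below. This is a sketch rather than the helper from this PR; the message format and the `firstLabel` caller are hypothetical.

```typescript
// Throws when the condition is falsy. The `asserts condition` return type lets
// TypeScript narrow types after the call.
function assert(condition: unknown, message: string): asserts condition {
  if (!condition) {
    throw new Error(`Assertion failed: ${message}`);
  }
}

// Hypothetical caller: require a non-empty label array before proceeding.
function firstLabel(labels: string[] | undefined): string {
  assert(labels !== undefined && labels.length > 0, "expected at least one label");
  // `labels` is narrowed to string[] here, so no null guard is needed.
  return labels[0];
}
```

Failing fast like this makes an empty label fetch surface as a clear error instead of silently producing a triage run without labels.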

Fred K. Schott added 10 commits February 13, 2026 21:12

chore: clean up AGENTS.md structure and wording

68110d6

auto-triage: add verify step to check if behavior is intentional

adeae83

Adds a new verification phase between diagnose and fix that researches whether reported behavior is an actual bug or intended design. This prevents wasting effort attempting fixes for non-bugs.

auto-triage: add feasibility check and compiler support to fix skill

6d811d5

Adds a new step to verify browser/runtime compatibility before implementing a fix. Also adds compiler repo awareness and renumbers steps accordingly.

auto-triage: wire verify step into the workflow pipeline

51ad98e

Adds verifyResultSchema and runs verify after diagnose. Skips the fix step when verification determines behavior is intended. Also clones the compiler repo in CI so diagnose/fix skills can reference it.

update workflows

e6f1983

chore: bump @flue/cli to 0.0.20 and @flue/client to 0.0.12

f0d7fbd

auto-triage: ensure all skills read report.md before writing

12b5a13

Adds explicit 'read report.md' to the critical instruction at the top of every skill file, so agents always load prior context before appending their own findings.

auto-triage: simplify diagnose review step

77632f7

The bullet list was redundant — the agent reads report.md and extracts whatever context it needs. Replace with a single instruction.

FredKSchott changed the title ~~Fks/triage 6~~ auto-triage bot improvements Feb 14, 2026

github-actions bot added the 🚨 action Modifies GitHub Actions label Feb 14, 2026

Fred K. Schott added 10 commits February 13, 2026 22:32

ignore triage folder in eslint

c53c97c

update workflows

4f5a134

auto-triage: run LLM in Docker sandbox for secret isolation

178d208

chore: update @flue/cli to 0.0.21 (sandbox support)

f17a286

fix: lowercase Docker image tags for GHCR compatibility

1d9ac93

chore: rename sandbox image to flue-astro-sandbox

ef0ea98

Revert "chore: rename sandbox image to flue-astro-sandbox"

106d612

This reverts commit ef0ea98.

chore: trigger sandbox image build on workflow file changes

fc1e9f5

chore: use local context for Docker build, trigger on workflow file changes

2d84393

fix: use shell-level $IMAGE var so lowercase override applies

568977e

Princesseuh approved these changes Feb 14, 2026

Fred K. Schott added 6 commits February 14, 2026 15:53

chore: update @flue/cli to 0.0.22 (sandbox proxy fix)

7109d74

chore: update @flue/cli to 0.0.23

eeb2cc6

chore: update @flue/cli to 0.0.24

af45241

chore: update @flue/cli to 0.0.25

9c05202

chore: update @flue/cli to 0.0.26

0b569b3

fix: make sandbox Dockerfile work for non-root container users

a1ba219

AhmadYasser1 and others added 2 commits February 14, 2026 16:44

triage: improve Fix line to distinguish 'already fixed' from 'unable to fix'

6b5947f

github-actions bot added the pkg: integration Related to any renderer integration (scope) label Feb 15, 2026

Fred K. Schott and others added 11 commits February 14, 2026 16:58

chore: update @flue/cli to 0.0.27 and @flue/client to 0.0.14

ea2969b

ci: add sandbox image verification step to issue triage workflow

a6f0335

chore: update @flue/cli to 0.0.28 and @flue/client to 0.0.15

2a274db

triage: inline fetchIssue and fix double-parsing of issueDetails

193c66e

triage: switch sandbox base image to node:24-bookworm-slim

8726744

triage: simplify sandbox gh CLI comment

951262f

chore: update @flue/cli to 0.0.30 and @flue/client to 0.0.17

baa6d74

Merge branch 'main' into fks/triage-6

a741385

github-actions bot removed the pkg: integration Related to any renderer integration (scope) label Feb 15, 2026

docs: clarify AGENTS.md monorepo description

d5bcddb

FredKSchott merged commit 40f10bb into main Feb 15, 2026
21 checks passed

FredKSchott deleted the fks/triage-6 branch February 15, 2026 05:59

This was referenced Feb 15, 2026

auto-triage improvements: fix tests, subagents, triage directory #15522

Merged

auto-triage improvements: fix labeling #15523

Merged

FredKSchott merged 40 commits into main from fks/triage-6
