Commit 1dfa3ec
feat: add agentic-architect plugin with architecture design and plugin review skills
* feat: add agentv-plugin-review skill and eval
Add a new skill to agentv-dev for reviewing AI plugin PRs across three
dimensions: skill quality, eval correctness, and workflow architecture.
Includes:
- SKILL.md with 5-step review process
- references/eval-checklist.md for eval file review
- references/workflow-checklist.md for OpenSpec-based workflow review
- 8-test eval covering missing evals, naming, assertions, file paths,
repeated inputs, hard gates, factual contradictions, and command refs
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add workspace-based eval with mock plugin and planted issues
Rewrite eval to use pi-cli target with a mock workspace containing a
deploy-auto plugin with planted issues: missing eval, wrong extensions,
no assertions, relative paths, repeated inputs, missing hard gates,
factual contradiction, nonexistent command reference, and hardcoded paths.
Uses shorthand input, skill-trigger assertions, and workspace template
instead of loading SKILL.md directly.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: use default target instead of invalid pi-cli
The pi-coding-agent provider is configured as the 'default' target
in .agentv/targets.yaml. pi-cli is not a valid target name.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add pi-cli target and use it for plugin-review eval
Add pi-cli as a named target in .agentv/targets.yaml using the
pi-coding-agent provider. Update the plugin-review eval to use it.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add skill quality checklist from Superpowers writing-skills
Add references/skill-quality-checklist.md covering CSO (Claude Search
Optimization), description anti-patterns, content quality, file org,
flowchart usage, and discipline-enforcing skill checks.
Update SKILL.md Step 2 to reference the checklist and integrate key
CSO principles (description must not summarize workflow, etc).
Remove plugin.json from review table (handled by CI).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: create agentic-architect plugin with architecture design and plugin review skills
Move agent-plugin-review from agentv-dev to new agentic-architect plugin.
Add agent-architecture-design skill with species taxonomy (Coding Harness,
Dark Factory, Auto Research, Orchestration) and workflow patterns from
OpenSpec, Superpowers, and Compound Engineering.
New plugin structure:
- agent-architecture-design: diagnose problem, select species, design workflow
- agent-plugin-review: review skills, evals, and workflow architecture
Register in both .claude-plugin and .github marketplace.json.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add pi-coding-agent target and fix pi-cli provider
- pi-cli target now uses provider: pi-cli (matching its name)
- pi-coding-agent target added as explicit alias matching default config
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: remove codex cwd and log_dir from targets.yaml
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* refactor: replace "species" terminology with industry-standard "agentic design patterns"
Rename species-taxonomy.md to agentic-design-patterns.md. Replace all
"Species A/B/C/D" labels with standard terms: Single-Agent Iterative Loop,
Autonomous Pipeline, Optimization Loop, Multi-Agent System.
Based on Andrew Ng's agentic design patterns framework and industry
conventions (cognitive architectures, agentic workflows).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: remove name attribution from design patterns reference
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: rename evals/architecture to evals/self
These evals validate the agentv repo itself, not architecture patterns.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add deterministic linting scripts for eval and plugin review
- agentv-eval-review: new skill in agentv-dev with lint_eval.py script
for .eval.yaml naming, file paths, assertions, prose detection, repeated inputs
- agent-plugin-review: add lint_plugin.py script for frontmatter, hardcoded
paths, version printing, missing evals, referenced files, command refs
- Remove eval-checklist.md from agent-plugin-review (handled by agentv-eval-review)
- Both SKILLs are now minimal wrappers: run script first, then LLM judgment
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add self-consistency and cross-file consistency checks
Address remaining LLM-judgment gaps from PR #324 review:
- Internal self-consistency (skill contradicting itself)
- Cross-file consistency (filenames/tools matching across skills and evals)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: simplify consistency check to single line
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: remove .entire, .claude, .github/hooks, .opencode/plugins
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: rename evals/self/dataset.eval.yaml to eval.yaml
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: remove execution target from self eval
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: flag manual routing workarounds as weak description signal
AGENTS.md or instruction files with heavy TRIGGER/ACTION routing tables
indicate skill descriptions aren't enabling auto-discovery. Good
descriptions make manual routing unnecessary.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add external dependencies check to plugin review
Skills that depend on MCP servers, external repos, specific directory
layouts, or services should declare these explicitly.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* refactor: remove external dependencies check, add agentskills.io references
External dependencies is not a best practice per agentskills.io — the
compatibility field and gotchas sections handle this. The real issue
(hardcoded paths) is already caught by lint_plugin.py.
Add agentskills.io specification, best practices, and description
optimization as external references for discoverable guidance.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: add agentskills.io using-scripts reference
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* ci: add markdown link validation workflow
Check relative markdown links on push to main and PRs using lychee.
Offline mode only (local links, not HTTP URLs).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: repair 12 broken markdown links across 3 files
- apps/cli/README.md: add ../../ prefix for repo-root references
- examples/showcase/offline-grader-benchmark/README.md: fix CLAUDE.md depth
- packages/eval/README.md: agentv-eval-builder → agentv-eval-writer
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: exclude apps/examples symlink from link checker
apps/examples is a symlink to ../examples which causes lychee to
resolve relative paths incorrectly (../../../CLAUDE.md resolves to
apps/CLAUDE.md instead of repo root CLAUDE.md).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: remove apps/examples symlink
Unused symlink to ../examples that confused lychee link checker.
examples/ is accessible directly at repo root.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* refactor: symlink apps/cli/README.md to root README
apps/cli/README.md is published to npm and was a stale copy of the
root README with diverging relative links. Replace with a symlink so
it stays in sync automatically.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 940988a commit 1dfa3ec
File tree
33 files changed
+1597
-922
lines changed- .agentv
- .claude-plugin
- .claude
- .entire
- .github
- hooks
- plugin
- workflows
- .opencode/plugins
- apps
- cli
- evals
- agentic-architect
- workspace-template
- evals/deploy-auto
- plugins/deploy-auto
- commands
- skills
- deploy-execute
- references
- deploy-plan
- references
- deploy-rollback
- self
- examples/showcase/offline-grader-benchmark
- packages/eval
- plugins
- agentic-architect/skills
- agent-architecture-design
- references
- agent-plugin-review
- references
- scripts
- agentv-dev/skills/agentv-eval-review
- scripts
33 files changed
+1597
-922
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
13 | 25 | | |
14 | 26 | | |
15 | 27 | | |
16 | | - | |
17 | | - | |
18 | 28 | | |
19 | 29 | | |
20 | 30 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
19 | 24 | | |
20 | 25 | | |
21 | 26 | | |
This file was deleted.
This file was deleted.
This file was deleted.
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
19 | 24 | | |
20 | 25 | | |
21 | 26 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
This file was deleted.
0 commit comments