Add evalview-agent-testing — regression testing for AI agents#153

Open
hidai25 wants to merge 1 commit into BehiSecc:main from hidai25:feat/add-evalview

Conversation

@hidai25 commented Mar 23, 2026

Adds EvalView to the development tools section.

EvalView provides regression testing for AI agents: snapshot agent behavior, detect when tool calls or output quality drift, and block broken agents before they reach production.

Features:

  • Golden baseline diffing
  • Python API (gate()) for autonomous loops
  • MCP server (8 tools) for Claude Code
  • CI/CD with PR comments
  • Multi-turn test support

Published on PyPI (pip install evalview), Apache 2.0 license.
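The PR description does not show EvalView's actual API, so the sketch below is purely illustrative: it demonstrates the golden-baseline diffing idea (record a known-good trace of tool calls, diff new runs against it, and gate on any drift) in plain Python. The `diff_tool_calls` and `gate` functions and the trace format are hypothetical, not EvalView's real interface.

```python
# Illustrative sketch of golden-baseline diffing for agent tool calls.
# All names and the trace format here are hypothetical, not EvalView's API.

# Golden baseline: tool calls recorded from a known-good agent run.
golden = [
    {"tool": "search", "args": {"query": "refund policy"}},
    {"tool": "answer", "args": {"text": "Refunds within 30 days."}},
]

# A new run to check; the second step has drifted to a different tool.
current = [
    {"tool": "search", "args": {"query": "refund policy"}},
    {"tool": "lookup_db", "args": {"table": "orders"}},
]


def diff_tool_calls(baseline, run):
    """Return human-readable differences between two tool-call traces."""
    diffs = []
    for i, (b, r) in enumerate(zip(baseline, run)):
        if b != r:
            diffs.append(f"step {i}: expected {b['tool']!r}, got {r['tool']!r}")
    if len(baseline) != len(run):
        diffs.append(f"length: expected {len(baseline)} steps, got {len(run)}")
    return diffs


def gate(baseline, run):
    """Pass (True) only when the run matches the golden baseline exactly."""
    return not diff_tool_calls(baseline, run)


if __name__ == "__main__":
    print(gate(golden, golden))   # identical trace passes
    print(gate(golden, current))  # drifted trace is blocked
```

In a CI pipeline, a gate like this would run on every PR and fail the build when the agent's behavior diverges from the recorded baseline, which is the workflow the feature list above describes.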

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
