Skip to content

Conversation

@mmabrouk
Copy link
Member

Summary

  • Updates evaluation + testset docs/examples to use the current async SDK (ag.init, ag.testsets.*, aevaluate) instead of legacy client.testsets.* and legacy /api/testsets/* endpoints.
  • Fixes code blocks so they run as regular Python scripts (wrap async examples with asyncio.run(...)), while still working in notebooks/Colab.
  • Clarifies testset versioning by distinguishing testset_id (stable) from revision id (changes per update).

Why

Legacy /api/testsets/* endpoints are deprecated and currently problematic; these docs were still guiding users toward the old API shape.

Validation

  • Verified against production (https://cloud.agenta.ai) using uv run scripts (SDK CRUD + direct API calls + aevaluate).

Notes

  • API reference docs remain auto-generated from OpenAPI; this PR does not manually edit them.

Switch examples from legacy /testsets APIs to ag.testsets + aevaluate, and clarify revision vs testset IDs.
@vercel
Copy link

vercel bot commented Jan 15, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Review Updated (UTC)
agenta-documentation Ready Ready Preview, Comment Jan 15, 2026 7:32pm

Review with Vercel Agent

@dosubot dosubot bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label Jan 15, 2026
@dosubot dosubot bot added documentation Improvements or additions to documentation Evaluation SDK labels Jan 15, 2026
@bekossy bekossy changed the base branch from main to release/v0.77.4 January 16, 2026 10:53
@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Jan 16, 2026
@bekossy bekossy merged commit ec51851 into release/v0.77.4 Jan 16, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation Evaluation lgtm This PR has been approved by a maintainer SDK size:XL This PR changes 500-999 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants