(feat) Add multiple rounds support for evaluation runs (#19)
* feat: Add multiple rounds support for evaluation runs
- Add rounds query parameter to execute_eval_run endpoint
- Generate unique run_id (UUID) for each round
- Use existing run_id column to group evaluation results
- Update frontend to support rounds input field
- Add comprehensive integration tests for the feature
- Default to 1 round for backward compatibility
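The default-to-1 behaviour of the rounds query parameter could look roughly like the sketch below. It is an illustration only: `parse_rounds` and `RoundsError` are hypothetical names, and rejecting `rounds=0` is an assumption that the commit message does not state.

```rust
/// Hypothetical error type for an invalid `rounds` query value.
#[derive(Debug, PartialEq)]
enum RoundsError {
    /// The value was not a non-negative integer.
    NotANumber(String),
    /// Assumption: zero rounds is treated as invalid.
    Zero,
}

/// Resolve the optional `rounds` query value.
/// A missing parameter falls back to 1, so existing callers keep working.
fn parse_rounds(raw: Option<&str>) -> Result<u32, RoundsError> {
    match raw {
        // Parameter omitted: a single round, as before the feature existed.
        None => Ok(1),
        Some(text) => {
            let n = text
                .trim()
                .parse::<u32>()
                .map_err(|_| RoundsError::NotANumber(text.to_string()))?;
            if n == 0 {
                Err(RoundsError::Zero)
            } else {
                Ok(n)
            }
        }
    }
}
```

Generating the per-round run_id is left out here because it would need a UUID library.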
* fix: Maintain backward compatibility for eval runs API
- Return single PromptEvalExecutionRunResponse when rounds=1 (default)
- Return array of responses only when rounds > 1
- Update frontend to handle both response types
- Add test for backward compatibility
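One way to express the "single object for one round, array for several" rule is an enum with two variants, sketched below. `ExecuteResponse` and `shape_response` are hypothetical names, and the generic `T` stands in for PromptEvalExecutionRunResponse. How the real endpoint serializes the two shapes is not shown in the commit message.

```rust
/// Hypothetical wrapper for the two response shapes.
#[derive(Debug, PartialEq)]
enum ExecuteResponse<T> {
    /// rounds = 1: one response object, as older clients expect.
    Single(T),
    /// rounds > 1: one response per round.
    Multiple(Vec<T>),
}

/// Pick the response shape from the requested number of rounds.
/// Returns `None` when a single round was requested but no run was produced.
fn shape_response<T>(mut runs: Vec<T>, rounds: u32) -> Option<ExecuteResponse<T>> {
    if rounds <= 1 {
        // A one-round request yields one run; hand it back unwrapped.
        runs.pop().map(ExecuteResponse::Single)
    } else {
        Some(ExecuteResponse::Multiple(runs))
    }
}
```

A client that handles both shapes, as the frontend change above does, would check whether it received one object or a list before reading the results.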
* docs: Add comprehensive feature summary for eval rounds
* chore: Remove documentation files
* fix: Address PR review feedback for eval rounds
- Replace unwrap calls with map_err-based error handling
- Reverse loop order: iterate over evals first, then rounds
- Add tool calling support with proper error handling
- Maintain backward compatibility for single round requests
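The review changes above (evals in the outer loop, rounds in the inner loop, and map_err instead of unwrap) can be sketched as follows. Every name here is hypothetical: `run_evals`, `RoundResult`, and `RunError` are not taken from the PR, and the `execute` closure returning a numeric string is a stand-in for the real evaluation call.

```rust
/// Hypothetical record of one executed round.
#[derive(Debug, PartialEq)]
struct RoundResult {
    eval_id: String,
    round: u32,
    score: i64,
}

/// Hypothetical error carrying enough context to locate the failure.
#[derive(Debug, PartialEq)]
enum RunError {
    BadOutput { eval_id: String, round: u32, reason: String },
}

/// Run every eval for the requested number of rounds.
/// The outer loop walks the evals and the inner loop walks the rounds,
/// so all rounds of one eval finish before the next eval starts.
fn run_evals<G>(eval_ids: &[&str], rounds: u32, mut execute: G) -> Result<Vec<RoundResult>, RunError>
where
    G: FnMut(&str, u32) -> String,
{
    let mut results = Vec::new();
    for &eval_id in eval_ids {
        for round in 1..=rounds {
            let raw = execute(eval_id, round);
            // map_err turns the parse failure into a typed error;
            // `?` returns it to the caller instead of panicking.
            let score = raw.trim().parse::<i64>().map_err(|e| RunError::BadOutput {
                eval_id: eval_id.to_string(),
                round,
                reason: e.to_string(),
            })?;
            results.push(RoundResult {
                eval_id: eval_id.to_string(),
                round,
                score,
            });
        }
    }
    Ok(results)
}
```

In this sketch the first failing round aborts the whole run. Whether the actual endpoint stops or continues after a failure is not stated in the commit message.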
---------
Co-authored-by: tunahorse <[email protected]>