Skip tasks with insufficient samples for pass@k instead of raising ex… (#113)
Summary:
When computing pass@k metrics with k > 1, tasks that have fewer than k samples are now skipped with a logged warning instead of raising a ValueError, which previously terminated the entire results processing.
Changes:
- Replace ValueError with warning log when n_samples < k for a task
- Add logging module import and logger instance
- Collect skipped groups and log them with full context (dataset, agent, attack, task_id, and sample count)
- Add check for empty DataFrame after filtering in aggregate_results
- Update docstrings to reflect new behavior (Note instead of Raises)
- Refactor: remove job_name from group_cols so that results can be aggregated across multiple runs of the same experiment
- Add generic variant_name support alongside legacy template_short_name
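A minimal sketch of the skip-and-warn behavior described above, not the code from this commit. It assumes the standard unbiased pass@k estimator, 1 - C(n - c, k) / C(n, k), and uses plain (task_id, passed) pairs in place of the DataFrame and group_cols the change refers to. The names pass_at_k and pass_at_k_by_task are hypothetical.

```python
import logging
from collections import defaultdict
from math import comb

logger = logging.getLogger(__name__)


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for n samples with c passes (requires n >= k)."""
    if n - c < k:
        # Fewer than k failures: every size-k subset contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def pass_at_k_by_task(records, k):
    """Compute pass@k per task, skipping tasks with fewer than k samples.

    records: iterable of (task_id, passed) pairs (hypothetical input shape).
    Returns (scores, skipped), where skipped lists (task_id, n_samples).
    """
    outcomes = defaultdict(list)
    for task_id, passed in records:
        outcomes[task_id].append(bool(passed))

    scores, skipped = {}, []
    for task_id, results in outcomes.items():
        n = len(results)
        if n < k:
            # Previously this case would raise ValueError; now it is recorded.
            skipped.append((task_id, n))
            continue
        scores[task_id] = pass_at_k(n, sum(results), k)

    if skipped:
        logger.warning(
            "Skipped %d task(s) with fewer than k=%d samples: %s",
            len(skipped), k, skipped,
        )
    return scores, skipped


if __name__ == "__main__":
    records = [("a", True), ("a", False), ("a", False), ("b", True)]
    scores, skipped = pass_at_k_by_task(records, k=2)
    print(scores, skipped)
```

With k=2, task "a" has three samples and is scored, while task "b" has one sample and is reported in skipped rather than aborting the run. A caller would still need to handle the case where every task is skipped, which corresponds to the empty-DataFrame check listed above.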
Differential Revision: D92393526
Pulled By: evtimovi