feat: allow users to define task groups in LMEvalJob#471
feat: allow users to define task groups in LMEvalJob#471yhwang wants to merge 1 commit intotrustyai-explainability:mainfrom
Conversation
Reviewer's GuideThis PR adds support for user-defined TaskGroups within LMEvalJob by extending the Kubernetes API schema, controller validation, CLI flags, command-generation logic, and driver functionality to accept, validate, and produce grouped task definitions alongside existing tasks. File-Level Changes
Possibly linked issues
Tips and commandsInteracting with Sourcery
Customizing Your ExperienceAccess your dashboard to:
Getting Help
|
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
provide a simple LMEvalJob which uses the new TaskGroups: |
There was a problem hiding this comment.
Hey @yhwang - I've reviewed your changes and they look great!
Here's what I looked at during the review
- 🟡 General issues: 3 issues found
- 🟢 Security: all looks good
- 🟢 Testing: all looks good
- 🟢 Documentation: all looks good
Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.
|
PR image build and manifest generation completed successfully! 📦 PR image: 📦 LMES driver image: 📦 LMES job image: 📦 Guardrails orchestrator image: 🗂️ CI manifests |
Add new field: TaskGroups under the TaskList to support custom task group. User can define a custom task group and specify a list of aggregate metrics. In the result JSON, the task groups have a dedicated section of their results. Signed-off-by: Yihong Wang <yh.wang@ibm.com>
|
Hi @ruivieira, thanks for the review and comments. I applied this pattern |
|
@yhwang: The following test failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
Add new field: TaskGroups under the TaskList to
support custom task group. User can define a
custom task group and specify a list of aggregate
metrics. In the result JSON, the task groups have
a dedicated section of their results.
related: #445
Summary by Sourcery
Enable defining named task groups with custom aggregate metrics in LMEvalJob and propagate them through the CLI, controller, and driver
New Features:
Enhancements:
Tests: