Skip to content

Enforce SUT uid params#1495

Merged
bkorycki merged 3 commits intomainfrom
enforce-uid-params
Mar 11, 2026
Merged

Enforce SUT uid params#1495
bkorycki merged 3 commits intomainfrom
enforce-uid-params

Conversation

@bkorycki
Copy link
Copy Markdown
Contributor

@bkorycki bkorycki commented Mar 9, 2026

Goal: We should enforce that SUT UID config parameterizations are actually used.

Right now, if we runopenai/gpt-4o;reas=y;mod=y the factory will completely ignore the values after ; and end up running just the regular openai/gpt-4o sut. This is bad because the UID should be meaningful and accurate.

This PR enforces that the values after ; are going to be used/are compatible with the SUT factory.
For example, the meta llama SUT factory can update it's method signature to def make_sut(self, sut_definition: SUTDefinition, moderated: bool=False) -> SUT: and then return either a MetaLlamaSUT or a MetaLlamaModeratedSUT depending on the kwarg value.

We can handle reasoning in the same way.

@bkorycki bkorycki requested review from superdosh and wpietri March 9, 2026 21:42
@bkorycki bkorycki requested a review from a team as a code owner March 9, 2026 21:42
@bkorycki bkorycki temporarily deployed to Scheduled Testing March 9, 2026 21:42 — with GitHub Actions Inactive
@github-actions
Copy link
Copy Markdown

github-actions bot commented Mar 9, 2026

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Copy link
Copy Markdown
Contributor

@wpietri wpietri left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ooh, excellent idea, and good job putting your finger on a problem. I love this. Turning the SUT parameters into method arguments is a great way to do it.

Copy link
Copy Markdown
Contributor

@superdosh superdosh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the live overview; makes a lot of sense!

@bkorycki bkorycki merged commit 58f6b7a into main Mar 11, 2026
2 checks passed
@bkorycki bkorycki deleted the enforce-uid-params branch March 11, 2026 18:11
@github-actions github-actions bot locked and limited conversation to collaborators Mar 11, 2026
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants