LLM dataset limit updates by farook-edev · Pull Request #1111 · mlcommons/mobile_app_open

farook-edev · 2026-02-23T22:50:30Z

This PR addressed the points discussed in #1098 regarding datasets (specifically MMLU).

It does the following:

Prevent performance mode from running when query count is 0
set input/output token limits to 2048/1024 for IFEval and 2048/(4|128) for MMLU.
Update accuracy string format for MMLU and IFEval (from Accuracy: 50% to 50% for an accuracy of 0.5)

enable MMLU dataset to apply 2 different output_token limits based on run mode update token limits for MMLU and IFEval update accuracy string format for MMLU and IFEval

github-actions · 2026-02-23T22:50:41Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

sonarqubecloud · 2026-02-23T23:24:03Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

prevent performance mode from running when query count is 0

39163e8

enable MMLU dataset to apply 2 different output_token limits based on run mode update token limits for MMLU and IFEval update accuracy string format for MMLU and IFEval

farook-edev requested review from a team and anhappdev as code owners February 23, 2026 22:50

farook-edev mentioned this pull request Feb 23, 2026

Update v6.0 LLM Implementation #1098

Open

9 tasks

freedomtan approved these changes Feb 24, 2026

View reviewed changes

farook-edev merged commit 960c94a into submission-v6.0 Feb 24, 2026
40 of 41 checks passed

farook-edev deleted the mmlu_token_limits branch February 24, 2026 09:55

github-actions bot locked and limited conversation to collaborators Feb 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLM dataset limit updates#1111

LLM dataset limit updates#1111
farook-edev merged 1 commit intosubmission-v6.0from
mmlu_token_limits

farook-edev commented Feb 23, 2026

Uh oh!

github-actions bot commented Feb 23, 2026

Uh oh!

sonarqubecloud bot commented Feb 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

farook-edev commented Feb 23, 2026

Uh oh!

github-actions bot commented Feb 23, 2026

Uh oh!

sonarqubecloud bot commented Feb 23, 2026

Quality Gate passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants