Pull requests: open-compass/opencompass

Pull requests list

Update README.md for mmlu
#2374 opened Dec 29, 2025 by freemedom
[Dataset] Add dataset arena-hard-v2
#2362 opened Dec 16, 2025 by Myhs-phz
[Fix] fix eval_postprocess in gpqa
#2357 opened Dec 11, 2025 by qianyp18
feat(config): add Meta-Llama-3.1-8B-Instruct for MMLU benchmark
#2325 opened Nov 25, 2025 by 6taco
6 tasks
Reasonzoo
#2256 opened Sep 1, 2025 by epsilondylan
2 tasks
[Update] compassbench subjective config
#2238 opened Aug 14, 2025 by Myhs-phz
[fix] switch think mode
#2233 opened Aug 10, 2025 by Tianhao-Peng
2 of 6 tasks
[Fix] add tqdm for BlueLMAPI
#2226 opened Aug 1, 2025 by Myhs-phz
[Feature] Support eval for C-Eval test split
#2218 opened Jul 24, 2025 by HYZ17
[Fix] Modify error handling and fix typo
#2216 opened Jul 24, 2025 by Owen-Qin
[Fix] Deprecate unused and error formated math500 gen file
#2206 opened Jul 17, 2025 by liushz
6 tasks
[Fix] livecodebench serialization and timeout errors
#2204 opened Jul 15, 2025 by f14-bertolotti
6 tasks
fix bug in openai_api
#2200 opened Jul 11, 2025 by LKJacky
ruler os.environ fix
#2191 opened Jul 9, 2025 by f14-bertolotti
feat:enhanced mbpp process answer patterns
#2187 opened Jul 8, 2025 by ShikangPang
6 tasks
[fix] Update README.md with a typo
#2177 opened Jul 1, 2025 by ktwu01
Add GrandPhysics dataset
#2118 opened May 26, 2025 by Xiao-Youth
1 of 6 tasks
PromptCBLUE:Life Science dataset
#2073 opened May 4, 2025 by tchenglv520
6 tasks done