Skip to content

mmlu-pro review is much worse #3

@ABCDabcde1098234

Description

@ABCDabcde1098234

Your work is great, but I have a question. I used your code to evaluate mmlu-pro, using the same model, and https://github.com/TIGER-AI-Lab/MMLU-Pro There is a significant difference in the evaluation scores, may I ask what is going on? Looking forward to your reply.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions