Skip to content

Conversation

@PenghaoYin
Copy link

Thanks for your excellent repository!

We’d like to add our new paper, GenExam, to your repo.

GenExam is the first benchmark that evaluates generative models through multidisciplinary exam-style tests. Results show that even state-of-the-art models achieve below 15% accuracy, while most open-source models are near 0%, highlighting the difficulty and offering insights for advancing general AGI.

@CLAassistant
Copy link

CLAassistant commented Sep 22, 2025

CLA assistant check
All committers have signed the CLA.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants