Skip to content

Commit 4b82995

Browse files
committed
Update Tracking Figure
1 parent 81b1f3a commit 4b82995

File tree

2 files changed

+5
-0
lines changed

2 files changed

+5
-0
lines changed

index.html

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -49,6 +49,9 @@
4949
<a class="navbar-item" href="https://huggingface.co/datasets/MMMU/MMMU_Pro">
5050
<b>MMMU-Pro</b> <span style="font-size:18px; display: inline; margin-left: 5px;">🔥</span>
5151
</a>
52+
<a class="navbar-item" href="https://videommmu.github.io/">
53+
<b>Video-MMMU</b> <span style="font-size:18px; display: inline; margin-left: 5px;">🔥</span>
54+
</a>
5255
<a class="navbar-item" href="https://tiger-ai-lab.github.io/MAmmoTH/">
5356
MAmmoTH
5457
</a>
@@ -238,6 +241,8 @@ <h2 class="title is-3">Introduction</h2>
238241
<p>
239242
We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning. MMMU includes <b>11.5K</b> meticulously collected multimodal questions from college exams, quizzes, and textbooks, covering six core disciplines: Art & Design, Business, Science, Health & Medicine, Humanities & Social Science, and Tech & Engineering. These questions span <b>30</b> subjects and <b>183</b> subfields, comprising 30 highly heterogeneous image types, such as charts, diagrams, maps, tables, music sheets, and chemical structures. Unlike existing benchmarks, MMMU focuses on advanced perception and reasoning with domain-specific knowledge, challenging models to perform tasks akin to those faced by experts. Our evaluation of 14 open-source LMMs and the proprietary GPT-4V(ision) highlights the substantial challenges posed by MMMU. Even the advanced GPT-4V only achieves a 56% accuracy, indicating significant room for improvement. We believe MMMU will stimulate the community to build next-generation multimodal foundation models towards expert artificial general intelligence.
240243
</p>
244+
<img src="static/images/MMMU_Tracking_Figure.png" alt="algebraic reasoning" width="100%">
245+
<br>
241246
</div>
242247
</div>
243248
</div>
826 KB
Loading

0 commit comments

Comments
 (0)