Skip to content

Commit 0098c93

Browse files
authored
Update README.md
1 parent 7053e7f commit 0098c93

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
# Multimodal Chain-of-Thought Reasoning in Language Models
22

3+
<h5 align="center"><i>"Imagine learning a textbook without figures or tables."</i></h5>
4+
35
Multimodal-CoT incorporates vision features in a decoupled training framework. The framework consists of two training stages: (i) rationale generation and (ii) answer inference. Both stages share the same model architecture but differ in the input and output.
46

57
![](vision_features/mm-cot.png)

0 commit comments

Comments
 (0)