Skip to content

Commit 0d65b58

Browse files
committed
Add abstract
1 parent b99dbab commit 0d65b58

File tree

2 files changed

+2
-6
lines changed

2 files changed

+2
-6
lines changed

.DS_Store

0 Bytes
Binary file not shown.

minicpm-o-4/index.html

Lines changed: 2 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -356,14 +356,10 @@ <h1>MiniCPM-o 4.0</h1>
356356
</div>
357357

358358
<!-- Abstract Section -->
359-
<!-- <div class="abstract-section">
359+
<div class="abstract-section">
360360
<h2>Abstract</h2>
361361
MiniCPM-o is the latest series of end-side multimodal LLMs (MLLMs) ungraded from MiniCPM-V. The models can now take images, video, text, and audio as inputs and provide high-quality text and speech outputs in an end-to-end fashion. MiniCPM-o 4.0 is the latest and most capable model in the MiniCPM-o series. With a total of 4B parameters, this end-to-end model achieves comparable performance to GPT-4o-202405 in vision, speech, and multimodal live streaming, making it one of the most versatile and performant models in the open-source community. For the new voice mode, MiniCPM-o 4.0 supports bilingual real-time speech conversation with customizable voices, and also allows for end-to-end voice cloning, role play, etc. Compared to MiniCPM-o-2.6, we enhancd the stability and naturalness of speech conversation by introducing architecture improvements and improved data pipelines. It also advances MiniCPM-V-2.6's visual capabilities such strong OCR capability, trustworthy behavior, multilingual support, and video understanding.
362-
<div class="content-placeholder">
363-
Insert your technical report abstract here. This section should contain a comprehensive overview
364-
of your speech synthesis system, its key innovations, and main contributions.
365-
</div>
366-
</div> -->
362+
</div>
367363

368364
<!-- System Overview Section -->
369365
<div class="overview-section">

0 commit comments

Comments
 (0)