3 | 3 | <head> |
4 | 4 | <meta charset="UTF-8"> |
5 | 5 | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
6 | | - <title>Speech Synthesis System Demo</title> |
| 6 | + <title>MiniCPM-o 4.0</title> |
7 | 7 |
8 | 8 | <!-- External CSS --> |
9 | 9 | <link href="https://fonts.googleapis.com/css?family=Roboto:300,400,500,700" rel="stylesheet"> |
344 | 344 | <!-- Main Header Section --> |
345 | 345 | <div class="main-header"> |
346 | 346 | <h1>MiniCPM-o 4.0</h1> |
347 | | - <p class="subtitle">A Family of High-Quality Versatile Speech Generation Models</p> |
348 | | - <p class="author-info"><em>Your Team Name</em></p> |
349 | | - <p class="author-info">Your Organization</p> |
| 347 | + <!-- <p class="subtitle">End-to-end, Customizable-Speaker, Stable, and Natural Voice Chat</p> --> |
| 348 | + <p class="author-info"><em>MiniCPM-o Team</em></p> |
| 349 | + <!-- <p class="author-info">OpenBMB</p> --> |
350 | 350 | <div class="links"> |
351 | | - <a href="#" target="_blank">[Paper]</a> |
352 | | - <a href="#" target="_blank">[Code]</a> |
353 | | - <a href="#" target="_blank">[Dataset]</a> |
354 | | - <a href="#" target="_blank">[Demo]</a> |
| 351 | + <a href="https://github.com/OpenBMB/MiniCPM-o" target="_blank">[Github]</a> |
| 352 | + <a href="https://huggingface.co/openbmb/MiniCPM-o-4" target="_blank">[HuggingFace]</a> |
| 353 | + <!-- <a href="#" target="_blank">[Dataset]</a> --> |
| 354 | + <!-- <a href="#" target="_blank">[Demo]</a> --> |
355 | 355 | </div> |
356 | 356 | </div> |
357 | 357 |
358 | 358 | <!-- Abstract Section --> |
359 | | - <div class="abstract-section"> |
| 359 | + <!-- <div class="abstract-section"> |
360 | 360 | <h2>Abstract</h2> |
| 361 | + MiniCPM-o is the latest series of end-side multimodal LLMs (MLLMs) upgraded from MiniCPM-V. The models can now take images, video, text, and audio as inputs and provide high-quality text and speech outputs in an end-to-end fashion. MiniCPM-o 4.0 is the latest and most capable model in the MiniCPM-o series. With a total of 4B parameters, this end-to-end model achieves comparable performance to GPT-4o-202405 in vision, speech, and multimodal live streaming, making it one of the most versatile and performant models in the open-source community. For the new voice mode, MiniCPM-o 4.0 supports bilingual real-time speech conversation with customizable voices, and also allows for end-to-end voice cloning, role play, etc. Compared to MiniCPM-o-2.6, we enhanced the stability and naturalness of speech conversation through architectural improvements and improved data pipelines. It also advances MiniCPM-V-2.6's visual capabilities, such as strong OCR, trustworthy behavior, multilingual support, and video understanding. |
361 | 362 | <div class="content-placeholder"> |
362 | 363 | Insert your technical report abstract here. This section should contain a comprehensive overview |
363 | 364 | of your speech synthesis system, its key innovations, and main contributions. |
364 | 365 | </div> |
365 | | - </div> |
| 366 | + </div> --> |
366 | 367 |
367 | 368 | <!-- System Overview Section --> |
368 | 369 | <div class="overview-section"> |