VAP3-964: Update Inworld page

nitishsurana · nitishsurana · commit 96219c040147 · 2025-07-10T10:09:43.000-07:00
diff --git a/fern/providers/voice/inworld.mdx b/fern/providers/voice/inworld.mdx
@@ -1,63 +1,35 @@
 ---
-title: InworldAI
-subtitle: What is Inworld.ai?
+title: Inworld
+subtitle: What is Inworld?
 slug: providers/voice/inworld
 ---
 
-**What is Inworld.ai?**
+**What is Inworld?**
 
-Inworld.ai provides developers with tools to create lifelike voice agents. It supports zero-shot voice cloning, enabling the creation of personalized voices from short audio samples. The system is optimized for low-latency streaming, making it suitable for applications requiring immediate audio responses.
+Inworld develops AI products for builders of consumer applications, enabling scaled applications that grow into user needs and organically evolve through experience. Its products include a text-to-speech (TTS) service that makes state-of-the-art voice AI radically more accessible for developers. Inworld TTS is optimized for low-latency streaming, making it suitable for applications requiring immediate audio responses.
 
-**The Evolution of AI Speech Synthesis:**
+**Overview of State-of-the-Art Inworld TTS:**
 
-Advancements in deep learning and neural networks have significantly improved the quality of AI-generated speech. Inworld.ai leverages these developments to deliver natural-sounding, emotionally expressive voices suitable for various applications, including virtual assistants and interactive games.
+Advancements in LLM-based speech models have significantly improved the quality of AI-generated speech. Inworld leverages these developments to deliver natural-sounding, emotionally expressive voices suitable for various applications, including virtual assistants, interactive games, and more. Inworld provides a comprehensive suite of features designed to meet diverse voice synthesis needs:
 
-**Overview of Inworld.ai's Offerings:**
+- Real-Time Speech Synthesis: Inworld is engineered for real-time performance, delivering the first 2-second audio chunk in as little as 200 ms. This responsiveness is critical for real-time applications such as conversational agents and interactive characters.
+- Multilingual Support: Inworld supports 11 languages, including English, Spanish, French, Korean, Chinese, and more. This multilingual capability enables developers to build applications for diverse global audiences.
+- Developer API: Inworld provides an API with comprehensive documentation, facilitating integration into various applications. The API supports real-time streaming and offers options for customizing voice parameters to suit specific use cases.
 
-Inworld.ai provides a comprehensive suite of features designed to meet diverse voice synthesis needs:
+**Use Cases:**
 
-**Real-Time Speech Synthesis:**
+Inworld TTS supports a wide range of applications:
 
-Inworld.ai  is engineered for low-latency performance, delivering the first two seconds of audio in approximately 200 milliseconds. This responsiveness is critical for real-time applications such as conversational agents and interactive gaming characters.
-
-**Zero-Shot Voice Cloning:**
-
-The platform offers zero-shot voice cloning, allowing developers to create custom voices from as little as 5 seconds of audio input. This feature facilitates the development of unique voice identities for various applications.
-
-**Multilingual Support:**
-
-Inworld.ai supports 11 languages, including English, Spanish, French, Korean, and Chinese. This multilingual capability enables developers to build applications for diverse global audiences.
-
-**Audio Markup Controls:**
-
-Developers can use audio markup tags such as [happy], [whispering], or [sigh] to control the emotional tone and style of the synthesized speech. This feature enhances the expressiveness of voice agents.
-
-**Developer API:**
-
-Inworld.ai provides an API with comprehensive documentation, facilitating integration into various applications. The API supports real-time streaming and offers options for customizing voice parameters to suit specific use cases.
-
-**Use Cases for Inworld.ai:**
-
-Inworld.ai's versatile platform supports a wide range of applications:
-
-**Interactive Applications:**
-
-Developers can create responsive voice agents for customer service, virtual assistants, and interactive gaming characters, enhancing user engagement through natural-sounding speech.
-
-**Content Creation:**
-
-Content creators can utilize Inworld.ai to generate high-quality voiceovers for videos, podcasts, and other media, streamlining the production process.
-
-**Education and Training:**
-
-Educational platforms can employ Inworld.ai to provide clear and expressive narration for e-learning materials, improving the learning experience for users.
+- Interactive Applications: Developers can create responsive voice agents for customer service, virtual assistants, and interactive characters, enhancing user engagement through natural-sounding speech.
+- Content Creation: Content creators can use Inworld to generate professional-grade voiceovers for videos, podcasts, and other media, streamlining the production process.
+- Education and Training: Educational platforms can use Inworld to provide clear and expressive narration for e-learning materials, improving the learning experience for users.
 
 **Integration with Vapi:**
 
-Inworld.ai's voice model is fully integrated with Vapi, giving developers an easy way to deploy expressive, low-latency voices in their assistants.
+Inworld voices are fully integrated with Vapi, giving developers an easy way to deploy expressive, low-latency voices in their assistants.
 
-To use Inworld.ai's model, open your assistant in the Vapi dashboard, scroll to the Voice Configuration section, choose Inworld as the provider, select a language and voice, then hit publish. And you're live.
+To use Inworld voices, open your assistant in the Vapi dashboard and scroll to the Voice Configuration section. Choose Inworld as the provider, select a language and voice, then hit publish. And you’re live!
 
 **Conclusion:**
 
-Inworld.ai offers a combination of expressive voice synthesis, low-latency performance, and multilingual support, making it a valuable tool for developers seeking to enhance their applications with natural-sounding speech.
+Inworld offers a combination of expressive voice synthesis, real-time performance, and multilingual support, making it a valuable tool for developers seeking to enhance their applications with natural-sounding speech.