Change the repository type filter
All
Repositories list
8 repositories
Omni-Diffusion
Public- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
VITA-Audio
Public- ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
LUCY
PublicSparrow
PublicVITA
Public✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech InteractionWoodpecker
Public✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models