Skip to content

Pinned Loading

  1. VITA VITA Public

    ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Python 2.4k 176

  2. Long-VITA Long-VITA Public

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    Python 291 28

  3. VITA-Audio VITA-Audio Public

    ✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Python 634 52

  4. Freeze-Omni Freeze-Omni Public

    ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

    Python 340 20

  5. Woodpecker Woodpecker Public

    ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Python 638 30

Repositories

Showing 7 of 7 repositories
  • Freeze-Omni Public

    ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

    VITA-MLLM/Freeze-Omni’s past year of commit activity
    Python 340 20 12 2 Updated May 27, 2025
  • VITA-Audio Public

    ✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    VITA-MLLM/VITA-Audio’s past year of commit activity
    Python 634 52 25 0 Updated May 24, 2025
  • Long-VITA Public

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    VITA-MLLM/Long-VITA’s past year of commit activity
    Python 291 28 6 0 Updated May 14, 2025
  • LUCY Public

    LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

    VITA-MLLM/LUCY’s past year of commit activity
    Python 55 3 11 0 Updated Apr 14, 2025
  • Sparrow Public

    Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

    VITA-MLLM/Sparrow’s past year of commit activity
    Jupyter Notebook 30 Apache-2.0 0 0 0 Updated Mar 28, 2025
  • VITA Public

    ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    VITA-MLLM/VITA’s past year of commit activity
    Python 2,404 176 56 1 Updated Mar 28, 2025
  • Woodpecker Public

    ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

    VITA-MLLM/Woodpecker’s past year of commit activity
    Python 638 30 2 0 Updated Dec 23, 2024

Most used topics

Loading…