wangxiongts

Follow

Xiong Wang wangxiongts

Follow

Speech/LLM Algorithm Engineer@Alibaba Qwen Team

378 followers · 0 following

Alibaba Qwen Team
Beijing
03:49 (UTC +08:00)

Achievements

Achievements

Pinned Loading

QwenLM/Qwen3-TTS QwenLM/Qwen3-TTS Public

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 9.7k 1.2k
QwenLM/Qwen3-Omni QwenLM/Qwen3-Omni Public

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3.5k 236
QwenLM/Qwen2.5-Omni QwenLM/Qwen2.5-Omni Public

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 4k 323
VITA-MLLM/Freeze-Omni VITA-MLLM/Freeze-Omni Public

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 372 25
VITA-MLLM/VITA VITA-MLLM/VITA Public

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2.5k 183
vllm-project/vllm-omni vllm-project/vllm-omni Public

A framework for efficient model inference with omni-modality models

Python 3.2k 561