Change the repository type filter
All
Repositories list
39 repositories
- MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressive…
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.
MOVA
PublicMOVA: Towards Scalable and Synchronized Video–Audio GenerationMOSS-TTSD
PublicMOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, a…sglang
PublicMOSS-Audio-Tokenizer
PublicMOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming an…Website
PublicDiRL
PublicTransformerLens
Public- [ArXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
MOSS-Speech
PublicMOSS-Speech is a true speech-to-speech large language model without text guidance.FutureOmni
PublicABC-Bench
PublicEmbodied-Planner-R1
Publicrope_pp
Public- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
.github
Public- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
Lorsa
PublicSparse-dLLM
PublicReAttention
Public[ICLR2025] ReAttention, a training-free approach to break the maximum context length in length extrapolationVehicleWorld
PublicUnifiedToolHub
PublicUnifiedToolHub is a comprehensive project supporting LLM-based tool use, designed to unify various tool-use dataset formats and provide training preparation, an…- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation