LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
LightTTS is a lightweight TTS inference framework optimized for CosyVoice2 and CosyVoice3, enabling fast and scalable speech synthesis in Python with streaming support.
Quantized Attention achieves speedups of 2-5x and 3-11x compared to FlashAttention and xformers, without losing end-to-end metrics across language, image, and video models.
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained spars…
[NeurIPS 2025] This is the official PyTorch implementation of "Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM".
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".