zminglei

Minglei Zhu zminglei

Pinned Loading

sgl-project/sglang sgl-project/sglang Public

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23.6k 4.5k
Dao-AILab/flash-attention Dao-AILab/flash-attention Public

Fast and memory-efficient exact attention

Python 22.3k 2.4k
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
Liger-Kernel Liger-Kernel Public

Forked from linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Python
relational-database-management-system relational-database-management-system Public

A RDBMS implemented from scratch with C++

C++