Skip to content
Change the repository type filter

All

    Repositories list

    • sglang

      Public
      SGLang is a high-performance serving framework for large language models and multimodal models.
      Python
      Apache License 2.0
      5k25k5961.9kUpdated Mar 23, 2026Mar 23, 2026
    • sglang-jax

      Public
      JAX backend for SGL
      Python
      Apache License 2.0
      762529033Updated Mar 23, 2026Mar 23, 2026
    • sgl-project.github.io

      Public
      This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang
      HTML
      31115111Updated Mar 23, 2026Mar 23, 2026
    • ome

      Public
      Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton
      Go
      Apache License 2.0
      664013246Updated Mar 23, 2026Mar 23, 2026
    • sgl-docs

      Public
      MDX
      Apache License 2.0
      16400Updated Mar 23, 2026Mar 23, 2026
    • rbg

      Public
      A workload for deploying LLM inference services on Kubernetes
      Go
      Apache License 2.0
      491922525Updated Mar 23, 2026Mar 23, 2026
    • sgl-kernel-xpu

      Public
      SGLang kernel library for Intel XPU
      Python
      MIT License
      2120016Updated Mar 23, 2026Mar 23, 2026
    • srt-slurm

      Public
      Benchmark SGLang on SLURM
      Python
      37000Updated Mar 23, 2026Mar 23, 2026
    • whl

      Public
      SGLang Kernel Wheel Index
      HTML
      MIT License
      91801Updated Mar 23, 2026Mar 23, 2026
    • sgl-cookbook

      Public
      Cookbook of SGLang - Recipe
      JavaScript
      Apache License 2.0
      489967Updated Mar 22, 2026Mar 22, 2026
    • SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
      Python
      MIT License
      26964325Updated Mar 22, 2026Mar 22, 2026
    • SpecForge

      Public
      Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
      Python
      MIT License
      1867416539Updated Mar 21, 2026Mar 21, 2026
    • rbg-api

      Public
      1001Updated Mar 19, 2026Mar 19, 2026
    • A Rust reimplementation of genai-bench for benchmarking LLM serving systems at high concurrency with accurate timing and industry-standard metrics
      Python
      MIT License
      5028499Updated Mar 18, 2026Mar 18, 2026
    • sgl-kernel-npu

      Public
      SGLang kernel library for NPU
      C++
      MIT License
      961081543Updated Mar 18, 2026Mar 18, 2026
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      MIT License
      8402201Updated Mar 18, 2026Mar 18, 2026
    • sgl-flash-attn

      Public
      Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      2.5k2100Updated Mar 13, 2026Mar 13, 2026
    • mini-sglang

      Public
      A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
      Python
      MIT License
      5173.8k824Updated Mar 13, 2026Mar 13, 2026
    • sgl-test-files

      Public
      The test files for SGLang.
      MIT License
      3101Updated Feb 23, 2026Feb 23, 2026
    • FlashMLA

      Public
      FlashMLA: Efficient Multi-head Latent Attention Kernels
      C++
      MIT License
      1k000Updated Feb 16, 2026Feb 16, 2026
    • 0000Updated Jan 20, 2026Jan 20, 2026
    • ome-crd

      Public
      0000Updated Jan 15, 2026Jan 15, 2026
    • sgl-learning-materials

      Public
      Materials for learning SGLang
      MIT License
      6078500Updated Jan 5, 2026Jan 5, 2026
    • fast-hadamard-transform

      Public
      Fast Hadamard transform in CUDA, with a PyTorch interface
      C
      BSD 3-Clause "New" or "Revised" License
      56100Updated Oct 15, 2025Oct 15, 2025
    • sgl-whl

      Public
      SGLang wheels for multiple platforms
      MIT License
      21110Updated Oct 13, 2025Oct 13, 2025