Skip to content
Change the repository type filter

All

    Repositories list

    • FlashInfer: Kernel Library for LLM Serving
      Python
      Apache License 2.0
      8135.2k349175Updated Mar 20, 2026Mar 20, 2026
    • whl

      Public
      Pre-built wheels for flashinfer python package.
      HTML
      5200Updated Mar 20, 2026Mar 20, 2026
    • Building the Virtuous Cycle for AI-driven LLM Systems
      Python
      Apache License 2.0
      302031716Updated Mar 19, 2026Mar 19, 2026
    • CSS
      0000Updated Mar 17, 2026Mar 17, 2026
    • FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels
      Python
      10915273Updated Mar 13, 2026Mar 13, 2026
    • Python
      Apache License 2.0
      82500Updated Mar 12, 2026Mar 12, 2026
    • ci-infra

      Public
      Shell
      Apache License 2.0
      1000Updated Feb 20, 2026Feb 20, 2026
    • cubloaty

      Public
      a size profiler for cuda binary
      Python
      Apache License 2.0
      07110Updated Jan 15, 2026Jan 15, 2026
    • Project website of FlashInfer project
      SCSS
      4020Updated Jan 3, 2026Jan 3, 2026
    • Python
      4302Updated Oct 29, 2025Oct 29, 2025
    • web-data

      Public
      Apache License 2.0
      0000Updated Jun 25, 2025Jun 25, 2025
    • Python
      Apache License 2.0
      36500Updated Apr 26, 2025Apr 26, 2025
    • Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom trace generation (for y…
      Python
      Apache License 2.0
      8100Updated Apr 16, 2025Apr 16, 2025
    • flashinfer-nightly

      Public archive
      FlashInfer Nightly
      MIT License
      1600Updated Apr 9, 2025Apr 9, 2025
    • Apache License 2.0
      0400Updated Apr 2, 2025Apr 2, 2025
    • Jupyter Notebook
      0200Updated Jan 10, 2025Jan 10, 2025
    • Debug print operator for cudagraph debugging
      Cuda
      21411Updated Aug 2, 2024Aug 2, 2024
    • The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
      Other
      17k000Updated Apr 21, 2024Apr 21, 2024
    • candle

      Public
      Minimalist ML framework for Rust
      Rust
      Apache License 2.0
      1.5k000Updated Mar 7, 2024Mar 7, 2024