Skip to content
Change the repository type filter

All

    Repositories list

    • exllamav3

      Public
      An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
      Python
      61588474Updated Dec 6, 2025Dec 6, 2025
    • exllamav2

      Public
      A fast inference library for running LLMs locally on modern consumer-class GPUs
      Python
      3244.4k13622Updated Aug 16, 2025Aug 16, 2025
    • exui

      Public
      Web UI for ExLlamaV2
      JavaScript
      47514343Updated Feb 5, 2025Feb 5, 2025