Skip to content

Popular repositories Loading

  1. exllamav2 exllamav2 Public

    A fast inference library for running LLMs locally on modern consumer-class GPUs

    Python 4.3k 323

  2. exllamav3 exllamav3 Public

    An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs

    Python 527 47

  3. exui exui Public

    Web UI for ExLlamaV2

    JavaScript 510 47

Repositories

Showing 3 of 3 repositories

Top languages

Loading…

Most used topics

Loading…