Skip to content
Change the repository type filter

All

    Repositories list

    • GPTQModel

      Public
      LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
      Python
      1601k4415Updated Feb 15, 2026Feb 15, 2026
    • Tokenicer

      Public
      A (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed.
      Python
      41000Updated Feb 9, 2026Feb 9, 2026
    • LogBar

      Public
      A unified Logger and ProgressBar util with zero dependencies.
      Python
      0800Updated Dec 24, 2025Dec 24, 2025
    • PyPcre

      Public
      Python
      2201Updated Dec 16, 2025Dec 16, 2025
    • Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-…
      Python
      11401Updated Dec 12, 2025Dec 12, 2025
    • MemLord

      Public
      Python
      0101Updated Nov 21, 2025Nov 21, 2025
    • A framework for few-shot evaluation of language models.
      Python
      3.1k000Updated Apr 17, 2025Apr 17, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      14k100Updated Mar 27, 2025Mar 27, 2025
    • rockthem

      Public
      Cuda
      0000Updated Mar 13, 2025Mar 13, 2025
    • Python
      3000Updated Mar 6, 2025Mar 6, 2025
    • peft

      Public
      🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
      Python
      2.2k000Updated Mar 4, 2025Mar 4, 2025
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      4.5k000Updated Mar 4, 2025Mar 4, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      32k000Updated Feb 12, 2025Feb 12, 2025
    • optimum

      Public
      🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
      Python
      617100Updated Feb 7, 2025Feb 7, 2025