    Repositories list

    • GPTQModel

      Public
      LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
      Python
      Other
      1741.1k456 Updated Apr 8, 2026
    • Defuser

      Public
      Model defuser helper for HF Transformers
      Python
      Apache License 2.0
      0100 Updated Apr 1, 2026
    • Self-contained Python library with zero dependencies that gives you unified device properties for GPU, CPU, and NPU. No more calling separate tools such as nvidia-…
      Python
      Apache License 2.0
      11402 Updated Mar 30, 2026
    • LogBar

      Public
      A unified Logger and ProgressBar util with zero dependencies.
      Python
      Apache License 2.0
      0800 Updated Mar 30, 2026
    • Tokenicer

      Public
      A (nicer) tokenizer you want to use for model inference and training, with all known preventable gotchas normalized or auto-fixed.
      Python
      Apache License 2.0
      41100 Updated Mar 30, 2026
    • PyPcre

      Public
      Python
      Apache License 2.0
      2200 Updated Mar 30, 2026
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      15k100 Updated Mar 26, 2026
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      5.2k000 Updated Mar 26, 2026
    • MemLord

      Public
      Python
      Apache License 2.0
      0101 Updated Nov 21, 2025
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      3.2k000 Updated Apr 17, 2025
    • rockthem

      Public
      Cuda
      Apache License 2.0
      0000 Updated Mar 13, 2025
    • Python
      Creative Commons Attribution 4.0 International
      3000 Updated Mar 6, 2025
    • peft

      Public
      🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
      Python
      Apache License 2.0
      2.2k000 Updated Mar 4, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      33k000 Updated Feb 12, 2025
    • optimum

      Public
      🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
      Python
      Apache License 2.0
      633100 Updated Feb 7, 2025