All

15 repositories

GPTQModel
Public
LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
transformers quantization optimum
transformers quantization optimum peft vllm gptq sglang
Python
•
Other
•174•1.1k•45•6•Updated Apr 8, 2026Apr 8, 2026
Defuser
Public
Model defuser helper for HF Transformers
Python
•
Apache License 2.0
•0•1•0•0•Updated Apr 1, 2026Apr 1, 2026
Device-SMI
Public
Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-…
device cpu gpu
device cpu gpu smi npu xpu
Python
•
Apache License 2.0
•1•14•0•2•Updated Mar 30, 2026Mar 30, 2026
LogBar
Public
A unified Logger and ProgressBar util with zero dependencies.
Python
•
Apache License 2.0
•0•8•0•0•Updated Mar 30, 2026Mar 30, 2026
Tokenicer
Public
A (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed.
training tokenizer inference
training tokenizer inference token
Python
•
Apache License 2.0
•4•11•0•0•Updated Mar 30, 2026Mar 30, 2026
PyPcre
Public
Python
•
Apache License 2.0
•2•2•0•0•Updated Mar 30, 2026Mar 30, 2026
vllm
Public
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
•
Apache License 2.0
•15k•1•0•0•Updated Mar 26, 2026Mar 26, 2026
sglang
Public
SGLang is a fast serving framework for large language models and vision language models.
Python
•
Apache License 2.0
•5.2k•0•0•0•Updated Mar 26, 2026Mar 26, 2026
MemLord
Public
Python
•
Apache License 2.0
•0•1•0•1•Updated Nov 21, 2025Nov 21, 2025
lm-evaluation-harness
Public
A framework for few-shot evaluation of language models.
Python
•
MIT License
•3.2k•0•0•0•Updated Apr 17, 2025Apr 17, 2025
rockthem
Public
Cuda
•
Apache License 2.0
•0•0•0•0•Updated Mar 13, 2025Mar 13, 2025
platinum-benchmarks
Public
Python
•
Creative Commons Attribution 4.0 International
•3•0•0•0•Updated Mar 6, 2025Mar 6, 2025
peft
Public
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python
•
Apache License 2.0
•2.2k•0•0•0•Updated Mar 4, 2025Mar 4, 2025
transformers
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•33k•0•0•0•Updated Feb 12, 2025Feb 12, 2025
optimum
Public
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
Python
•
Apache License 2.0
•633•1•0•0•Updated Feb 7, 2025Feb 7, 2025

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ModelCloud.ai

All

All

15 repositories

GPTQModel

Defuser

Device-SMI

LogBar

Tokenicer

PyPcre

vllm

sglang

MemLord

lm-evaluation-harness

rockthem

platinum-benchmarks

peft

transformers

optimum

All

All

Repositories list

15 repositories