Memory optimization patches for HuggingFace Transformers.
- Memory Reduction - Significantly lowers memory usage in Transformers models
- Zero Configuration - Works automatically after import
Installation:

```bash
pip install git+https://github.com/GeeeekExplorer/transformers-patch.git
```
Just import the patch before loading any Transformers models:

```python
import transformers_patch  # must be imported before any model is loaded
from transformers import AutoModel
```
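After the import, models are loaded exactly as usual; here is a minimal sketch (the model ID and dtype are illustrative choices, not requirements of the patch):

```python
import transformers_patch  # apply the memory patches before loading the model
import torch
from transformers import AutoModelForCausalLM

# Qwen/Qwen3-8B is simply the model used in the benchmark below; any
# Transformers model should load the same way with no extra configuration.
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-8B",
    torch_dtype=torch.bfloat16,
)
```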
Test Configuration:
- 8x GPU machine
- Micro batch size: 1
- Sequence length: 4096
- Gradient checkpointing: Disabled
- Model: Qwen3-8B
| Memory Component | Fixed Allocation | Before Patch | After Patch |
|---|---|---|---|
| Model + Gradients | 30.5 GB | - | - |
| ZeRO Optimizer States | 11.4 GB | - | - |
| Activations | - | 35.4 GB | 17.8 GB |
50% reduction in activation memory!
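As a rough back-of-the-envelope for the fixed rows, assuming roughly 8.2B parameters, bf16 weights and gradients, and fp32 Adam states sharded across the 8 GPUs (these byte counts are assumptions, not taken from the benchmark):

```python
# Assumed: ~8.2B parameters; bf16 weights + bf16 gradients = 4 bytes/param;
# fp32 Adam moments + fp32 master weights = 12 bytes/param, sharded over 8 GPUs.
params = 8.2e9
weights_and_grads = params * 4 / 2**30      # ~30.5 GiB per GPU
zero_optimizer = params * 12 / 8 / 2**30    # ~11.5 GiB per GPU
print(f"{weights_and_grads:.1f} GiB weights+grads, {zero_optimizer:.1f} GiB optimizer")
```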
See the complete example in train.py.
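For illustration, a single-GPU sketch of measuring peak memory at the benchmark settings above; the model ID, dtype, and measurement calls are assumptions rather than a description of train.py, which additionally shards optimizer states with ZeRO across the 8 GPUs:

```python
import transformers_patch  # comment this out to measure the unpatched baseline
import torch
from transformers import AutoModelForCausalLM

# Benchmark settings: micro batch size 1, sequence length 4096,
# gradient checkpointing disabled, Qwen3-8B in bf16.
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-8B", torch_dtype=torch.bfloat16
).cuda()
model.gradient_checkpointing_disable()

input_ids = torch.randint(0, model.config.vocab_size, (1, 4096), device="cuda")
torch.cuda.reset_peak_memory_stats()
out = model(input_ids=input_ids, labels=input_ids)  # forward pass
out.loss.backward()                                 # backward pass
# Peak allocation here covers weights, gradients, and activations together;
# a single large GPU (e.g. 80 GB) is needed to run this without ZeRO sharding.
print(f"peak memory: {torch.cuda.max_memory_allocated() / 2**30:.1f} GiB")
```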