HF Distiller is an open-source toolkit for performing knowledge distillation on Hugging Face Transformers models. It allows developers to train smaller, faster student models from large pre-trained teacher models while maintaining high performance.
Knowledge Distillation (KD) compresses a large model into a smaller one by transferring the “knowledge” learned by the teacher to the student. HF Distiller wraps around Hugging Face’s Trainer to make KD accessible, modular, and intuitive.
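For orientation, the snippet below sketches the standard KD objective (Hinton-style): a temperature-softened loss against the teacher's logits blended with the usual hard-label loss. It is a minimal illustration only; the exact loss implemented by DistillTrainer may differ, and the alpha/temperature names here simply mirror the kd_alpha and temperature options used in the quickstart further down.

import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, alpha=0.5, temperature=2.0):
    # Soft targets: KL divergence between temperature-scaled distributions,
    # scaled by T^2 so gradient magnitudes stay comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    # Hard targets: ordinary cross-entropy against ground-truth labels
    # (expects logits of shape (N, num_classes); sequence models flatten first).
    hard = F.cross_entropy(student_logits, labels)
    # alpha balances the distillation term against the supervised term.
    return alpha * soft + (1.0 - alpha) * hard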
Key Features:
- ✅ Load any teacher model from Hugging Face Hub
- ✅ Create smaller student models from scratch
- ✅ Supports Hugging Face tokenizers
- ✅ Seamless integration with the `datasets` library
- ✅ Transparent logging and checkpointing
- ✅ Fully compatible with PyTorch and Transformers
┌────────────────────────┐
│     Teacher Model      │  Pretrained Hugging Face LM
└────────────┬───────────┘
             │
             ▼
┌────────────────────────┐
│ Knowledge Distillation │  Transfer teacher knowledge + KD loss
└────────────┬───────────┘
             │
             ▼
┌────────────────────────┐
│     Student Model      │  Smaller, efficient model trained from scratch
└────────────────────────┘
# Install transformers_distillation (Recommended)
pip install --no-deps git+https://github.com/Dhiraj309/transformers_distillation.git

# OR

# Clone the repository
git clone https://github.com/Dhiraj309/transformers_distillation.git
cd transformers_distillation

# Install dependencies
pip install -r requirements.txt
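A quick sanity check that the install worked (the import path is the same one used in the quickstart below):

python -c "from transformers_distillation.trainer import DistillTrainer; print('OK')"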
from transformers_distillation.models import load_teacher, load_student
from transformers_distillation.trainer import DistillTrainer
from transformers import AutoTokenizer, TrainingArguments
from datasets import Dataset
# Example dataset
dataset = Dataset.from_dict({"text": ["Hello world!", "AI is amazing."]})
# Load teacher
teacher = load_teacher("google-bert/bert-base-uncased")
tokenizer = AutoTokenizer.from_pretrained("google-bert/bert-base-uncased")
# Create student model
student = load_student(
    model_name_or_path="google-bert/bert-base-uncased",
    from_scratch=True,
    n_layers=4,
    n_heads=4,
    n_embd=256,
    is_pretrained=False
)
# Tokenize
def tokenize(batch):
    return tokenizer(batch["text"], max_length=128, padding=True, truncation=True)
tokenized = dataset.map(tokenize, remove_columns=["text"])
# Training arguments
training_args = TrainingArguments(
    output_dir="./student-llm",
    per_device_train_batch_size=1,
    num_train_epochs=1,
    learning_rate=2e-4,
    report_to="none"
)
# Train student with KD
trainer = DistillTrainer(
    teacher_model=teacher,
    student_model=student,
    train_dataset=tokenized,
    tokenizer=tokenizer,
    training_args=training_args,
    kd_alpha=0.5,
    temperature=2.0
)
trainer.train()
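After training, the distilled student can be saved and inspected like any other PyTorch/Transformers model. This is a minimal sketch that assumes load_teacher and load_student return standard Transformers (PyTorch) models, as the compatibility note above suggests:

# Persist the student and its tokenizer with the standard Transformers APIs
student.save_pretrained("./student-llm")
tokenizer.save_pretrained("./student-llm")

# Compare parameter counts to see the size reduction (plain PyTorch)
teacher_params = sum(p.numel() for p in teacher.parameters())
student_params = sum(p.numel() for p in student.parameters())
print(f"Teacher: {teacher_params:,} params | Student: {student_params:,} params")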
| Stage | Status |
|---|---|
| Core Development | ✅ Complete |
| Documentation | ✅ Complete |
| Community Feedback | 🚧 In Progress |
| Tutorials & Examples | 🚧 In Progress |
We welcome contributions from the community, including:
- Pull requests for new KD strategies
- Bug reports and feature requests
- Tutorials and example scripts
- Optimization for faster student training
🔗 GitHub: Dhiraj309
🔗 Hugging Face: dignity045
Released under the MIT License — free to use, modify, and distribute. See LICENSE for full terms.