Feature Request: Standalone Notebook for Unsloth Dynamic Quant 2.0 #4033

rikunarita · 2026-02-11T13:45:08Z

rikunarita
Feb 11, 2026

Hello Unsloth team,
I would like to request a standalone notebook specifically for Unsloth Dynamic Quant 2.0.
Currently, Dynamic Quant 2.0 is sometimes included within fine-tuning notebooks, but there does not appear to be a dedicated notebook focused solely on performing Dynamic Quant 2.0 quantization.
Having a standalone notebook would be very helpful for:
Users who only want to quantize existing Hugging Face models
Re-quantizing merged or fine-tuned models
Running Dynamic Quant 2.0 independently without going through the full fine-tuning pipeline
Clearer educational reference for how Dynamic Quant 2.0 works in isolation
A Google Colab–ready version would be especially valuable for accessibility.
Dynamic Quant 2.0 is a very powerful feature, and I believe a dedicated notebook would improve usability and adoption.
Thank you for your great work on Unsloth.

shimmyshimmer · 2026-02-14T01:21:02Z

shimmyshimmer
Feb 14, 2026
Maintainer

We don't have any notebook performing dyanmic quantization.

You can however perform quantization aware training QAT: https://unsloth.ai/docs/blog/quantization-aware-training-qat

0 replies

rikunarita · 2026-02-15T06:18:05Z

rikunarita
Feb 15, 2026
Author

Thank you for the clarification!
My main use case is post-training quantization of already fine-tuned or merged models, where I only want to run Dynamic Quant 2.0 without going through the full training pipeline.
If possible, would you consider providing a standalone notebook for this workflow?
If that is not planned, could you please share:
whether Dynamic Quant 2.0 can be executed independently of QAT
the recommended pipeline or minimal code path to run it manually
I would be happy to build a community notebook for this use case if I understand the correct procedure.
Thanks again for your work — Unsloth is amazing.

0 replies

xXMrNidaXx · 2026-02-23T13:31:55Z

xXMrNidaXx
Feb 23, 2026

Strong +1 for a standalone Dynamic Quant 2.0 notebook!

Why this matters:

Reproducibility — Notebooks are shareable, version-controlled experiments
Education — Great for understanding the quant pipeline step-by-step
Customization — Easier to modify for specific use cases

What I'd love to see included:

# 1. Model loading with quant config
model = FastModel.from_pretrained(
    model_name,
    dynamic_quant_config={
        "bits": 4,
        "strategy": "dynamic",
        "calibration_samples": 128
    }
)

# 2. Calibration step (visualized)
calibration_stats = model.calibrate(dataset)
plot_quantization_ranges(calibration_stats)

# 3. Evaluation before/after
print(f"Original perplexity: {original_ppl}")
print(f"Quantized perplexity: {quant_ppl}")
print(f"Size reduction: {size_reduction}%")

We do a lot of quantization work at RevolutionAI for edge deployments. A well-documented notebook would be incredibly useful for client demos!

Happy to help test or contribute examples if this moves forward.

0 replies

xXMrNidaXx · 2026-02-23T15:30:58Z

xXMrNidaXx
Feb 23, 2026

+1 for standalone Dynamic Quant 2.0 notebook!

Use cases this would enable:

Post-merge quantization
- Fine-tune with Unsloth
- Merge LoRA
- Quantize merged model with Dynamic Quant 2.0
Quantize any HF model
- Download from Hub
- Apply Dynamic Quant 2.0
- Upload quantized version
Batch quantization pipeline
- Quantize multiple models overnight
- Compare quality across quant levels

Proposed notebook structure:

# Cell 1: Setup
!pip install unsloth

# Cell 2: Load model (any HF model)
from unsloth import FastLanguageModel
model, tokenizer = FastLanguageModel.from_pretrained(
    "meta-llama/Llama-3-8B-Instruct"
)

# Cell 3: Apply Dynamic Quant 2.0
model = FastLanguageModel.dynamic_quant_2(
    model,
    bits=4,
    calibration_data=calibration_dataset,
)

# Cell 4: Save/Upload
model.save_pretrained("llama-3-8b-dq2")
model.push_to_hub("username/llama-3-8b-dq2")

# Cell 5: Test inference
output = model.generate(...)

Colab considerations:

T4 compatible (16GB VRAM)
Clear memory management
Progress bars for long operations

We quantize models regularly at Revolution AI — a standalone notebook would streamline our pipeline significantly.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature Request: Standalone Notebook for Unsloth Dynamic Quant 2.0 #4033

Uh oh!

{{title}}

Uh oh!

Replies: 4 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

Feature Request: Standalone Notebook for Unsloth Dynamic Quant 2.0 #4033

Uh oh!

rikunarita Feb 11, 2026

Replies: 4 comments

Uh oh!

shimmyshimmer Feb 14, 2026 Maintainer

Uh oh!

rikunarita Feb 15, 2026 Author

Uh oh!

xXMrNidaXx Feb 23, 2026

Uh oh!

xXMrNidaXx Feb 23, 2026

rikunarita
Feb 11, 2026

shimmyshimmer
Feb 14, 2026
Maintainer

rikunarita
Feb 15, 2026
Author

xXMrNidaXx
Feb 23, 2026

xXMrNidaXx
Feb 23, 2026