Adds Quantization documentation #2189

JyotinderSingh · 2025-10-09T11:15:39Z

Adds a "Quantization in Keras" overview page, which includes:

Basic intro to quantization
Introduction to the Keras Quantization toolkit
Code Examples
Tips and recommendations.

gemini-code-assist · 2025-10-09T11:15:55Z

Summary of Changes

Hello @JyotinderSingh, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces comprehensive documentation for quantization in Keras. It covers the fundamental concepts of quantization, explains the various supported modes (int8, float8, int4, and GPTQ) with their mechanisms and benefits, and provides clear guidance and code examples for applying quantization at both the model and layer levels. The new guide aims to help users optimize Keras models for reduced memory footprint and improved inference performance.

Highlights

Quantization Overview: Introduces the fundamental concepts of quantization, highlighting its benefits such as reduced model size, lower VRAM/RAM footprint, and increased memory bandwidth for improved inference performance.
Keras Quantization Toolkit: Details the first-class post-training quantization (PTQ) workflows available in Keras, supporting various modes including int4, int8, float8, and weight-only GPTQ.
Quantization Modes Explained: Provides in-depth explanations for each quantization mode (int8, float8, int4, and GPTQ), covering their operational mechanisms, use cases, and expected outcomes.
Model and Layer-wise Quantization: Demonstrates how to apply quantization to entire models and individual layers with practical code examples, outlining typical workflows and scenarios for each approach.
Layer Coverage and Practical Guidance: Lists the core Keras layers that are supported by the quantization framework and offers practical advice for effectively utilizing GPTQ and measuring performance metrics.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request adds comprehensive documentation for quantization in Keras, covering various modes like int8, float8, int4, and GPTQ. The new guide is provided in notebook, markdown, and python script formats. The changes are mostly good, but I've found a couple of minor issues in the documentation text that could be clarified.

More importantly, this PR introduces a file guides/quantization_overview.py which seems to be an alternative or older draft of the quantization guide. It describes a different API and is not integrated with the rest of the documentation. To avoid confusion, this file should be removed from the pull request. My specific comments are attached to the relevant files.

guides/quantization_overview.py

guides/ipynb/quantization/overview.ipynb

JyotinderSingh · 2025-10-09T11:35:05Z

/gemini review

gemini-code-assist

Code Review

This pull request adds comprehensive documentation for quantization in Keras. The new guide is provided in Jupyter notebook, Markdown, and Python script formats, and is added to the main guides index. The documentation covers the introduction to quantization, different modes supported by Keras (int8, float8, int4, GPTQ), API usage with code examples, and practical tips. The content is well-structured and informative. I've provided a couple of suggestions to improve the consistency of markdown formatting and comments across the different file formats.

guides/md/quantization/overview.md

guides/quantization/overview.py

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

hertschuh

Thanks for the PR! This is really useful.

guides/quantization_overview.py

guides/quantization/overview.py

scripts/guides_master.py

JyotinderSingh requested review from fchollet and MarkDaoust as code owners October 9, 2025 11:15

github-actions bot assigned sachinprasadhs Oct 9, 2025

gemini-code-assist bot reviewed Oct 9, 2025

View reviewed changes

guides/quantization_overview.py Show resolved Hide resolved

guides/ipynb/quantization/overview.ipynb Outdated Show resolved Hide resolved

guides/ipynb/quantization/overview.ipynb Outdated Show resolved Hide resolved

Adds quantization documentation

4970e3a

JyotinderSingh force-pushed the quantization-docs branch from 14df39f to 4970e3a Compare October 9, 2025 11:34

gemini-code-assist bot reviewed Oct 9, 2025

View reviewed changes

guides/md/quantization/overview.md Outdated Show resolved Hide resolved

guides/md/quantization/overview.md Outdated Show resolved Hide resolved

guides/quantization/overview.py Outdated Show resolved Hide resolved

JyotinderSingh and others added 4 commits October 9, 2025 17:10

Update guides/md/quantization/overview.md

23fba61

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update guides/md/quantization/overview.md

a5c6141

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

improves consistency

90eb08a

improves consistency

2d61c85

hertschuh reviewed Oct 9, 2025

View reviewed changes

guides/quantization_overview.py Show resolved Hide resolved

guides/quantization/overview.py Outdated Show resolved Hide resolved

scripts/guides_master.py Outdated Show resolved Hide resolved

JyotinderSingh added 2 commits October 10, 2025 14:00

address reviews

791043f

improve formatting and add missing note

6b9cc18

JyotinderSingh force-pushed the quantization-docs branch from e5572b9 to 6b9cc18 Compare October 10, 2025 09:11

remove warning block

42ae2a4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adds Quantization documentation #2189

Adds Quantization documentation #2189

JyotinderSingh commented Oct 9, 2025

Uh oh!

gemini-code-assist bot commented Oct 9, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JyotinderSingh commented Oct 9, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hertschuh left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Adds Quantization documentation #2189

Are you sure you want to change the base?

Adds Quantization documentation #2189

Conversation

JyotinderSingh commented Oct 9, 2025

Uh oh!

gemini-code-assist bot commented Oct 9, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JyotinderSingh commented Oct 9, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hertschuh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!