Feat (brevitas_examples/llm): support for fully custom quantizers by Giuseppe5 · Pull Request #1454 · Xilinx/brevitas

Giuseppe5 · 2026-02-11T15:46:30Z

Reason for this PR

Currently, supporting a new quantization type requires modifying our LLM entry point in several places (e.g., new args, new options in the dict), and this process does not scale to more advanced quantization schemes.

Changes Made in this PR

This PR allows the user to specify a file with custom quantizers to use for our LLM entrypoint.
The user can optionally specify up to seven quantizers:

weight_quantizer
input_linear_quantizer: quantizer used specifically in linear layers
input_quant: quantizer for the inputs of all other, non-linear layers (e.g., if there are convolutional layers in the network)
q_scaled_quant, k_transposed_quant, v_quant: quantizers for QKV in scaled dot product
attn_output_weights_quant: quantizer for the attention output weights, i.e. the output of the softmax in scaled dot product attention

These quantizers should be put in a dict, using the keys specified above.
If a key is not specified, the corresponding quantizer is set to None (equivalent to no quantization).
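
As a rough illustration of the behaviour described above (not the code in this PR), the sketch below loads a dict from a user-supplied Python file and fills every unspecified key with None. The function names `load_custom_quantizers` and `resolve_quantizers`, the dict name `quantizers`, and the rejection of unknown keys are all assumptions made for this example; only the seven key names come from the description.

```python
import importlib.util
from pathlib import Path

# The seven keys listed in the PR description.
QUANTIZER_KEYS = (
    "weight_quantizer",
    "input_linear_quantizer",
    "input_quant",
    "q_scaled_quant",
    "k_transposed_quant",
    "v_quant",
    "attn_output_weights_quant",
)


def load_custom_quantizers(path, dict_name="quantizers"):
    """Import the Python file at `path` and return its quantizer dict.

    `dict_name` is a hypothetical convention; the PR does not state the name.
    """
    path = Path(path)
    spec = importlib.util.spec_from_file_location(path.stem, path)
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)
    return getattr(module, dict_name)


def resolve_quantizers(custom):
    """Return a dict with all seven keys; unspecified keys map to None."""
    unknown = set(custom) - set(QUANTIZER_KEYS)
    if unknown:
        # Assumption: rejecting unknown keys catches typos early.
        raise KeyError(f"Unknown quantizer keys: {sorted(unknown)}")
    return {key: custom.get(key) for key in QUANTIZER_KEYS}
```

A user file would then only define the quantizers it needs, for example a dict with a single `weight_quantizer` entry, and every other key would resolve to None, which matches the "no quantization" behaviour described above.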

Testing Summary

TBD

Giuseppe5 added 4 commits February 11, 2026 15:31

Feat (brevitas_examples/llm): support for fully custom quantizers

4c67ef9

fix

f42636c

added test

5be1e97

Missing file

8e8b768

Giuseppe5 changed the title ~~Custom quantizer~~ Feat (brevitas_examples/llm): support for fully custom quantizers Feb 11, 2026

Giuseppe5 added 2 commits February 11, 2026 15:53

fix diffusion

3f1b350

custom file path

bd203bc

Giuseppe5 requested a review from nickfraser February 11, 2026 18:48

Giuseppe5 added the "next release" label (PRs which should be merged for the next release) Feb 11, 2026

Giuseppe5 self-assigned this Feb 11, 2026

Giuseppe5 closed this Feb 12, 2026

Giuseppe5 wants to merge 6 commits into Xilinx:dev from Giuseppe5:custom_quantizer
