Feat (export/onnx): fallback export to fake quantized weights by Giuseppe5 · Pull Request #1395 · Xilinx/brevitas

Giuseppe5 · 2025-10-16T14:34:46Z

Reason for this PR

ONNX's QuantizeLinear node only supports ROUND as rounding function, but there are cases where we might be using FLOOR.

Changes Made in this PR

This PR addresses this issue by falling back to saving and exporting fake quantized weights.
With this trick, we use whatever interal rounding operation to pre-quantize the weights, so that round is basically a no-op during export time.

The drawback of this approach is that we effectively duplicate weights during export.

This could be remedied for example by destructively replacing the original weights with the fake quantized ones, with the obvious drawback that we lose access to the original model's weights.

Testing Summary

Added tests with floor rounding format for weights.
This effectively double the testing configurations and the testing time.

nickfraser

One comment about a duplicated function, otherwise LGTM!

tests/brevitas_ort/test_quant_module.py

nickfraser · 2025-10-22T18:21:27Z

FYI, the FINN integration issues are likely fixed in #1400

nickfraser approved these changes Oct 17, 2025

View reviewed changes

tests/brevitas_ort/test_quant_module.py Outdated Show resolved Hide resolved

Giuseppe5 requested a review from nickfraser October 17, 2025 15:10

nickfraser assigned Giuseppe5 Oct 20, 2025

nickfraser added the next release PRs which should be merged for the next release label Oct 20, 2025

nickfraser mentioned this pull request Oct 21, 2025

v0.13.0 Minor Release #1290

Open

34 tasks

Giuseppe5 added 4 commits October 23, 2025 10:24

Feat (export/onnx): fallback export to fake quantized weights

3fb15f1

Test + warning

954621f

Fix tests

6fbeaf3

Reduce test combinations

7e92477

nickfraser force-pushed the export_fake_quantized branch from 9555e11 to 7e92477 Compare October 23, 2025 09:24

nickfraser requested review from pablomlago and removed request for pablomlago October 23, 2025 09:24

tests (ort): don't duplicate rm_onnx function.

f214612

nickfraser merged commit 004479e into Xilinx:dev Oct 24, 2025
25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat (export/onnx): fallback export to fake quantized weights#1395

Feat (export/onnx): fallback export to fake quantized weights#1395
nickfraser merged 5 commits intoXilinx:devfrom
Giuseppe5:export_fake_quantized

Giuseppe5 commented Oct 16, 2025 •

edited

Loading

Uh oh!

nickfraser left a comment

Uh oh!

Uh oh!

nickfraser commented Oct 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Giuseppe5 commented Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reason for this PR

Changes Made in this PR

Testing Summary

Uh oh!

nickfraser left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nickfraser commented Oct 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Giuseppe5 commented Oct 16, 2025 •

edited

Loading