Commit 0c315f1

authored and

committed

[QDQ Quantization] Refactor shared functionality into a base quantizer (#19817)

### Description This PR does not add or remove any functionality. It refactors common functionality shared by the `ONNXQuantizer` and `QDQQuantizer` classes into a new `BaseQuantizer` class. This change helps decouple the QDQ quantizer from other quantization modes and makes it easier to determine if a change to one quantization mode will impact another. ### Motivation and Context An upcoming PR aims to add mixed-precision support to QDQ models (e.g., one part of the graph uses u8 activations and another uses u16 activations). This change makes the upcoming PR smaller and should presumably make determining the impact on existing features more straightforward.

1 parent 3a9666c commit 0c315f1Copy full SHA for 0c315f1

5 files changed

+747

-700

lines changed

onnxruntime
- python/tools/quantization
- test/python/quantization
  - test_qdq.py

5 files changed

+747

-700

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit 0c315f1

5 files changed

5 files changed

File tree

5 files changed

5 files changed

0 commit comments