Commit a91f14a

Merge branch 'chi/trt_ep_quantization_yolov3_update' of https://github.com/microsoft/onnxruntime-inference-examples into chi/trt_ep_quantization_yolov3_update
2 parents 4200830 + 7d2ece0 commit a91f14a

1 file changed, +2 -0 lines changed

quantization/image_classification/trt/resnet50/README.md

Lines changed: 2 additions & 0 deletions
@@ -1,6 +1,8 @@
 # ONNX PTQ overview
 Following is the end-to-end example using ORT quantization tool to quantize ONNX model, specifially image classification model, and run/evaluate the quantized model with TRT EP.
 
+## Note
+Implicit quantization is deprecated in TRT 10.1 (using calibration table in TRT EP to set `setDynamicRange`), we suggest to use explicit quantization aka QDQ format.
 ## Environment setup
 ### dataset
 First, prepare the dataset for calibration. TensorRT recommends calibration data size to be at least 500 for CNN and ViT models.
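
The note added in this diff contrasts a per-tensor dynamic range (implicit quantization) with explicit Q/DQ nodes that carry a scale and zero point. As a rough illustration only, the sketch below shows how a calibrated range could map to a symmetric int8 scale and what a quantize/dequantize pair then computes. It is not the ORT quantization tool or the TRT EP; the function names and the sample range `2.54` are hypothetical, and it assumes symmetric quantization with zero point 0 and saturation to [-128, 127].

```python
# Hypothetical helper names; a pure-Python sketch, not an ORT or TensorRT API.

def dynamic_range_to_scale(amax: float) -> float:
    """Map a calibrated absolute maximum to a symmetric int8 scale (assumption)."""
    return amax / 127.0

def quantize_linear(x: float, scale: float, zero_point: int = 0) -> int:
    """Scale, round half to even, shift by the zero point, then saturate to int8."""
    q = round(x / scale) + zero_point  # Python's round() rounds half to even
    return max(-128, min(127, q))

def dequantize_linear(q: int, scale: float, zero_point: int = 0) -> float:
    """Recover an approximate real value from the int8 code."""
    return (q - zero_point) * scale

scale = dynamic_range_to_scale(2.54)   # sample range, chosen for illustration
q = quantize_linear(1.0, scale)        # int8 code for the value 1.0
x_hat = dequantize_linear(q, scale)    # approximate reconstruction of 1.0
q_high = quantize_linear(10.0, scale)  # out-of-range input saturates high
q_low = quantize_linear(-10.0, scale)  # out-of-range input saturates low
```

In the QDQ format the scale travels with the model inside the Q/DQ nodes, so the runtime does not need a separate calibration table to set ranges.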
