We have standardized on [Git LFS (Large File Storage)](https://git-lfs.github.com/)
## Models
#### Read the [Usage](#usage-) section below for more details on the file formats in the ONNX Model Zoo (.onnx, .pb, .npz), downloading multiple ONNX models through [Git LFS command line](#gitlfs-), and starter Python code for validating your ONNX model using test data.
#### INT8 models are generated by [Intel® Neural Compressor](https://github.com/intel/neural-compressor); read the [Introduction](https://github.com/intel/neural-compressor/blob/master/README.md) to learn how to use it to quantize ONNX models.
> Compared with the fp32 AlexNet, the int8 AlexNet's Top-1 accuracy drops by 0.22%, its Top-5 accuracy drops by 0.05%, and performance improves by 2.26x.
>
> **Note**
>
> Different preprocessing methods lead to different accuracies; the accuracy in the table depends on this specific [preprocess method](https://github.com/intel/neural-compressor/blob/master/examples/onnxrt/onnx_model_zoo/alexnet/main.py).
>
> Performance depends on the test hardware. The data here was collected with an Intel® Xeon® Platinum 8280 Processor, 1 socket with 4 cores per instance, CentOS Linux 8.3, and a data batch size of 1.
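The Top-1 and Top-5 figures quoted above can be computed with a short metric like the sketch below. Treating the "accuracy drop" as the difference between fp32 and int8 accuracy is an assumption of this sketch, not a definition taken from the Zoo.

```python
import numpy as np

def topk_accuracy(logits, labels, k=1):
    """Fraction of samples whose true label is among the k highest scores."""
    # argsort ascending, then take the indices of the k largest scores per row
    topk = np.argsort(logits, axis=1)[:, -k:]
    return float(np.mean([labels[i] in topk[i] for i in range(len(labels))]))
```

With `k=1` this is Top-1 accuracy; with `k=5`, Top-5.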
## Description
AlexNet is the name of a convolutional neural network for classification.

Differences:
- not training with the relighting data-augmentation;
- initializing non-zero biases to 0.1 instead of 1 (found necessary for training, as initialization to 1 gave flat loss).
### Paper
[ImageNet Classification with Deep Convolutional Neural Networks](https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf)
This model obtains a top-1 accuracy of 57.1% and a top-5 accuracy
(Using the average of 10 crops, (4 + 1 center) * 2 mirror,
should obtain a bit higher accuracy.)
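The "(4 + 1 center) * 2 mirror" ten-crop evaluation mentioned above can be sketched as follows: four corner crops plus one center crop, each paired with its horizontal mirror. The square crop size and `(H, W, ...)` image layout are assumptions of this sketch.

```python
import numpy as np

def ten_crops(img, crop):
    """Four corner crops + center crop, each with its horizontal mirror."""
    h, w = img.shape[:2]
    tops = [0, 0, h - crop, h - crop, (h - crop) // 2]
    lefts = [0, w - crop, 0, w - crop, (w - crop) // 2]
    crops = [img[t:t + crop, l:l + crop] for t, l in zip(tops, lefts)]
    crops += [c[:, ::-1] for c in crops]  # horizontal mirrors of the 5 crops
    return np.stack(crops)
```

At evaluation time the model's predictions over the 10 crops are averaged before taking the top-k classes.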
## Quantization
AlexNet-int8 is obtained by quantizing the fp32 AlexNet model. We use [Intel® Neural Compressor](https://github.com/intel/neural-compressor) with the onnxruntime backend to perform quantization. See the [instructions](https://github.com/intel/neural-compressor/blob/master/examples/onnxrt/onnx_model_zoo/alexnet/README.md) to learn how to use Intel® Neural Compressor for quantization.
Make sure to specify the appropriate dataset path in the configuration file.
```bash
# --input_model expects a *.onnx model path
bash run_tuning.sh --input_model=path/to/model \
                   --config=alexnet.yaml \
                   --data_path=/path/to/imagenet \
                   --label_path=/path/to/imagenet/label \
                   --output_model=path/to/save
```
## References
* [ImageNet Classification with Deep Convolutional Neural Networks](https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf)
# CaffeNet

> Compared with the fp32 CaffeNet, the int8 CaffeNet's Top-1 accuracy drops by 0.09%, its Top-5 accuracy drops by 0.13%, and performance improves by 3.08x.
>
> **Note**
>
> Different preprocessing methods lead to different accuracies; the accuracy in the table depends on this specific [preprocess method](https://github.com/intel/neural-compressor/blob/master/examples/onnxrt/onnx_model_zoo/caffenet/main.py).
>
> Performance depends on the test hardware. The data here was collected with an Intel® Xeon® Platinum 8280 Processor, 1 socket with 4 cores per instance, CentOS Linux 8.3, and a data batch size of 1.
## Description
CaffeNet is a variant of AlexNet.

Differences:
- not training with the relighting data-augmentation;
- the order of pooling and normalization layers is switched (in CaffeNet, pooling is done before normalization).
### Paper
[ImageNet Classification with Deep Convolutional Neural Networks](https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf)
This model obtains a top-1 accuracy of 57.4% and a top-5 accuracy
(Using the average of 10 crops, (4 + 1 center) * 2 mirror,
should obtain a bit higher accuracy still.)
## Quantization
CaffeNet-int8 is obtained by quantizing the fp32 CaffeNet model. We use [Intel® Neural Compressor](https://github.com/intel/neural-compressor) with the onnxruntime backend to perform quantization. See the [instructions](https://github.com/intel/neural-compressor/blob/master/examples/onnxrt/onnx_model_zoo/caffenet/README.md) to learn how to use Intel® Neural Compressor for quantization.
Make sure to specify the appropriate dataset path in the configuration file.
```bash
# --input_model expects a *.onnx model path
bash run_tuning.sh --input_model=path/to/model \
                   --config=caffenet.yaml \
                   --data_path=/path/to/imagenet \
                   --label_path=/path/to/imagenet/label \
                   --output_model=path/to/save
```
## References
* [ImageNet Classification with Deep Convolutional Neural Networks](https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf)