lizexu123
diff --git a/‎demo/auto-compression/detection/README.md
Lines changed: 0 additions & 57 deletions b/‎demo/auto-compression/detection/README.md
Lines changed: 0 additions & 57 deletions
diff --git a/‎demo/auto-compression/README.md renamed to ‎demo/auto_compression/README.md
Lines changed: 2 additions & 2 deletions b/‎demo/auto-compression/README.md renamed to ‎demo/auto_compression/README.md
Lines changed: 2 additions & 2 deletions
diff --git a/‎demo/auto-compression/configs/CV/mbv2_ptq_hpo.yaml renamed to ‎demo/auto_compression/configs/CV/mbv2_ptq_hpo.yaml b/‎demo/auto-compression/configs/CV/mbv2_ptq_hpo.yaml renamed to ‎demo/auto_compression/configs/CV/mbv2_ptq_hpo.yaml
diff --git a/‎demo/auto-compression/configs/CV/mbv2_qat_dis.yaml renamed to ‎demo/auto_compression/configs/CV/mbv2_qat_dis.yaml b/‎demo/auto-compression/configs/CV/mbv2_qat_dis.yaml renamed to ‎demo/auto_compression/configs/CV/mbv2_qat_dis.yaml
diff --git a/‎demo/auto-compression/configs/NLP/bert_asp_dis.yaml renamed to ‎demo/auto_compression/configs/NLP/bert_asp_dis.yaml b/‎demo/auto-compression/configs/NLP/bert_asp_dis.yaml renamed to ‎demo/auto_compression/configs/NLP/bert_asp_dis.yaml
diff --git a/‎demo/auto-compression/configs/NLP/bert_ptq_hpo.yaml renamed to ‎demo/auto_compression/configs/NLP/bert_ptq_hpo.yaml b/‎demo/auto-compression/configs/NLP/bert_ptq_hpo.yaml renamed to ‎demo/auto_compression/configs/NLP/bert_ptq_hpo.yaml
diff --git a/‎demo/auto-compression/configs/NLP/bert_qat_dis.yaml renamed to ‎demo/auto_compression/configs/NLP/bert_qat_dis.yaml b/‎demo/auto-compression/configs/NLP/bert_qat_dis.yaml renamed to ‎demo/auto_compression/configs/NLP/bert_qat_dis.yaml
diff --git a/‎demo/auto-compression/demo_glue.py renamed to ‎demo/auto_compression/demo_glue.py b/‎demo/auto-compression/demo_glue.py renamed to ‎demo/auto_compression/demo_glue.py
diff --git a/‎demo/auto-compression/demo_imagenet.py renamed to ‎demo/auto_compression/demo_imagenet.py b/‎demo/auto-compression/demo_imagenet.py renamed to ‎demo/auto_compression/demo_imagenet.py
diff --git a/‎demo/auto_compression/detection/README.md
Lines changed: 85 additions & 0 deletions b/‎demo/auto_compression/detection/README.md
Lines changed: 85 additions & 0 deletions
@@ -50,9 +50,9 @@ python tools/export_model.py \
     -o Global.save_inference_dir=infermodel_mobilenetv2
 ```
 #### 2.4 测试模型精度
-拷贝``infermodel_mobilenetv2``文件夹到``PaddleSlim/demo/auto-compression/``文件夹。
+拷贝``infermodel_mobilenetv2``文件夹到``PaddleSlim/demo/auto_compression/``文件夹。
 ```
-cd PaddleSlim/demo/auto-compression/
+cd PaddleSlim/demo/auto_compression/
 ```
 使用[eval.py](../quant/quant_post/eval.py)脚本得到模型的分类精度，压缩后的模型也可以使用同一个脚本测试精度：
 ```
 
@@ -0,0 +1,85 @@
+# 目标检测模型自动压缩
+
+本示例将介绍如何使用PaddleDetection中Inference部署模型进行自动压缩。
+
+## Benchmark
+
+- PP-YOLOE模型：
+
+| 模型  |  策略  | 输入尺寸 | mAP<sup>val<br>0.5:0.95 | 预测时延<sup><small>FP32</small><sup><br><sup>(ms) |预测时延<sup><small>FP32</small><sup><br><sup>(ms) | 预测时延<sup><small>INT8</small><sup><br><sup>(ms) |  配置文件 | Inference模型  |
+| :-------- |:-------- |:--------: | :---------------------: | :----------------: | :----------------: | :---------------: | :-----------------------------: | :-----------------------------: |
+| PP-YOLOE-l |  Base模型 | 640*640  |  50.9   |   11.2  |   7.7ms   |  -  |  [config](https://github.com/PaddlePaddle/PaddleDetection/blob/develop/configs/ppyoloe/ppyoloe_crn_l_300e_coco.yml) | [Model](https://bj.bcebos.com/v1/paddle-slim-models/detection/ppyoloe_crn_l_300e_coco.tar) |
+| PP-YOLOE-l |  量化+蒸馏 | 640*640  |  49.5   |   - |   -   |  6.7ms  |  [config](https://github.com/PaddlePaddle/PaddleSlim/tree/develop/demo/auto_compression/detection/configs/ppyoloe_l_qat_dist.yaml) | [Model](https://bj.bcebos.com/v1/paddle-slim-models/detection/ppyoloe_crn_l_300e_coco_quant.tar) |
+
+- mAP的指标均在COCO val2017数据集中评测得到。
+- PP-YOLOE模型在Tesla V100的GPU环境下测试，测试脚本是[benchmark demo](https://github.com/PaddlePaddle/PaddleDetection/tree/release/2.4/deploy/python)
+
+## 环境准备
+
+### 1. 准备数据
+
+本案例默认以COCO数据进行自动压缩实验，如果自定义COCO数据，或者其他格式数据，请参考[PaddleDetection数据准备文档](https://github.com/PaddlePaddle/PaddleDetection/blob/release/2.4/docs/tutorials/PrepareDataSet.md) 来准备数据。
+
+如果数据集为非COCO格式数据，请修改[configs](./configs)中reader配置文件中的Dataset字段。
+
+### 2. 准备需要量化的环境
+
+- PaddlePaddle >= 2.2
+- PaddleDet >= 2.4
+
+```shell
+pip install paddledet
+```
+
+注：安装PaddleDet的目的是为了直接使用PaddleDetection中的Dataloader组件。
+
+### 3. 准备待量化的部署模型
+
+如果已经准备好部署的`model.pdmodel`和`model.pdiparams`部署模型，跳过此步。
+
+根据[PaddleDetection文档](https://github.com/PaddlePaddle/PaddleDetection/blob/develop/docs/tutorials/GETTING_STARTED_cn.md#8-%E6%A8%A1%E5%9E%8B%E5%AF%BC%E5%87%BA) 导出Inference模型，具体可参考下方PP-YOLOE模型的导出示例：
+- 下载代码
+```
+git clone https://github.com/PaddlePaddle/PaddleDetection.git
+```
+- 导出预测模型
+
+```shell
+python tools/export_model.py \
+        -c configs/ppyoloe/ppyoloe_crn_l_300e_coco.yml \
+        -o weights=https://paddledet.bj.bcebos.com/models/ppyoloe_crn_l_300e_coco.pdparams \
+        trt=True \
+```
+
+**注意**：PP-YOLOE导出时设置`trt=True`旨在优化在TensorRT上的性能，其他模型不需要设置`trt=True`。
+
+或直接下载：
+```shell
+wget https://bj.bcebos.com/v1/paddle-slim-models/detection/ppyoloe_crn_l_300e_coco.tar
+tar -xf ppyoloe_crn_l_300e_coco.tar
+```
+
+### 4. 测试模型精度
+
+使用[run_main.py](run_main.py)脚本得到模型的mAP：
+```
+python3.7 run_main.py --config_path=./configs/ppyoloe_l_qat_dist.yaml --eval=True
+```
+
+**注意**：TinyPose模型暂不支持精度测试。
+
+## 开始自动压缩
+
+### 进行量化蒸馏自动压缩
+蒸馏量化自动压缩示例通过[run_main.py](run_main.py)脚本启动，会使用接口``paddleslim.auto_compression.AutoCompression``对模型进行量化训练。具体运行命令为：
+```
+python run_main.py --config_path=./configs/ppyoloe_l_qat_dist.yaml --save_dir='./output/' --devices='gpu'
+```
+
+
+## 部署
+
+可以参考[PaddleDetection部署教程](https://github.com/PaddlePaddle/PaddleDetection/tree/release/2.4/deploy)：
+- GPU上量化模型开启TensorRT并设置trt_int8模式进行部署；
+- CPU上可参考[X86 CPU部署量化模型教程](https://github.com/PaddlePaddle/Paddle-Inference-Demo/blob/master/docs/optimize/paddle_x86_cpu_int8.md)；
+- 移动端请直接使用[Paddle Lite Demo](https://github.com/PaddlePaddle/PaddleDetection/tree/release/2.4/deploy/lite)部署。