Resnet Optimization

While the major focus of this repository is to demonstrate the performance of optimization through the use of many stacks or frameworks, it also implements the Resnet CNN (Convolution Neutal Network) architecture for the classicalation of five flowers.

Installation

Clone repository

  git clone https://github.com/rungrodkspeed/resnet50_optimization

Install with pip

  pip install -r requirement.txt

For detail TensorRT backed Installation : https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html

or Install with docker

  docker build -t resnet_optim .

then

  docker run --gpus=1 -it --rm resnet_optim

**Be careful the image size is 19.5 GiB

Optimizations

- smaller size
- faster
- more efficient

Framework	Size (MiB)
Pytorch	195
ONNX	97.4
TensorRT	51.1

Batch(es)	Pytorch (CPU)	Pytorch (GPU)	onnxruntime (CPU)	onnxruntime (GPU)	TensorRT
1	17.94 FPS	5.79 FPS	55.62 FPS	not supported for CUDA 12	395.39 FPS
8	16.82 FPS	18.18 FPS	47.54 FPS	not supported for CUDA 12	1958.92 FPS
16	16.45 FPS	72.59 FPS	36.69 FPS	not supported for CUDA 12	2154.45 FPS
32	16.26 FPS	115.96 FPS	38.22 FPS	not supported for CUDA 12	2335.60 FPS
64	14.43 FPS	CUDA out of memory	38.05 FPS	not supported for CUDA 12	2523.06 FPS

Hardware : AMD Ryzen 7 5800H with Radeon Graphics 3.20 GHz Processor, NVIDIA GeForce RTX 3060 Laptop GPU
nvidia-driver : 531.97
CUDA version : 12.1

Deployment

more efficient about performance by Triton stack.

Pull images NGC triton inference server.
```
nvcr.io/nvidia/tritonserver:23.06-py3
```

Create model repository.

<model-repository-path>/
    <model-name>/
        config.pbtxt
        1/
            model.plan

For more detail : https://github.com/triton-inference-server/server/blob/main/docs/user_guide/model_repository.md

Create config.pbtxt

name: "resnet50"
platform: "tensorrt_plan"
max_batch_size: 128
input [
  {
    name: "input"
    data_type: TYPE_FP32
    dims: [ 3, 224, 224 ]
  }

]
output [
  {
    name: "output"
    data_type: TYPE_FP32
    dims: [ 1000 ]
  }
]

default_model_filename: "resnet50.plan"

Launch Triton Server

docker run --name=resnet-triton-container --shm-size='1g' -d --gpus=1 --rm -p8000:8000 -p8001:8001 -p8002:8002 resnet-triton

Launch Client

Python client

docker run --name=resnet-client-python-container -d --rm -p 8888:8888 resnet-client-python

Golang client

docker run --name=resnet-client-python-container -d --rm -p 8888:8888 resnet-client-golang

Inference by send requests to Triton server.
- Python client
```
python3 /app_python/request.py
```
- Golang client
```
python3 /app_golang/request.py
```

Model Analyzer

Summary about Resnet50 on Triton server.
(For watch more high resolution at /analyzer_result/reports/summaries/resnet50 directory)

From model-analyzer the best config.pbtxt is :

name: "resnet50_config_30"
platform: "tensorrt_plan"
max_batch_size: 64
input {
  name: "input"
  data_type: TYPE_FP32
  dims: 3
  dims: 224
  dims: 224
}
output {
  name: "output"
  data_type: TYPE_FP32
  dims: 1000
}
instance_group {
  count: 4
  kind: KIND_GPU
}
default_model_filename: "resnet50.plan"
dynamic_batching {
}

For deep detail :
(For watch more high resolution at /analyzer_result/reports/detailed/resnet50_config_30 directory)

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
analyzer_result		analyzer_result
app_golang		app_golang
app_python		app_python
converter		converter
inference		inference
models		models
sample		sample
serving		serving
test		test
train		train
utils		utils
.dockerignore		.dockerignore
.env		.env
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
benchmark.py		benchmark.py
check_onnx_stucture.py		check_onnx_stucture.py
export.py		export.py
request.py		request.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Resnet Optimization

Installation

Optimizations

Deployment

Model Analyzer

About

Uh oh!

Releases

Packages

Uh oh!

Languages

ccyrene/resnet50_optimization

Folders and files

Latest commit

History

Repository files navigation

Resnet Optimization

Installation

Optimizations

Deployment

Model Analyzer

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages