Commit c1730f0

Update version to 0.7.0.0
1 parent 699f81c commit c1730f0

3 files changed: 38 additions, 3 deletions

README.md

Lines changed: 32 additions & 0 deletions
@@ -251,6 +251,38 @@ bounding box, detection probability and detection number.
 
 ## Changelog:
 
+### 2021-11-06 v0.7.0.0
+
+Because many updates have accumulated since the last release, the version jumps straight to v0.7.0.0.
+
+Additions:
+- Added experimental support for the msgpack serializer: reduces network traffic for embeddings by ~2x.
+- Output names are no longer required for detection models when building a TRT engine - the correct output order is now extracted
+from ONNX models.
+- Detection models can now be exported to a TRT engine with batch size > 1 - inference code doesn't support it yet, though
+they can now be used in Triton Inference Server without issues.
+
+Model Zoo:
+- Added support for WebFace600k based recognition models from InsightFace repo: `w600k_r50` and `w600k_mbf`
+- Added md5 check for models to allow automatic re-download if models have changed.
+- All `scrfd`-based models now support the batch dimension.
+
+Improvements:
+- 1.5x-2x faster SCRFD re-implementation with Numba: 4.5 ms vs 10 ms for the `lumia.jpg` example with
+`scrfd_10g_gnkps` and threshold = 0.3 (432 faces detected).
+- Moved the image normalization step to the GPU with the help of CuPy (4x lower data transfer from CPU to GPU, about 6%
+inference speedup, and some computations offloaded from the CPU).
+- 4.5x faster `face_align.norm_crop` implementation with the help of Numba and removal of unused computations
+(cropping 432 faces from the `lumia.jpg` example took 45 ms vs 205 ms).
+- Face crops are now extracted only when needed (when face data or embeddings are requested), improving
+detection-only performance.
+- Added Numba njit cache to reduce startup time on subsequent runs.
+- Logged timings are now rounded to ms for better readability.
+- Minor refactoring.
+
+Fixes:
+- The gender/age estimation model is currently not supported, so it is excluded from the model preparation step.
+
 ### 2021-09-09 v0.6.2.0
 
 REST-API
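
The Model Zoo entry above mentions an md5 check that triggers a re-download when a model file has changed. The commit does not show how that check is implemented, so the following is only a minimal sketch of the general idea using Python's standard `hashlib`. The helper names `file_md5` and `needs_download` are hypothetical and are not taken from the repository.

```python
import hashlib
from pathlib import Path


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large model files are never fully in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def needs_download(path, expected_md5):
    """Hypothetical helper: True if the file is missing or its checksum differs."""
    path = Path(path)
    if not path.is_file():
        return True
    return file_md5(path) != expected_md5.lower()
```

A caller would compare each local model file against a known checksum and fetch the file again only when `needs_download` returns `True`. MD5 is adequate here for detecting changed files, but it is not a defence against deliberate tampering.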

deploy_trt.sh

Lines changed: 5 additions & 2 deletions
@@ -1,7 +1,7 @@
 #! /bin/bash
 
 IMAGE='insightface-rest'
-TAG='v0.6.3.0'
+TAG='v0.7.0.0'
 
 # Change InsightFace-REST logging level (DEBUG,INFO,WARNING,ERROR)
 log_level=INFO
@@ -42,12 +42,14 @@ force_fp16=False
 det_model=scrfd_10g_gnkps
 
 # REC MODELS:
-## arcface_r100_v1, glintr100
+## arcface_r100_v1, glintr100, w600k_r50, w600k_mbf
 rec_model=glintr100
 ## Do not load recognition model:
 rec_ignore=False
 ## Maximum batch size for recognition model
 rec_batch_size=1
+## Maximum batch size for detection model, use only for building models for Triton Server
+det_batch_size=1
 
 # GENDER/AGE MODELS:
 ## genderage_v1
@@ -109,6 +111,7 @@ for i in $(seq 0 $(($n_gpu - 1)) ); do
     -e REC_NAME=$rec_model\
     -e REC_IGNORE=$rec_ignore\
     -e REC_BATCH_SIZE=$rec_batch_size\
+    -e DET_BATCH_SIZE=$det_batch_size\
     -e GA_NAME=$ga_model\
     -e GA_IGNORE=$ga_ignore\
     -e TRITON_URI=$triton_uri\
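
The script now passes `DET_BATCH_SIZE` into the container alongside `REC_BATCH_SIZE`. How the service reads these variables is not part of this diff (the repository's `env_parser.EnvConfigs` is imported in `app.py` but not shown), so the snippet below is only a sketch of one way such a variable could be parsed. The `read_batch_size` helper is hypothetical, as is the choice to fall back to 1 when the variable is unset, which mirrors the script's default.

```python
import os


def read_batch_size(name, default=1, env=None):
    """Hypothetical helper: read a positive integer batch size from an env mapping."""
    env = os.environ if env is None else env
    raw = env.get(name)
    if raw is None or raw.strip() == "":
        # Unset or empty: fall back to the default used in deploy_trt.sh.
        return default
    value = int(raw)
    if value < 1:
        raise ValueError(f"{name} must be >= 1, got {value}")
    return value


# Explicit mappings are passed here so the example does not depend on the real environment.
det_batch_size = read_batch_size("DET_BATCH_SIZE", env={"DET_BATCH_SIZE": "4"})
rec_batch_size = read_batch_size("REC_BATCH_SIZE", env={})
print(det_batch_size, rec_batch_size)  # prints "4 1"
```

Validating the value at startup surfaces a misconfigured batch size before any engine is built, rather than partway through model preparation.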

src/api_trt/app.py

Lines changed: 1 addition & 1 deletion
@@ -20,7 +20,7 @@
 from env_parser import EnvConfigs
 from schemas import BodyDraw, BodyExtract
 
-__version__ = "0.6.3.0"
+__version__ = "0.7.0.0"
 
 dir_path = os.path.dirname(os.path.realpath(__file__))
 