-
@AlexKoff88, @alexsu52, can anyone help from your end?
-
Hello,
I am currently working on a project to perform inference with anomaly detection (AD) models on an embedded system and am investigating ways to perform quantization.
So far I have trained a PaDiM model on a custom dataset, and it performs very well.
I have already worked through the OpenVINO NNCF notebook but realized that the insertion of "Fake Quantization Layers" does not suit my project's needs.
I also attempted to quantize the trained PaDiM model, which I converted to TensorFlow, using the TensorFlow Lite converter, but this approach also seems to be a dead end.
Is there a way to perform "true" quantization to int8 with NNCF?
Or could someone point me in the right direction?
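To make clear what I mean by the difference between fake and "true" quantization, here is a minimal pure-Python sketch. The function names are hypothetical and the symmetric per-tensor scale is an assumption for illustration; this is not how NNCF or TensorFlow Lite implement it internally.

```python
def compute_scale(values, num_bits=8):
    """Symmetric per-tensor scale (assumption: signed, symmetric range)."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for int8
    max_abs = max(abs(v) for v in values)
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize_int8(values, scale):
    """'True' quantization: the result is stored as integers in [-127, 127]."""
    return [max(-127, min(127, round(v / scale))) for v in values]


def fake_quantize(values, scale):
    """Fake quantization: snap to the int8 grid, but keep float values."""
    return [q * scale for q in quantize_int8(values, scale)]


weights = [0.5, -1.27, 0.003, 1.0]  # made-up example values
scale = compute_scale(weights)

q = quantize_int8(weights, scale)   # integers, what I would like to deploy
fq = fake_quantize(weights, scale)  # floats, what fake-quantize layers emit
```

In the fake-quantized case the tensors remain floating point and only simulate the rounding error, whereas what I am after is a model whose weights and activations are actually stored and computed as int8.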