Accuracy issues moving from AIMET-ONNX to SNPE DLC #4010
Hi AIMET team, I'm seeing a large accuracy gap between the QDQ ONNX model exported via to_onnx_qdq() (and tested with onnxruntime) and the DLC I generate with SNPE. I am trying to quantize a pose estimation model (Lite-HRNet) and I am visually evaluating the results on a set of images. Following the exact same pre- and post-processing steps, I get almost identical results from the FP32 ONNX, the QDQ ONNX (produced by AIMET), and the FP16 converted DLC. But when I try the INT8 quantized DLC (converted using the .onnx and .encodings from AIMET), the results are way off. Here is the pipeline I'm following:
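For context, the conversion path from an AIMET export (.onnx plus .encodings) to an INT8 DLC is typically driven by the SNPE converter tools. A hedged sketch with hypothetical file names; the exact flags may differ by SNPE version, so check your release's documentation:

```shell
# Hypothetical file names; --quantization_overrides imports the AIMET
# .encodings file into the converter.
snpe-onnx-to-dlc --input_network model.onnx \
                 --quantization_overrides model.encodings \
                 --output_path model_fp.dlc

# Quantize to INT8; --override_params tells the quantizer to keep the
# imported AIMET encodings instead of recomputing them from input_list.
snpe-dlc-quantize --input_dlc model_fp.dlc \
                  --input_list calibration_inputs.txt \
                  --override_params \
                  --output_dlc model_int8.dlc
```

If --override_params is omitted, the quantizer derives fresh encodings from the calibration inputs, which can silently diverge from what AIMET simulated.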
Things I have already tried
Questions
I have searched through existing discussions and issues, but most similar posts involve PyTorch models instead of ONNX or use QNN rather than SNPE for deployment; I have tried the suggestions from those threads without success, so I'm opening this new discussion.
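For anyone hitting a similar gap: the core arithmetic that both the QDQ ONNX graph and the INT8 DLC rely on is a per-tensor affine quantize-dequantize. A minimal stdlib-only sketch of that step (the scale/offset values below are made up for illustration, not taken from any model, and an unsigned 8-bit zero-point-style convention is assumed):

```python
def quantize_dequantize(x, scale, offset, bw=8):
    """Simulate one quantize-dequantize (QDQ) node:
        q  = clamp(round(x / scale) + offset, qmin, qmax)
        x' = (q - offset) * scale
    A mismatch between the exported .encodings and the scale/offset the
    runtime actually applies shows up as exactly this kind of error."""
    qmin, qmax = 0, 2 ** bw - 1            # unsigned 8-bit integer grid
    q = round(x / scale) + offset
    q = max(qmin, min(qmax, q))            # clamp to the representable range
    return (q - offset) * scale

# Example: a hypothetical activation encoding covering roughly [-1, 1]
scale, offset = 2.0 / 255, 128
print(quantize_dequantize(0.5, scale, offset))   # ≈ 0.502 (error ≈ 0.002)
```

Values outside the encoded range saturate at the clamp, so an encoding range that is too narrow for the deployed activations produces large, systematic errors rather than small rounding noise.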
Replies: 3 comments 6 replies
Hi @ptoupas, thanks for reaching out to us, and apologies for the delayed response.
Here is the support forums link for SNPE (Qualcomm Neural Processing SDK), QNN (Neural Network SDK), QAIRT, etc.: https://mysupport.qualcomm.com/supportforums/s/topic/0TO4V0000001XL9WAM/ai-sdks-tools

Note: please don't use the above forum for AIMET questions; just create discussion threads here for AIMET.
Hi