How to fuse QuantizeLinear Node in this case? #4517

I'm new to QAT. When I convert the ONNX model to a TensorRT engine, I see that the "Add" layer runs in FP16 mode instead of INT8. Do you have any suggestions? How should I place the Q/DQ nodes so that they end up in the right positions?

My trtexec command is:

        trtexec \
        --onnx=${onnx_path} \
        --fp16 \
        --int8 \
        --best \
        --verbose \
        --saveEngine=${trt_path} \
        --warmUp=500 \
        --duration=10 \
        --iterations=100 \
        --useCudaGraph \
        --useSpinWait \
        --noDataTransfers \
        --profilingVerbosity=detailed \
        --minShapes=images:1x3x40x40 \
        --optShapes=images:1x3x640x640 \
        --maxShapes=images:1x3x640x640 \
        > verbose.log

This is my ONNX graph:

<img width="565" height="1281" alt="Image" src="https://github.com/user-attachments/assets/5419ad59-2ab0-4f3b-8168-c1e691540d7b" />

This is the TensorRT engine graph:

<img width="582" height="1135" alt="Image" src="https://github.com/user-attachments/assets/2cd38eab-0bee-4cdf-ab23-ed23cdcfc648" />
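To make the Q/DQ placement around the "Add" easier to discuss than from the screenshots alone, here is a minimal Python sketch that lists, for every Add node, which op produces each of its inputs. It only relies on the `op_type`, `name`, `input`, and `output` fields that ONNX nodes expose. The function names `describe_add_inputs` and `load_onnx_nodes` and the toy graph are hypothetical, made up for illustration; the toy graph is not my real model. The script only reports what feeds each Add. Whether TensorRT then runs the Add in INT8 depends on its own fusion rules, which this does not model.

```python
from types import SimpleNamespace


def describe_add_inputs(nodes):
    """For each Add node, report which op type produces each input.

    `nodes` is any iterable of objects with `name`, `op_type`, `input`
    and `output` attributes (ONNX NodeProto objects have these fields).
    Inputs with no producing node (graph inputs or initializers) are
    reported as "<graph input or initializer>".
    """
    nodes = list(nodes)

    # Map every tensor name to the node that produces it.
    producer = {}
    for node in nodes:
        for out_name in node.output:
            producer[out_name] = node

    report = []
    for node in nodes:
        if node.op_type != "Add":
            continue
        sources = []
        for in_name in node.input:
            src = producer.get(in_name)
            src_type = src.op_type if src is not None else "<graph input or initializer>"
            sources.append((in_name, src_type))
        report.append((node.name, sources))
    return report


def load_onnx_nodes(path):
    """Load the node list from an ONNX file (requires the `onnx` package)."""
    import onnx  # imported lazily so the toy example below runs without it

    return list(onnx.load(path).graph.node)


# Hypothetical toy graph: a Conv output added to a Q/DQ'd skip branch.
toy_graph = [
    SimpleNamespace(name="conv1", op_type="Conv",
                    input=["x_dq", "w_dq"], output=["conv1_out"]),
    SimpleNamespace(name="skip_q", op_type="QuantizeLinear",
                    input=["skip", "scale", "zero_point"], output=["skip_q_out"]),
    SimpleNamespace(name="skip_dq", op_type="DequantizeLinear",
                    input=["skip_q_out", "scale", "zero_point"], output=["skip_dq_out"]),
    SimpleNamespace(name="add1", op_type="Add",
                    input=["conv1_out", "skip_dq_out"], output=["add1_out"]),
]

result = describe_add_inputs(toy_graph)
for add_name, sources in result:
    print(add_name, sources)
```

On a real model, the same report would come from `describe_add_inputs(load_onnx_nodes(onnx_path))`, where `onnx_path` is the file passed to `--onnx` above.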
