Description
I have applied post-training quantization to a fairly complex neural network architecture. Now I am using the TensorRT engine inspector to retrieve information about each individual layer and check whether it was quantized, since not all layer types and operations are supported for conversion to INT8 by TensorRT.
After collecting the layer information and discarding the columns I do not need, I am left with something I honestly cannot parse or understand.
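For reference, the per-layer summary was produced roughly like this (a minimal sketch: the layer names are the ones observed in the engine, but the surrounding JSON structure and the `Name`/`LayerType` keys are assumptions about the shape of the JSON that TensorRT's `IEngineInspector` emits, e.g. via `inspector.get_engine_information(trt.LayerInformationFormat.JSON)`):

```python
import json

# Hypothetical excerpt of the engine inspector's JSON output; the layer
# names below are the ones actually observed, the wrapping structure is
# an assumption.
layer_info = json.loads("""
{"Layers": [
  {"Name": "onnx::Conv_1063", "LayerType": "Constant"},
  {"Name": "Identity_76",     "LayerType": "Reformat"},
  {"Name": "onnx::Conv_1062", "LayerType": "Constant"},
  {"Name": "Identity_77",     "LayerType": "Reformat"}
]}
""")

# Discard every column except the layer name and its LayerType.
summary = [(layer["Name"], layer["LayerType"]) for layer in layer_info["Layers"]]
for name, layer_type in summary:
    print(f"{name}: {layer_type}")
```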
I know for a fact that the very first layers of my architecture are the first three layers of a ResNet34 (a Convolution, a ReLU, and a MaxPool), followed by another custom convolutional block. Why do the first four layers of my quantized model show up with LayerType Constant, Reformat, Constant, and Reformat, respectively, and with names like onnx::Conv_1063, Identity_76, onnx::Conv_1062, and Identity_77?
So far I have unfortunately been unable to find a guide on how to interpret these LayerTypes.