What are the input and output tensors for detection and recognition from a .onnx file? #14659
Replies: 1 comment
-
The input and output tensors for detection and recognition from a 1. Input Tensor:For both detection and recognition models, the input tensor is an image represented as a 4D tensor with the format:
For example, an image input tensor might have a shape like: (1, 3, H, W) # Where H and W are dynamically defined image dimensions. 2. Output Tensors:Detection Model Output:The detection model outputs bounding boxes around detected text regions. The output typically consists of:
Recognition Model Output:The recognition model takes each cropped text image from the detection step and produces the recognized text along with confidence scores.
3. How to Forward the Input and Interpret the Output
By following this workflow, you can use the Response generated by feifei-bot | chatgpt-4o-latest |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
What do I forward and what is the result and how do I interpret it?
Beta Was this translation helpful? Give feedback.
All reactions