YML question, EastRandomCropData size -96 -320 which one is height or width #14679

felixho789 · 2025-02-13T23:50:36Z

felixho789
Feb 13, 2025

- EastRandomCropData:
    size:
    - 96
    - 320
    max_tries: 50
    keep_ratio: true

is 96 height or or width ?

Answered by GreatV

Feb 13, 2025

In the configuration:

- EastRandomCropData:
    size:
    - 96
    - 320
    max_tries: 50
    keep_ratio: true

The values under size represent the height and width of the cropped image. Typically, in PaddleOCR and similar frameworks, the convention is:

The first value (96) refers to the height.
The second value (320) refers to the width.

This convention aligns with the way image dimensions are usually specified in deep learning frameworks, where height comes first, followed by width.

If you need further confirmation, you can check other parts of the configuration file, such as image_shape in RecConAug, where similar patterns are followed:

image_shape:
- 48
- 320
- 3

Here, 48 is the hei…

View full answer

GreatV · 2025-02-13T23:51:20Z

GreatV
Feb 13, 2025
Maintainer

In the configuration:

- EastRandomCropData:
    size:
    - 96
    - 320
    max_tries: 50
    keep_ratio: true

The values under size represent the height and width of the cropped image. Typically, in PaddleOCR and similar frameworks, the convention is:

The first value (96) refers to the height.
The second value (320) refers to the width.

This convention aligns with the way image dimensions are usually specified in deep learning frameworks, where height comes first, followed by width.

If you need further confirmation, you can check other parts of the configuration file, such as image_shape in RecConAug, where similar patterns are followed:

image_shape:
- 48
- 320
- 3

Here, 48 is the height and 320 is the width. This supports the interpretation that size: [96, 320] in EastRandomCropData also follows the same order.

Response generated by feifei-bot | chatgpt-4o-latest

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

YML question, EastRandomCropData size -96 -320 which one is height or width #14679

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

YML question, EastRandomCropData size -96 -320 which one is height or width #14679

Uh oh!

felixho789 Feb 13, 2025

Replies: 1 comment

Uh oh!

GreatV Feb 13, 2025 Maintainer

felixho789
Feb 13, 2025

GreatV
Feb 13, 2025
Maintainer