Change RT-Detr docs to reflect fixed 640x640 input size #41364

konstantinos-p · 2025-10-06T11:05:25Z

What does this PR do?

The authors of RT-Detr state that the model was trained on 640x640 images and is meant to be used for inference on 640x640 images. The current implementation also has quirks that make training or running inference on images of other sizes problematic. For example, the pixel masks used when batching images of varying sizes are discarded: in the line below, `mask` is unpacked but never used. I've added a few lines to the docs to notify users about these issues.

proj_feats = [self.encoder_input_proj[level](source) for level, (source, mask) in enumerate(features)]
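To illustrate why dropping the mask matters, here is a minimal sketch in plain Python. It is not the transformers implementation: `pad_batch`, `mean_ignoring_mask`, and `mean_with_mask` are hypothetical helpers, and a simple mean stands in for whatever the encoder computes over the feature map. It only shows that padded pixels are treated as real image content once the mask is discarded.

```python
# Hypothetical illustration only; none of these helpers exist in transformers.

def pad_batch(images, pad_value=0.0):
    """Pad 2D images (lists of rows) to a common size and build pixel masks."""
    max_h = max(len(img) for img in images)
    max_w = max(len(img[0]) for img in images)
    padded, masks = [], []
    for img in images:
        h, w = len(img), len(img[0])
        padded.append([
            [img[r][c] if r < h and c < w else pad_value for c in range(max_w)]
            for r in range(max_h)
        ])
        # 1 marks a real pixel, 0 marks padding.
        masks.append([
            [1 if r < h and c < w else 0 for c in range(max_w)]
            for r in range(max_h)
        ])
    return padded, masks


def mean_ignoring_mask(image):
    """Average over every pixel, padding included (the mask is dropped)."""
    values = [v for row in image for v in row]
    return sum(values) / len(values)


def mean_with_mask(image, mask):
    """Average over real pixels only."""
    values = [v for row, mrow in zip(image, mask) for v, m in zip(row, mrow) if m]
    return sum(values) / len(values)


small = [[1.0, 1.0], [1.0, 1.0]]        # 2x2 image
large = [[1.0] * 4 for _ in range(4)]   # 4x4 image
padded, masks = pad_batch([small, large])

unmasked = mean_ignoring_mask(padded[0])   # padding dilutes the statistic
masked = mean_with_mask(padded[0], masks[0])
```

With a fixed 640x640 input, every image in a batch has the same shape, no padding is introduced, and the two statistics coincide.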

Fixes #41363

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Models:

vision models: @yonigozlan @molbap
documentation: @stevhliu


Konstantinos Pitas added 2 commits October 6, 2025 12:44

Batching not possible with variable image sizes

29bda6e
