Add DEIMV2 Object Detection Model #5033
base: develop
Conversation
Model manifests are to be updated after a decision regarding which DETR models we want to expose.
Pull request overview
This PR adds the DEIMv2 object detection model to the OTX training extensions platform. DEIMv2 is an improved detection transformer that combines a DINOv3 backbone with Spatial Token Attention (STA) and Fine-grained Distribution Refinement (FDR) for enhanced object detection performance.
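At a high level, the pieces named above chain together: the DINOv3/STA backbone produces multi-scale features, the hybrid encoder fuses them, and the FDR-equipped decoder predicts boxes. A minimal sketch of that composition follows; the class names and call signatures are assumptions for illustration, not the actual OTX API.

```python
import torch
from torch import nn


class DEIMv2Sketch(nn.Module):
    """Illustrative composition only; the real classes live under
    otx.backend.native.models.detection (names here are assumed)."""

    def __init__(self, backbone: nn.Module, encoder: nn.Module, decoder: nn.Module) -> None:
        super().__init__()
        self.backbone = backbone  # DINOv3/ViT + STA -> multi-scale feature maps
        self.encoder = encoder    # hybrid FPN/PAN fusion over those features
        self.decoder = decoder    # transformer decoder with FDR box refinement

    def forward(self, images: torch.Tensor) -> dict[str, torch.Tensor]:
        feats = self.backbone(images)  # feature maps at several strides
        fused = self.encoder(feats)    # fused multi-scale features
        return self.decoder(fused)     # class logits and refined boxes
```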
Key Changes:
- Added DEIMv2 model architecture with DINOv3/ViT-Tiny backbone and STA module
- Implemented transformer decoder with FDR for bounding box regression
- Added comprehensive unit tests and performance benchmarks
- Introduced data augmentation scheduling and multi-scale training support (see the scheduling sketch after this list)
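On the augmentation-scheduling point, DEIM-style training typically runs strong augmentations for most of the run and drops to a minimal pipeline for the final epochs. Below is a minimal sketch of that idea, assuming torchvision v2 transforms and a made-up epoch threshold; the actual schedule lives in the deimv2_*.yaml recipes.

```python
# Hypothetical sketch of epoch-based augmentation scheduling.
from torchvision.transforms import v2

strong_pipeline = v2.Compose([
    v2.RandomPhotometricDistort(p=0.5),
    v2.RandomHorizontalFlip(p=0.5),
    v2.Resize((640, 640)),
])

# Minimal "no_aug" tail: resizing plus the benign horizontal flip
# discussed in the review thread below.
no_aug_pipeline = v2.Compose([
    v2.RandomHorizontalFlip(p=0.5),
    v2.Resize((640, 640)),
])


def pipeline_for_epoch(epoch: int, total_epochs: int, no_aug_tail: int = 10) -> v2.Compose:
    """Use the minimal pipeline for the last `no_aug_tail` epochs."""
    if epoch >= total_epochs - no_aug_tail:
        return no_aug_pipeline
    return strong_pipeline
```

Multi-scale training would additionally sample the Resize target from a set of scales each iteration rather than fixing it at 640.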
Reviewed changes
Copilot reviewed 43 out of 43 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| library/src/otx/backend/native/models/detection/deimv2.py | DEIMv2 model class with factory pattern for model variants (x/l/m/s); see the sketch after this table |
| library/src/otx/backend/native/models/detection/backbones/dinov3sta.py | DINOv3 backbone with Spatial Token Attention for multi-scale features |
| library/src/otx/backend/native/models/detection/heads/deim_decoder.py | DEIM transformer decoder with FDR mechanism |
| library/src/otx/backend/native/models/detection/necks/dfine_hybrid_encoder.py | Hybrid encoder with FPN/PAN for feature fusion |
| library/src/otx/recipe/detection/deimv2_*.yaml | Training recipes for all DEIMv2 variants |
| library/tests/unit/backend/native/models/detection/test_deimv2.py | Comprehensive unit tests for DEIMv2 model |
| library/tests/perf_v2/tasks/detection.py | Performance test configuration for DEIMv2 variants |
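The deimv2.py entry above mentions a factory pattern over the x/l/m/s variants. A hedged sketch of what such a factory typically looks like; the hyper-parameter values and names below are invented for illustration, since the real ones come from the model class and recipes.

```python
# Hypothetical variant factory; values are placeholders, not OTX's.
from dataclasses import dataclass


@dataclass
class DEIMv2Config:
    embed_dim: int
    num_decoder_layers: int
    num_queries: int


_VARIANTS = {
    "s": DEIMv2Config(embed_dim=192, num_decoder_layers=3, num_queries=300),
    "m": DEIMv2Config(embed_dim=256, num_decoder_layers=4, num_queries=300),
    "l": DEIMv2Config(embed_dim=256, num_decoder_layers=6, num_queries=300),
    "x": DEIMv2Config(embed_dim=384, num_decoder_layers=6, num_queries=300),
}


def build_deimv2(variant: str) -> DEIMv2Config:
    """Look up a variant config, failing loudly on unknown names."""
    try:
        return _VARIANTS[variant]
    except KeyError as err:
        raise ValueError(f"Unknown DEIMv2 variant: {variant!r}") from err
```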
Comments suppressed due to low confidence (1)
library/src/otx/backend/native/models/detection/backbones/dinov3sta.py
- Duplicate code: lines 533-536 and 537-539 both check `if self.eval_spatial_size` and generate anchors. The second block overwrites the registered buffers from the first block. Remove the first block (lines 533-536), as it is redundant.
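The suggested fix reduces to a single guarded block that generates and registers the eval-time anchors once. A minimal sketch of that shape, with assumed method and buffer names (`_generate_anchors`, `anchors`, `valid_mask`) that may differ from the actual code in dinov3sta.py:

```python
# Sketch of the deduplicated guard (names assumed, not copied from the PR).
def _register_eval_anchors(self) -> None:
    # Anchors can be precomputed only when the eval resolution is fixed;
    # otherwise they must be generated per batch at the actual input size.
    if self.eval_spatial_size:
        anchors, valid_mask = self._generate_anchors(self.eval_spatial_size)
        self.register_buffer("anchors", anchors)
        self.register_buffer("valid_mask", valid_mask)
```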
library/docs/source/guide/explanation/algorithms/object_detection/object_detection.rst (outdated; comment resolved)
        init_args:
          scale: [640, 640]
          keep_ratio: false
      - class_path: otx.data.transform_libs.torchvision.RandomFlip
The policy is called no_aug, which I assume means "no augmentation". However, it uses RandomFlip, which is an augmentation. Is this intended?
RandomFlip is a very basic augmentation that gently enlarges the training distribution. It is common to always include it by default; however, there is no experimental proof of this here, it is just common practice.
Summary
resolves #5015
How to test
otx train --config src/otx/recipe/detection/deimv2_l.yaml --data_root tests/assets/car_tree_bug

Checklist