Panoptic-Deeplab functional full network #2

ign-saurav · 2025-09-10T23:20:04Z

Overview

This PR integrates Panoptic-DeepLab model support into the Tenstorrent codebase (tt-metal), enabling end-to-end semantic and instance segmentation on TTNN devices.

Input Resolution: (1x3x512x1024)

Source

Panoptic-DeepLab Model Tree

panoptic_deeplab
├── backbone
│ ├── stem
│ └── bottleneck
│
├── semantic_segmentation_head
│ ├── aspp
│ ├── res3
│ ├── res2
│ └── semantic_head
│
└── instance_segmentation_head
  ├── aspp
  ├── res3
  ├── res2
  │ ├── center_head
  │ └── offset_head

Results

PCC Score

Hardware: Wormhole n150

Head	Real weights and input data	Random weights and input data
Semantic Head	0.990338	0.999981
Instance Center Head	0.989921	1.0
Instance Offset Head	0.994153	0.999999
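For readers unfamiliar with the metric, the PCC scores above are Pearson correlation coefficients between the reference (PyTorch) output and the device output. The PR does not show how the test computes them, so the following is only a minimal, standard-library sketch of the textbook formula; the function name `pcc` and the sample values are hypothetical, and real tensors would need to be flattened to a 1-D sequence first.

```python
import math
from typing import Sequence


def pcc(expected: Sequence[float], actual: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length 1-D sequences.

    Hypothetical helper for illustration; not the comparison utility
    used by the tests in this PR.
    """
    if len(expected) != len(actual) or len(expected) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(expected)
    mean_e = math.fsum(expected) / n
    mean_a = math.fsum(actual) / n

    # Covariance and variances (unnormalised; the 1/n factors cancel).
    cov = math.fsum((e - mean_e) * (a - mean_a) for e, a in zip(expected, actual))
    var_e = math.fsum((e - mean_e) ** 2 for e in expected)
    var_a = math.fsum((a - mean_a) ** 2 for a in actual)

    if var_e == 0.0 or var_a == 0.0:
        raise ValueError("PCC is undefined when an input is constant")

    return cov / math.sqrt(var_e * var_a)


# Made-up values standing in for flattened reference and device outputs.
reference = [0.1, 0.4, 0.35, 0.8]
device_out = [0.11, 0.39, 0.36, 0.79]
score = pcc(reference, device_out)
```

A score close to 1.0 means the device output tracks the reference almost linearly; it does not by itself bound the absolute error.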

Tests

Full network test (with real weights and real input from image):
pytest models/experimental/panoptic_deeplab/tests/pcc/test_panoptic_deeplab.py

Demo Test:
python models/experimental/panoptic_deeplab/demo/panoptic_deeplab_demo.py -i models/experimental/panoptic_deeplab/resources/input.png -o <output-dir-path>

Performance:

Total device time: 78,156 µs
FPS: 12.8
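The two figures above are consistent if FPS is taken as the reciprocal of the total device time for a single-image batch. The PR does not state how the FPS was derived, so this is a sketch under that assumption; it ignores host-side overhead, and the function name is hypothetical.

```python
def fps_from_device_time(device_time_us: float, batch_size: int = 1) -> float:
    """Frames per second implied by a per-batch device time in microseconds.

    Assumes throughput is limited only by device time (no host overhead).
    """
    if device_time_us <= 0:
        raise ValueError("device time must be positive")
    return batch_size * 1_000_000 / device_time_us


fps = fps_from_device_time(78_156)
print(f"{fps:.1f}")  # prints 12.8
```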

Observations:

At present, the upsampling, depthwise convolution, and copy operations consume the most device time.

Checklist

models/experimental/panoptic_deeplab/reference/res_block.py

models/experimental/panoptic_deeplab/tt/custom_preprocessing.py

models/experimental/panoptic_deeplab/tt/panoptic_deeplab.py

models/experimental/panoptic_deeplab/tt/decoder.py

models/experimental/panoptic_deeplab/tt/common.py

models/experimental/panoptic_deeplab/tt/aspp.py

models/experimental/panoptic_deeplab/tests/test_resnet52_bottleneck.py

models/experimental/panoptic_deeplab/tests/test_aspp.py

ign-navaneethk · 2025-09-16T15:08:48Z

Working on a cleaner reference implementation and weight processing, will push after rebasing properly. ASPP layer will still need some work to align it to Detectron2's implementation of the reference model.

ign-navaneethk · 2025-09-17T06:25:59Z

> Working on a cleaner reference implementation and weight processing, will push after rebasing properly. ASPP layer will still need some work to align it to Detectron2's implementation of the reference model.

The changes have been pushed.

ign-saurav · 2025-09-19T05:34:27Z

Hi @mbezuljTT, could you please review the PR?

mbezuljTT · 2025-09-19T05:51:21Z

what is the resolution for the shared performance?

ign-saurav · 2025-09-19T12:38:49Z

> what is the resolution for the shared performance?

Resolution: 1x3x512x1024

mbezuljTT · 2025-09-26T07:54:23Z

@ianastasijevicTT on my team has reviewed your PR; generally it's an OK starting point for the functional PanopticDeepLab.

We have our own version of the model being merged into the main tt-metal repo as we speak; it's optimized for a special case of Blackhole and 20 cores.

We would have to figure out the easiest way to have two flavors of the model, and at this time we don't want to merge this into tt-metal. FYI @mbahnasTT

@ign-saurav if you want to continue optimization of this model we can share some pointers.

ign-saurav · 2025-09-26T09:17:56Z

> @ianastasijevicTT on my team has reviewed your PR; generally it's an OK starting point for the functional PanopticDeepLab.
>
> We have our own version of the model being merged into the main tt-metal repo as we speak; it's optimized for a special case of Blackhole and 20 cores.
>
> We would have to figure out the easiest way to have two flavors of the model, and at this time we don't want to merge this into tt-metal. FYI @mbahnasTT
>
> @ign-saurav if you want to continue optimization of this model we can share some pointers.

@mbezuljTT, sure, please share the pointers for optimization.

ign-saurav changed the title ~~panoptic-deeplab full network functional~~ Panoptic-Deeplab functional full network Sep 11, 2025

ign-msati reviewed Sep 11, 2025

models/experimental/panoptic_deeplab/reference/res_block.py (outdated)

models/experimental/panoptic_deeplab/tt/custom_preprocessing.py

ign-msati reviewed Sep 11, 2025


ign-saurav and others added 17 commits September 16, 2025 05:18

panoptic-deeplab full network functional

f3123eb

torch output tensor shape error fix

9bd023e

cleanUp WIP

39e0934

cleanUp WIP

89f2af7

cleanUp WIP

b36d3fe

reverted to old custom_preprocessor

d39d81a

decoder test fix

d6de981

[WIP]Adds trained weight loading support

f361094

Trained weight loading for unit tests

e88f046

Fixes layer names in backbone and minor cleanup

75017ea

refactored full net test

912dcfa

clean full net test

5414f87

README WIp

a507e51

custom_preprocessor refactor

8b18461

refactored decoder test

cc07c6e

Demo added

c234a3f

uniform test infra

4e738ec

ign-saurav force-pushed the ign/panoptic_deeplab branch from 6c5b055 to 4e738ec Compare September 16, 2025 05:46

ign-saurav and others added 6 commits September 16, 2025 08:14

fixed errors due to rebase to latest main

5bad0c8

activation default set to None as per latest changes

db25fef

image path fix

aa7eda5

refactored common.py and some other files

05bd832

refactor decoder test and demo bug fix

2860ca7

panoptic output added

cda67fb

enabled enable_act_double_buffer

4f12a5f

ign-saurav and others added 5 commits September 16, 2025 16:33

for res3 reduced size act_block_h due to memory overlapping error

6eb6159

reduced act_block_h in aspp

0bbe5ab

demo file refactoring

a8f8e41

Updates refernce model for reduced checkpoint key mappings

d414cd3

Splits common file and fixes import issues

abfe9d3

ign-navaneethk and others added 6 commits September 17, 2025 07:01

Porting of test files

9ed15f6

Updates ASPP and Res blocks for easier weight loading

71b864c

Fixes copyright string in decoder file

b474ef5

perf_test fix , runner added and fixed multiple run

204f980

updated README file and resolved comments

a0d02ce

cleanup post-processing and demo files

61734ba

ign-saurav marked this pull request as ready for review September 17, 2025 14:27

labels added to panoptic segmentation

f229cbb

instances added for the labels with multiple occurences

5cae782

7 participants
