Skip to content

Lightweight Semantic Segmentation #308

@ovunctuzel-bc

Description

@ovunctuzel-bc

I was hoping to be able to come up with a design that leverages DinoV3 (smallest distilled transformer) features and produces relatively precise segmentation masks. I'm wondering if anyone has any recommendations since:

  • Linear probing or any head that works with intermediate layers operates on 1/16 resolution features, which results in coarse masks
  • Mask2Former approach is quite heavy, defeating the purpose of using a lightweight distilled model

I'd appreciate any advice.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions