Skip to content

Adding SAM3-LiteText #44205

@SimonZeng7108

Description

@SimonZeng7108

Model description

I would like to propose adding SAM3-LiteText. This model introduces a highly efficient, lightweight text-prompting capability to the SAM3 architecture. It offers excellent performance for text-guided segmentation tasks while maintaining a small computational footprint (params reduced by 80%), making it a fantastic candidate for transformers integration. @NielsRogge

The modular implementation should be relatively straightforward. The architecture builds upon SAM3 and replaced it's text encoder by mobileclip text encoders. It should be highly feasible to map its components to native transformers modules, re-using existing ViT and text encoder building blocks where possible.

Open source status

[x] The model implementation is available

[x] The model weights are available

Provide useful links for the implementation

authors: @SimonZeng7108

original repo: https://github.com/SimonZeng7108/efficientsam3/tree/sam3_litetext

weights: https://huggingface.co/Simon7108528/EfficientSAM3/tree/main/sam3_litetext

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions