-
Notifications
You must be signed in to change notification settings - Fork 32.2k
Description
Model description
I would like to propose adding SAM3-LiteText. This model introduces a highly efficient, lightweight text-prompting capability to the SAM3 architecture. It offers excellent performance for text-guided segmentation tasks while maintaining a small computational footprint (params reduced by 80%), making it a fantastic candidate for transformers integration. @NielsRogge
The modular implementation should be relatively straightforward. The architecture builds upon SAM3 and replaced it's text encoder by mobileclip text encoders. It should be highly feasible to map its components to native transformers modules, re-using existing ViT and text encoder building blocks where possible.
Open source status
[x] The model implementation is available
[x] The model weights are available
Provide useful links for the implementation
authors: @SimonZeng7108
original repo: https://github.com/SimonZeng7108/efficientsam3/tree/sam3_litetext
weights: https://huggingface.co/Simon7108528/EfficientSAM3/tree/main/sam3_litetext
Open source status
- The model implementation is available
- The model weights are available
Provide useful links for the implementation
No response