Cross-modal retrieval on COCO

Hi! Thank you for your work! I have a question about cross-modal retrieval on COCO: I am struggling to reproduce the results reported in the paper. Could you provide more details on your training protocol? Specifically, which coefficients do you use for VICReg, what expander architecture do you use, and do you do any further downstream training, or do you directly use the encoder embeddings obtained via SSL?

Thank you!

Cross-modal retrieval on COCO #22
