Paper reproduction question

Hello, your work is fantastic, thanks for open-sourcing it. I encountered an issue while testing the released weights. Could you please clarify which weights should be used as the visual encoder when testing `downstream_commentary_none_open.pth`?

I tried using both `pretrained_both.pth` and `pretrained_contrastive.pth` as the visual encoder weights, but the performance was relatively poor in both cases. Below are the performance metrics:

**pretrained_both.pth + downstream_commentary_none_open.pth:**
`{'BLEU-1': '21.895', 'BLEU-4': '3.428', 'METEOR': '20.290', 'ROUGE-L': '20.156', 'CIDER': '13.593'}`

**pretrained_contrastive.pth + downstream_commentary_none_open.pth:**
`{'BLEU-1': '22.251', 'BLEU-4': '3.037', 'METEOR': '20.946', 'ROUGE-L': '18.706', 'CIDER': '8.955'}`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Paper reproduction question #12

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Paper reproduction question #12

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions