Paper reproduction question #12

@Jiang0903

Hello, your work is fantastic, and thank you for open-sourcing it. I ran into an issue while testing the released weights. Could you please clarify which weights should be loaded as the visual encoder when testing downstream_commentary_none_open.pth?

I tried both pretrained_both.pth and pretrained_contrastive.pth as the visual encoder weights, but performance was poor in both cases. The metrics for each combination are below:

pretrained_both.pth + downstream_commentary_none_open.pth:
{'BLEU-1': '21.895', 'BLEU-4': '3.428', 'METEOR': '20.290', 'ROUGE-L': '20.156', 'CIDER': '13.593'}

pretrained_contrastive.pth + downstream_commentary_none_open.pth:
{'BLEU-1': '22.251', 'BLEU-4': '3.037', 'METEOR': '20.946', 'ROUGE-L': '18.706', 'CIDER': '8.955'}
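
For anyone reproducing this, one quick sanity check is to compare the parameter names in the encoder checkpoint against those in the downstream checkpoint, to see whether the downstream file already carries its own visual-encoder weights. The snippet below is only a minimal sketch: the helper name `compare_checkpoint_keys` and the example key names are hypothetical, and it assumes each checkpoint is a flat mapping from parameter name to tensor, which may not match how this repository saves its files.

```python
from typing import Dict, Iterable, Set


def compare_checkpoint_keys(
    encoder_keys: Iterable[str], downstream_keys: Iterable[str]
) -> Dict[str, Set[str]]:
    """Split parameter names into shared, encoder-only, and downstream-only sets."""
    enc, down = set(encoder_keys), set(downstream_keys)
    return {
        "shared": enc & down,            # present in both checkpoints
        "encoder_only": enc - down,      # only in the pretrained encoder file
        "downstream_only": down - enc,   # only in the downstream file
    }


# Hypothetical key names, for illustration only (not taken from the real files).
report = compare_checkpoint_keys(
    ["visual.conv1.weight", "visual.proj.weight"],
    ["visual.proj.weight", "decoder.fc.weight"],
)
for name, keys in report.items():
    print(name, sorted(keys))
```

With the real files, the key lists could come from something like `torch.load(path, map_location="cpu").keys()`, assuming the file is a plain state dict rather than a wrapper dictionary. If many keys are shared, whichever checkpoint is loaded last would overwrite the other for those parameters, which could be one possible explanation for the low scores.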
