-
Notifications
You must be signed in to change notification settings - Fork 14
Description
Hello, your work is fantastic, thanks for open-sourcing it. I encountered an issue while testing the released weights. Could you please clarify which weights should be used as the visual encoder when testing downstream_commentary_none_open.pth?
I tried using both pretrained_both.pth and pretrained_contrastive.pth as the visual encoder weights, but the performance was relatively poor in both cases. Below are the performance metrics:
pretrained_both.pth + downstream_commentary_none_open.pth:
{'BLEU-1': '21.895', 'BLEU-4': '3.428', 'METEOR': '20.290', 'ROUGE-L': '20.156', 'CIDER': '13.593'}
pretrained_contrastive.pth + downstream_commentary_none_open.pth:
{'BLEU-1': '22.251', 'BLEU-4': '3.037', 'METEOR': '20.946', 'ROUGE-L': '18.706', 'CIDER': '8.955'}