hi, table 1 the text-image retrieval results is finetune and test in the same dataset in the paper? have you try train on MSCOCO and test on Flickr?