Im studying with global context of images in VL model So I just want to see some results via your model Can you share some checkpoints? It will grateful for me thanks for your grate work