Hi! Thank you for this stellar work.
I was wondering whether it would be possible to pass pre-computed image embeddings (image hidden states) to Gemma3's forward pass, so that the vision tower is bypassed.
Is this currently possible? If not, is it a feature that could be implemented?
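To make the request concrete, here is a minimal sketch of the behaviour I have in mind, written in plain Python with no dependencies. It is not Gemma3's API: `merge_image_embeddings`, `IMAGE_TOKEN_ID`, and the toy 2-dimensional embeddings are all hypothetical names and values chosen for illustration. It assumes the model splices image features into the token embedding sequence at image placeholder positions, which is my understanding of how multimodal models of this kind usually work.

```python
# Hypothetical sketch only: none of these names come from Gemma3 or any library.
IMAGE_TOKEN_ID = -1  # assumed placeholder id marking an image slot in input_ids


def merge_image_embeddings(input_ids, text_embeds, image_embeds,
                           image_token_id=IMAGE_TOKEN_ID):
    """Return a copy of text_embeds where each row at an image placeholder
    position is replaced, in order, by a row of the pre-computed image_embeds."""
    n_slots = sum(1 for tok in input_ids if tok == image_token_id)
    if n_slots != len(image_embeds):
        raise ValueError(
            f"{n_slots} image placeholder tokens but {len(image_embeds)} image embeddings"
        )
    image_rows = iter(image_embeds)
    merged = []
    for tok, emb in zip(input_ids, text_embeds):
        # The vision tower is never called: image rows come from the caller.
        merged.append(list(next(image_rows)) if tok == image_token_id else list(emb))
    return merged


# Toy usage: two text tokens surrounding two image placeholder tokens.
input_ids = [5, IMAGE_TOKEN_ID, IMAGE_TOKEN_ID, 7]
text_embeds = [[0.1, 0.2], [0.0, 0.0], [0.0, 0.0], [0.3, 0.4]]
image_embeds = [[1.0, 1.0], [2.0, 2.0]]  # pre-computed elsewhere, e.g. cached
merged = merge_image_embeddings(input_ids, text_embeds, image_embeds)
```

In a real implementation the merged sequence would then go to the language model in place of the usual embedding lookup, so the image encoder could be run once and its outputs cached and reused.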
Thank you!