1 parent da9271f commit 11cde8d
packages/tasks/src/tasks/image-text-to-text/about.md
@@ -91,3 +91,4 @@ curl https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.2-11B-
- [SmolVLM - small yet mighty Vision Language Model](https://huggingface.co/blog/smolvlm)
- [Multimodal RAG using ColPali and Qwen2-VL](https://github.com/merveenoyan/smol-vision/blob/main/ColPali_%2B_Qwen2_VL.ipynb)
- [Preference Optimization for Vision Language Models with TRL](https://huggingface.co/blog/dpo_vlm)
+- [Image-text-to-text task guide](https://huggingface.co/docs/transformers/tasks/image_text_to_text)