@IlyasMoutawwakil IlyasMoutawwakil commented Nov 19, 2024

This is a bit more complicated than other tasks because the inputs are not the same across model types (and backends). For model types such as blip, blip2, and git, the inputs are the usual input_ids and pixel_values. They are more complex for model types such as idefics, idefics2, and qwen2_vl, because those models handle multiple images in the same prompt with interleaved text and image tokens.

  • idefics
  • idefics2
  • qwen2_vl
  • generic image-text-to-text (blip, blip2, git)
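
To illustrate why the inputs need per-model handling, here is a minimal sketch of a dispatch on model type. It is not the code merged in this PR: the function names, the registry, the default sizes, and the extra image axis for idefics and idefics2 are all assumptions made for illustration. qwen2_vl is left out on purpose, since it would likely need its own entry.

```python
# Hypothetical sketch: all names, defaults, and shapes below are illustrative
# assumptions, not the implementation from this PR.
from typing import Callable, Dict, Tuple

Shape = Tuple[int, ...]


def generic_input_shapes(batch_size: int, sequence_length: int) -> Dict[str, Shape]:
    """Assumed shapes for single-image models (e.g. blip, blip2, git)."""
    return {
        "input_ids": (batch_size, sequence_length),
        "pixel_values": (batch_size, 3, 224, 224),  # (batch, channels, height, width)
    }


def multi_image_input_shapes(batch_size: int, sequence_length: int) -> Dict[str, Shape]:
    """Assumed shapes for models that accept several images per prompt."""
    num_images = 2  # assumed number of images interleaved in each prompt
    return {
        "input_ids": (batch_size, sequence_length),
        # Extra axis for the images of one prompt (assumption for this sketch).
        "pixel_values": (batch_size, num_images, 3, 224, 224),
    }


# Hypothetical registry: model types that need a custom generator.
# qwen2_vl would need its own entry, which is not sketched here.
CUSTOM_INPUT_SHAPES: Dict[str, Callable[[int, int], Dict[str, Shape]]] = {
    "idefics": multi_image_input_shapes,
    "idefics2": multi_image_input_shapes,
}


def input_shapes_for(model_type: str, batch_size: int = 1, sequence_length: int = 16) -> Dict[str, Shape]:
    """Pick the custom generator if one is registered, else the generic one."""
    generator = CUSTOM_INPUT_SHAPES.get(model_type, generic_input_shapes)
    return generator(batch_size, sequence_length)


if __name__ == "__main__":
    for model_type in ("blip", "idefics2"):
        print(model_type, input_shapes_for(model_type, batch_size=2))
```

Keeping the generic generator as the fallback means a new model type only needs a registry entry when its inputs differ from input_ids and pixel_values.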

@IlyasMoutawwakil IlyasMoutawwakil changed the title from "Text Image To Image Support" to "Image Text To Text Support" Nov 19, 2024
@IlyasMoutawwakil IlyasMoutawwakil added pytorch [CI] Requires and enables running all PyTorch tests cli_cpu_pytorch cpu [CI] Requires and enables running all CPU tests and removed pytorch [CI] Requires and enables running all PyTorch tests cli_cpu_pytorch labels Nov 20, 2024
@IlyasMoutawwakil IlyasMoutawwakil added cuda [CI] Requires and enables running all CUDA tests misc [CI] Requires and enables running all basic utility tests across multiple platforms labels Nov 21, 2024
@IlyasMoutawwakil
Member Author

Tests run on CPU and pass for multiple architectures. If any custom input is needed, don't hesitate to open an issue or PR.
The failing tests are unrelated.

@IlyasMoutawwakil IlyasMoutawwakil merged commit 31aa662 into main Nov 22, 2024
57 of 84 checks passed
@IlyasMoutawwakil IlyasMoutawwakil deleted the image-text-to-text branch August 19, 2025 22:27