Skip to content

[retriever] add option to use multimodal vl embedder#1435

Merged
jperez999 merged 1 commit intoNVIDIA:mainfrom
edknv:edwardk/retriever-vlm-embed
Feb 25, 2026
Merged

[retriever] add option to use multimodal vl embedder#1435
jperez999 merged 1 commit intoNVIDIA:mainfrom
edknv:edwardk/retriever-vlm-embed

Conversation

@edknv
Copy link
Collaborator

@edknv edknv commented Feb 25, 2026

Description

  • Add a LlamaNemotronEmbedVL1BV2Embedder for the nvidia/llama-nemotron-embed-vl-1b-v2 model.
  • Only suppports text modality for now, so reject non-text modalities (image, text_image, image_text). Will follow on with support for non-text modalities.

Usage

python -m retriever.examples.batch_pipeline \
  /path/to/pdfs \
  --embed-model-name nvidia/llama-nemotron-embed-vl-1b-v2 \

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • If adjusting docker-compose.yaml environment variables have you ensured those are mimicked in the Helm values.yaml file.

@edknv edknv requested a review from a team as a code owner February 25, 2026 21:19
@jperez999 jperez999 merged commit 84d310b into NVIDIA:main Feb 25, 2026
6 of 11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants