feat: Media decoder and fetcher options in the MDC #4094

milesial · 2025-11-04T18:21:33Z

Overview:

Building on top of #3971, exposing decoder options and HTTP fetch options ini the MDC and python bindings.

from dynamo.llm import MediaFetcher
f = MediaFetcher()
f.user_agent("dynamo")
f.timeout_ms(15000)
f.allow_direct_ip(True)
f.allow_direct_port(False)
f.allowed_media_domains(["google.com"])

from dynamo.llm import MediaDecoder
d = MediaDecoder()
d.image_decoder({"max_image_width": 4096, "max_image_height": 4096, "max_alloc": 16*1024*1024})

register_llm(
  ...,
  media_decoder=d,
  media_fetcher=f,
)

507323054-8f7b5508-af4e-48c0-899e-609ef80b879c

Summary by CodeRabbit

New Features
- Python API now provides media decoder configuration for local model media processing.
- Media fetcher settings now configurable: user agent, IP and port restrictions, allowed domains, and retrieval timeout.

coderabbitai · 2025-11-05T02:32:11Z

Walkthrough

This PR integrates media preprocessing capabilities into the system. It introduces MediaDecoder and MediaFetcher classes in Python bindings, adds corresponding Rust wrapper types, propagates these fields through LocalModelBuilder and ModelDeploymentCard, and exposes them in the Python API surface.

Changes

Cohort / File(s)	Summary
Python PyO3 Bindings `lib/bindings/python/rust/lib.rs`, `lib/bindings/python/rust/llm/preprocessor.rs`	Introduces two new PyO3-wrapped classes (MediaDecoder and MediaFetcher) with constructors and setter methods, updates register_llm function to accept optional media_decoder and media_fetcher parameters, and propagates them into LocalModelBuilder.
Python API Exports `lib/bindings/python/src/dynamo/llm/__init__.py`	Adds MediaDecoder and MediaFetcher to public exports from dynamo._core.
Rust Model Infrastructure `lib/llm/src/local_model.rs`, `lib/llm/src/model_card.rs`	Extends LocalModelBuilder with media_decoder and media_fetcher optional fields, implements fluent builder methods, and propagates these fields into ModelDeploymentCard during model construction.
Rust Module Re-exports `lib/llm/src/preprocessor/media.rs`	Expands public re-exports to include MediaFetcher alongside existing MediaLoader.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

PyO3 binding correctness: Verify proper inner type wrapping and error handling in preprocessor.rs (especially image_decoder depythonization)
Field propagation: Confirm media_decoder and media_fetcher are consistently threaded through LocalModelBuilder → ModelDeploymentCard across both code paths
Clone implementations: Ensure cloned Option fields maintain correct behavior when propagated to the card
Signature compatibility: Validate that register_llm's new optional parameters correctly map to LocalModelBuilder methods

Poem

🐰 Media decoders hop into sight,
Fetchers bound with rabbits' delight,
Through Python and Rust, we weave them tight,
Builder methods make code take flight,
Cards now carry images, oh what a sight! 🖼️

Pre-merge checks

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately reflects the main change: adding media decoder and fetcher as configurable options in the ModelDeploymentCard (MDC) and exposing them through Python bindings.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between defe5de and 35cdad7.

⛔ Files ignored due to path filters (1)

lib/bindings/python/Cargo.lock is excluded by !**/*.lock

📒 Files selected for processing (6)

lib/bindings/python/rust/lib.rs (5 hunks)
lib/bindings/python/rust/llm/preprocessor.rs (2 hunks)
lib/bindings/python/src/dynamo/llm/__init__.py (1 hunks)
lib/llm/src/local_model.rs (7 hunks)
lib/llm/src/model_card.rs (3 hunks)
lib/llm/src/preprocessor/media.rs (1 hunks)

🧰 Additional context used

🧠 Learnings (2)

📚 Learning: 2025-06-24T20:59:35.725Z

Learnt from: ishandhanani
Repo: ai-dynamo/dynamo PR: 1626
File: lib/llm/src/preprocessor.rs:238-239
Timestamp: 2025-06-24T20:59:35.725Z
Learning: In lib/llm/src/preprocessor.rs, the `sampling_options` call in the `preprocess_request` method is placed in the common section after the match statement on `request.prompt_input_type()`, meaning it applies to both `PromptInput::Tokens` and `PromptInput::Text` request types.

Applied to files:

lib/bindings/python/rust/llm/preprocessor.rs

📚 Learning: 2025-09-02T16:46:54.015Z

Learnt from: GuanLuo
Repo: ai-dynamo/dynamo PR: 2714
File: lib/llm/src/discovery/model_entry.rs:38-42
Timestamp: 2025-09-02T16:46:54.015Z
Learning: In lib/llm/src/discovery/model_entry.rs, GuanLuo prefers not to add serde defaults for model_type and model_input fields to keep the specification explicit and avoid user errors, relying on atomic deployment strategy to avoid backward compatibility issues.

Applied to files:

lib/llm/src/model_card.rs
lib/llm/src/local_model.rs
lib/bindings/python/rust/lib.rs

🧬 Code graph analysis (3)

lib/bindings/python/src/dynamo/llm/__init__.py (1)

lib/bindings/python/rust/lib.rs (1)

_core (127-202)

lib/bindings/python/rust/llm/preprocessor.rs (2)

lib/bindings/python/rust/lib.rs (1)

pythonize (753-753)

lib/llm/src/local_model.rs (1)

default (61-86)

lib/bindings/python/rust/lib.rs (1)

lib/llm/src/local_model.rs (6)

runtime_config (172-175)

runtime_config (393-395)

user_data (177-180)

custom_template_path (152-155)

media_decoder (192-195)

media_fetcher (197-200)

🪛 GitHub Actions: Pre Merge Validation of (ai-dynamo/dynamo/refs/pull/4094/merge) by milesial.

lib/bindings/python/src/dynamo/llm/__init__.py

[error] 1-1: pre-commit isort hook failed: files were modified by this hook in lib/bindings/python/src/dynamo/llm/init.py. Command failed: pre-commit run --show-diff-on-failure --color=always --all-files

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (12)

GitHub Check: operator (amd64)
GitHub Check: vllm (amd64)
GitHub Check: operator (arm64)
GitHub Check: sglang (amd64)
GitHub Check: trtllm (amd64)
GitHub Check: trtllm (arm64)
GitHub Check: sglang (arm64)
GitHub Check: Mirror Repository to GitLab
GitHub Check: clippy (lib/bindings/python)
GitHub Check: clippy (launch/dynamo-run)
GitHub Check: clippy (.)
GitHub Check: Build and Test - dynamo

🔇 Additional comments (11)

lib/llm/src/preprocessor/media.rs (1)

10-10: LGTM!

The re-export correctly exposes MediaFetcher alongside MediaLoader, enabling its use in the Python bindings and model configuration.

lib/llm/src/model_card.rs (2)

221-227: LGTM!

The media configuration fields are properly declared with #[serde(default)] attributes, consistent with other optional configuration fields in the struct. Note that these fields are intentionally excluded from the mdcsum() calculation (similar to runtime_config and user_data), which appears correct since they represent operational configuration rather than model identity.

532-533: LGTM!

Initialization to None is correct and consistent with the pattern established for other optional fields in the from_repo_checkout method.

lib/bindings/python/rust/lib.rs (3)

163-164: LGTM!

The Python class registrations correctly expose MediaDecoder and MediaFetcher to the Python module, following the established pattern for other PyO3 classes.

221-221: LGTM!

The signature extension correctly adds the media configuration parameters as optional, maintaining backward compatibility.

Also applies to: 237-238

311-313: LGTM!

The propagation correctly extracts the inner Rust types from the Python wrappers using .map(|m| m.inner) and forwards them to the builder.

lib/bindings/python/rust/llm/preprocessor.rs (1)

80-102: LGTM!

The MediaDecoder Python wrapper correctly provides a default constructor and an image_decoder setter that converts Python dict to Rust type using depythonize, with appropriate error handling.

lib/llm/src/local_model.rs (4)

56-57: LGTM!

The new fields are properly declared and initialized with Default::default(), consistent with other optional configuration fields in the builder.

Also applies to: 83-84

192-200: LGTM!

The builder methods correctly follow the fluent builder pattern, accepting Option<T> and returning &mut Self for method chaining.

237-238: LGTM!

Initializing media components with defaults in mocker mode is appropriate for testing scenarios.

250-251: LGTM!

The propagation of media configuration to the ModelDeploymentCard is correctly implemented in both the path-less and path-ful branches, using .clone() to copy the Option values.

Also applies to: 302-303

lib/bindings/python/rust/llm/preprocessor.rs

lib/bindings/python/src/dynamo/llm/__init__.py

indrajit96

LGTM!

rmccorm4 · 2025-11-07T21:12:06Z

Trying to merge this to skip the flaky failing vllm check: #4188

rmccorm4 · 2025-11-07T22:37:43Z

Updating with main to pull in #4188 to skip flaky vllm test

rmccorm4 · 2025-11-08T01:26:20Z

Pulled in #4198

milesial requested review from a team as code owners November 4, 2025 18:21

pull-request-size bot added the size/L label Nov 4, 2025

milesial force-pushed the alexandrem/frontend-image-decoding-mdc branch from 2e99f5f to 4ab87ec Compare November 4, 2025 19:17

copy-pr-bot bot temporarily deployed to GITLAB November 4, 2025 19:17 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 4, 2025 19:27 Inactive

rmccorm4 requested a review from indrajit96 November 4, 2025 19:59

milesial force-pushed the alexandrem/frontend-image-decoding branch from b015a61 to 3c4aa12 Compare November 4, 2025 21:17

milesial force-pushed the alexandrem/frontend-image-decoding-mdc branch from 4ab87ec to 9bce789 Compare November 4, 2025 21:17

copy-pr-bot bot temporarily deployed to GITLAB November 4, 2025 21:17 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 4, 2025 21:22 Inactive

Base automatically changed from alexandrem/frontend-image-decoding to main November 4, 2025 22:35

pull-request-size bot added size/XL and removed size/L labels Nov 4, 2025

milesial requested review from KrishnanPrash, ayushag-nv, grahamking and krishung5 November 5, 2025 00:56

milesial force-pushed the alexandrem/frontend-image-decoding-mdc branch from 9bce789 to 35cdad7 Compare November 5, 2025 02:26

pull-request-size bot added size/L and removed size/XL labels Nov 5, 2025

copy-pr-bot bot temporarily deployed to GITLAB November 5, 2025 02:26 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 5, 2025 02:30 Inactive

coderabbitai bot reviewed Nov 5, 2025

View reviewed changes

lib/bindings/python/rust/llm/preprocessor.rs Show resolved Hide resolved

lib/bindings/python/src/dynamo/llm/__init__.py Outdated Show resolved Hide resolved

milesial force-pushed the alexandrem/frontend-image-decoding-mdc branch from 35cdad7 to 0f21ad8 Compare November 5, 2025 17:52

copy-pr-bot bot temporarily deployed to GITLAB November 5, 2025 17:52 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 5, 2025 17:54 Inactive

rmccorm4 added the frontend `python -m dynamo.frontend` and `dynamo-run in=http|text|grpc` label Nov 6, 2025

copy-pr-bot bot temporarily deployed to GITLAB November 6, 2025 19:59 Inactive

milesial disabled auto-merge November 6, 2025 22:13

milesial enabled auto-merge (squash) November 6, 2025 22:13

Merge branch 'main' into alexandrem/frontend-image-decoding-mdc

96c8578

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 00:15 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 00:16 Inactive

milesial added 2 commits November 6, 2025 16:54

Merge branch 'main' into alexandrem/frontend-image-decoding-mdc

95b3083

Merge branch 'main' into alexandrem/frontend-image-decoding-mdc

e2aeca3

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 01:50 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 01:51 Inactive

Merge branch 'main' into alexandrem/frontend-image-decoding-mdc

c8603aa

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 19:58 Inactive

indrajit96 approved these changes Nov 7, 2025

View reviewed changes

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 20:04 Inactive

Merge branch 'main' into alexandrem/frontend-image-decoding-mdc

47fc6e1

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 22:37 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 22:40 Inactive

Merge branch 'main' into alexandrem/frontend-image-decoding-mdc

be6a9f8

copy-pr-bot bot temporarily deployed to GITLAB November 8, 2025 00:28 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 8, 2025 00:29 Inactive

rmccorm4 mentioned this pull request Nov 8, 2025

ci: Skip broken etcd_ha tests until fixed to unblock unrelated PRs #4198

Merged

Merge branch 'main' into alexandrem/frontend-image-decoding-mdc

bfdc143

copy-pr-bot bot temporarily deployed to GITLAB November 8, 2025 01:26 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 8, 2025 01:27 Inactive

milesial merged commit 14af074 into main Nov 8, 2025
34 of 36 checks passed

milesial deleted the alexandrem/frontend-image-decoding-mdc branch November 8, 2025 02:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Media decoder and fetcher options in the MDC #4094

feat: Media decoder and fetcher options in the MDC #4094

milesial commented Nov 4, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Nov 5, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

indrajit96 left a comment

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

feat: Media decoder and fetcher options in the MDC #4094

feat: Media decoder and fetcher options in the MDC #4094

Conversation

milesial commented Nov 4, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview:

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Pre-merge checks

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

indrajit96 left a comment

Choose a reason for hiding this comment

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

milesial commented Nov 4, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 5, 2025 •

edited

Loading