docs: add NVIDIA Dynamo integration proposal #373

Xunzhuo · 2025-10-08T15:35:08Z

What type of PR is this?

docs: add NVIDIA Dynamo integration proposal

netlify · 2025-10-08T15:35:17Z

✅ Deploy Preview for vllm-semantic-router ready!

| Name | Link |
|------|------|
| 🔨 Latest commit | `5fabc12` |
| 🔍 Latest deploy log | https://app.netlify.com/projects/vllm-semantic-router/deploys/68e68ca6794cde0008672b95 |
| 😎 Deploy Preview | https://deploy-preview-373--vllm-semantic-router.netlify.app |

To edit notification comments on pull requests, go to your Netlify project configuration.

github-actions · 2025-10-08T15:35:19Z

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 `website`

Owners: @Xunzhuo
Files changed:

website/docs/proposals/nvidia-dynamo-integration.md
website/sidebars.ts

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

Signed-off-by: bitliu <[email protected]>

rootfs · 2025-10-08T16:17:02Z

website/docs/proposals/nvidia-dynamo-integration.md

+dynamics, competitive landscape, and stakeholder interests in your recommendations.
+```
+
+#### 2.2.2 Fusion Routing Strategy


Does this integration depend on the prompt classification improvement, or can it be continued by that work?

Xunzhuo · 2025-10-09

No, I don't think it is a blocker here.

rootfs · 2025-10-08T16:17:41Z

website/docs/proposals/nvidia-dynamo-integration.md

+
+Semantic Router implements a **multi-signal fusion routing** approach that combines three complementary routing methods (as detailed in the [Prompt Classification Routing proposal](./prompt-classification-routing.md)):
+
+**1. Keyword-Based Routing (Fast Path)**


This looks like a good candidate for subtasks in the integration.
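To make the "fast path" idea in the quoted hunk concrete for a possible subtask, here is a minimal sketch of a keyword check that runs before any slower routing signal. The rule table, the category names, and the `keyword_route` / `route` helpers are hypothetical illustrations, not taken from the proposal or from the Semantic Router code base; the slower signal is left as an opaque callable.

```python
import re
from typing import Callable, Dict, List, Optional

# Hypothetical keyword rules (category -> keywords); not from the proposal.
KEYWORD_RULES: Dict[str, List[str]] = {
    "math": ["integral", "derivative", "equation"],
    "code": ["python", "stack trace", "compile error"],
}


def keyword_route(prompt: str, rules: Dict[str, List[str]] = KEYWORD_RULES) -> Optional[str]:
    """Return the first category with a whole-word keyword match, else None."""
    text = prompt.lower()
    for category, keywords in rules.items():
        for kw in keywords:
            # \b anchors keep "equation" from matching inside a longer word.
            if re.search(r"\b" + re.escape(kw) + r"\b", text):
                return category
    return None


def route(prompt: str, slow_path: Callable[[str], str]) -> str:
    """Try the keyword fast path; call the slower signal only on a miss."""
    category = keyword_route(prompt)
    return category if category is not None else slow_path(prompt)
```

A caller would pass whatever slower classifier it has as `slow_path`, for example `route(prompt, lambda p: "general")` while prototyping.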

rootfs · 2025-10-08T16:19:11Z

website/docs/proposals/nvidia-dynamo-integration.md

+| Dimension | Semantic Router Alone | Dynamo Router Alone | **Integrated System** |
+|-----------|----------------------|---------------------|----------------------|
+| **Model Selection** | ✅ Semantic accuracy (14 categories) | ❌ No model awareness | ✅ Best model for task |
+| **Worker Selection** | ❌ No worker awareness | ✅ KV cache optimization | ✅ Optimal worker for model |


I think #227 can help with both model and worker selection.

rootfs · 2025-10-08T16:19:52Z

website/docs/proposals/nvidia-dynamo-integration.md

+|-----------|----------------------|---------------------|----------------------|
+| **Model Selection** | ✅ Semantic accuracy (14 categories) | ❌ No model awareness | ✅ Best model for task |
+| **Worker Selection** | ❌ No worker awareness | ✅ KV cache optimization | ✅ Optimal worker for model |
+| **Prompt Engineering** | ✅ Domain-aware system prompts | ❌ No prompt optimization | ✅ Optimized CoT & MoE matching |


System prompt injection could potentially impact the prefix cache; we should monitor that as well.
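A toy model of why this matters, assuming a block-based prefix cache in which each block's hash chains over everything before it. The block size, the hashing scheme, the whitespace tokenizer, and the example prompts are all illustrative assumptions; this is not how Dynamo or vLLM is actually implemented, only a sketch of the effect worth monitoring.

```python
import hashlib

BLOCK_SIZE = 4  # hypothetical block size, in tokens


def block_hashes(tokens, block_size=BLOCK_SIZE):
    """Chain-hash each full block so a hash covers the whole prefix up to it."""
    hashes = []
    parent = b""
    for start in range(0, len(tokens) - block_size + 1, block_size):
        block = tokens[start:start + block_size]
        digest = hashlib.sha256(parent + "\x1f".join(block).encode()).digest()
        hashes.append(digest)
        parent = digest
    return hashes


def shared_prefix_blocks(a, b):
    """Count leading blocks with equal hashes, i.e. blocks a cache could reuse."""
    count = 0
    for x, y in zip(block_hashes(a), block_hashes(b)):
        if x != y:
            break
        count += 1
    return count


def build_prompt(system_prompt, user_prompt):
    # Toy whitespace tokenizer; a real system would use the model's tokenizer.
    return system_prompt.split() + user_prompt.split()


# Hypothetical domain-specific system prompts and user prompts.
MATH_SYS = "You are a careful math tutor who shows every step"
BIZ_SYS = "You are a senior business consultant who weighs market dynamics"
USER_A = "Explain how interest rates affect bond prices in simple terms"
USER_B = "Summarize how interest rates affect bond prices in simple terms"

# Same user prompt, different injected system prompts: no reusable blocks.
different_domains = shared_prefix_blocks(
    build_prompt(MATH_SYS, USER_A), build_prompt(BIZ_SYS, USER_A)
)
# Different user prompts, same injected system prompt: the system blocks are reusable.
same_domain = shared_prefix_blocks(
    build_prompt(BIZ_SYS, USER_A), build_prompt(BIZ_SYS, USER_B)
)
```

In this toy setup `different_domains` is 0 and `same_domain` is 2: injection destroys reuse across categories but creates a shared prefix within one category, so the net effect on cache hit rate depends on the traffic mix, which is why it seems worth tracking.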

rootfs

LGTM. I left some ideas for GitHub issues.

github-actions bot assigned Xunzhuo Oct 8, 2025

Xunzhuo force-pushed the docs/nvidia-dynamo-integration-proposal branch 2 times, most recently from 0364191 to 3761959 on October 8, 2025 15:58

docs: add NVIDIA Dynamo integration proposal

5fabc12

Signed-off-by: bitliu <[email protected]>

Xunzhuo force-pushed the docs/nvidia-dynamo-integration-proposal branch from be24cd5 to 5fabc12 on October 8, 2025 16:09

rootfs reviewed Oct 8, 2025

rootfs approved these changes Oct 8, 2025

rootfs merged commit ee7ca36 into main Oct 8, 2025
9 checks passed
