feat: support /v1/models in direct response #283
Conversation
@Xunzhuo can you review this solution too?
This needs additional xDS config generation, which is not standard for InferencePool implementations, so we use a direct response instead. Otherwise, every Gateway implementation would need to inject one more cluster to support our models API.
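To illustrate the direct-response approach described above, here is a minimal sketch of building an OpenAI-compatible `/v1/models` response body that a gateway could return directly, without routing to an upstream cluster. The function and type names (`modelsResponseBody`, `Model`, `ModelList`) and the `owned_by` value are hypothetical, not taken from this PR's actual implementation.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Model mirrors one entry in the OpenAI /v1/models list format.
type Model struct {
	ID      string `json:"id"`
	Object  string `json:"object"`
	Created int64  `json:"created"`
	OwnedBy string `json:"owned_by"`
}

// ModelList is the top-level /v1/models response shape.
type ModelList struct {
	Object string  `json:"object"`
	Data   []Model `json:"data"`
}

// modelsResponseBody builds the JSON body the gateway can serve as a
// direct response to GET /v1/models, listing the configured model names.
func modelsResponseBody(names []string, created int64) (string, error) {
	list := ModelList{Object: "list", Data: []Model{}}
	for _, n := range names {
		list.Data = append(list.Data, Model{
			ID:      n,
			Object:  "model",
			Created: created,
			OwnedBy: "vllm-semantic-router", // hypothetical owner label
		})
	}
	b, err := json.Marshal(list)
	return string(b), err
}

func main() {
	body, err := modelsResponseBody([]string{"auto"}, 0)
	if err != nil {
		panic(err)
	}
	fmt.Println(body)
}
```

Serving this body directly (for example, via an ext_proc immediate response or an Envoy `direct_response` route) avoids requiring each Gateway implementation to generate an extra xDS cluster just for the models endpoint.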
@Xunzhuo sounds good. Can you run pre-commit? It is ready to go.
Sounds good to me |
Tested; moving this forward.
Signed-off-by: bitliu <[email protected]> Signed-off-by: liuhy <[email protected]>

What type of PR is this?
feat: support /v1/models in direct response