Skip to content

Conversation

tao12345666333
Copy link
Contributor

@tao12345666333 tao12345666333 commented Oct 11, 2025

What type of PR is this?

docs(config): add accuracy/latency/token-efficiency recipes and guide

What this PR does / why we need it:

Which issue(s) this PR fixes:

Fixes #86

Release Notes: Yes/No


AI disclosure: This PR was primarily authored with Warp using GPT-5 high and then hand-reviewed by me. I AM responsible for every change made in this PR. I aimed to keep it aligned with our goals, though I may have missed minor issues. Please flag anything that feels off, I'll fix it quickly.

Copy link

netlify bot commented Oct 11, 2025

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit c72422a
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/68eb02dcf194da00085b4341
😎 Deploy Preview https://deploy-preview-394--vllm-semantic-router.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@tao12345666333
Copy link
Contributor Author

Copy link

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 config

Owners: @rootfs
Files changed:

  • config/RECIPES.md
  • config/config.recipe-accuracy.yaml
  • config/config.recipe-latency.yaml
  • config/config.recipe-token-efficiency.yaml

📁 website

Owners: @Xunzhuo
Files changed:

  • website/docs/installation/configuration.md

vLLM

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@rootfs
Copy link
Collaborator

rootfs commented Oct 12, 2025

cc @Xunzhuo @JaredforReal the console can use these recipes for system prompt injection

@rootfs rootfs merged commit 35a4d58 into vllm-project:main Oct 12, 2025
7 of 8 checks passed
@tao12345666333 tao12345666333 deleted the feat/recipes-86 branch October 12, 2025 02:57
joyful-ii-V-I pushed a commit to joyful-ii-V-I/semantic-router that referenced this pull request Oct 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Router Config Recipes for Different Model Accuracy, Token Efficiency, and Latency Objectives.

3 participants