fix: update response model field to match routing decision
The response JSON model field now correctly reflects the semantic router's
decision instead of using the model name from the vLLM endpoint.
Changes:
- Parse response JSON and update model field to ctx.RequestModel
- Re-marshal modified response for cache and client
- Only modify non-streaming responses
- Fall back to the original response on marshal errors
This ensures API consumers can determine which model was selected by
examining the standard model field, rather than requiring custom headers
or log inspection.
Fixes #430
Co-Authored-By: Claude <[email protected]>
Signed-off-by: Yossi Ovadia <[email protected]>