feat(ai-proxy-multi): support client-driven model selection via `models` request body field #13084
kjprice wants to merge 1 commit into apache:master
Conversation
…s request body field

When `allow_client_model_preference` is enabled, clients can include a `models` array in the request body to specify preferred model ordering. Each element can be a model name string or an object with `provider` and `model` fields. The plugin matches entries against configured instances and reorders instance selection accordingly.

Resolves apache#13083
Pull request overview
This PR adds an opt-in feature to `ai-proxy-multi` that allows clients to influence AI instance selection by providing a `models` array in the JSON request body, enabling per-client model/provider preference ordering while keeping instance configuration and auth server-controlled.
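For illustration, a client request using both accepted forms of the `models` field might look like this (model and provider names are examples only):

```json
{
  "messages": [
    {"role": "user", "content": "Hello"}
  ],
  "models": [
    "gpt-4",
    {"provider": "deepseek", "model": "deepseek-chat"}
  ]
}
```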
Changes:
- Adds `allow_client_model_preference` (default `false`) to the `ai-proxy-multi` plugin schema.
- Implements client-driven instance reordering and preference-aware retry/fallback behavior in `ai-proxy-multi`.
- Documents the new configuration and request format, and adds a dedicated test suite for the feature.
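For context, a sketch of a route snippet enabling the feature; the instance names, fields, and values here are illustrative placeholders, not taken from this PR:

```json
{
  "plugins": {
    "ai-proxy-multi": {
      "allow_client_model_preference": true,
      "instances": [
        {"name": "openai-gpt4", "provider": "openai", "weight": 1,
         "options": {"model": "gpt-4"}},
        {"name": "deepseek", "provider": "deepseek", "weight": 1,
         "options": {"model": "deepseek-chat"}}
      ]
    }
  }
}
```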
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| `apisix/plugins/ai-proxy/schema.lua` | Adds the `allow_client_model_preference` schema field to opt into the feature. |
| `apisix/plugins/ai-proxy-multi.lua` | Implements request-body parsing for `models`, preferred-instance picking, and retry handling. |
| `docs/en/latest/plugins/ai-proxy-multi.md` | Documents the new attribute and request `models` field with examples. |
| `t/plugin/ai-proxy-multi.client-model-preference.t` | Adds test coverage for schema validation, ordering, ignore behavior, and stripping of `models`. |
    local function match_client_models(instances, models)
        local ordered_names = {}
        local matched = {}

        for _, model_pref in ipairs(models) do
            local target_model, target_provider
            if type(model_pref) == "string" then
`match_client_models()` assumes `models` is an array and iterates it with `ipairs(models)`. If a client sends a non-array JSON value (e.g. `"models": "gpt-4"`), `ipairs` will raise an error and the request will 500.
Suggestion: validate that `request_body.models` is a table/array before calling `match_client_models` (and/or harden `match_client_models` to return the default ordering when `type(models) ~= "table"`). Also consider skipping elements that are not a string or an object.
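A minimal caller-side sketch of that guard, assuming the surrounding `access()` code from this diff:

```lua
-- Sketch: only treat models as a client preference when it decodes to
-- a table (JSON array); any other type keeps the default server-driven
-- ordering. Skipping non-string/non-object elements would live inside
-- match_client_models itself.
if request_body and not decode_err
   and type(request_body.models) == "table" then
    ctx.client_model_preference = match_client_models(
        conf.instances, request_body.models)
end
```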
    local name, ai_instance, err = pick_preferred_instance(ctx, conf)
    if err then
        return 503, err
    end
    ctx.picked_ai_instance_name = name
    ctx.picked_ai_instance = ai_instance
    ctx.balancer_ip = name
    ctx.bypass_nginx_upstream = true
    return
When client preference is used, instance selection bypasses `pick_target()` and therefore bypasses the existing active health-check filtering done via `fetch_health_instances()`/`healthcheck_manager`. This means a client can force the plugin to try an instance that health checks currently mark as down, changing behavior compared to the server-driven picker.
Suggestion: apply the same health-check availability filtering to the preferred list (e.g. build an allowlist of healthy instance names before iterating, or reuse the existing picker/checker status logic) so client-driven ordering doesn't ignore configured health checks.
Suggested change (replaces the block above):

    -- apply health-check availability filtering to the preferred list
    local healthy_instances, health_err = fetch_health_instances(conf, ctx)
    local use_client_preference = true
    if not healthy_instances then
        core.log.warn("failed to fetch healthy instances for client model preference: ",
                      health_err, ", falling back to balancer selection")
        use_client_preference = false
    else
        local healthy_set = {}
        for _, inst in ipairs(healthy_instances) do
            if inst.name then
                healthy_set[inst.name] = true
            end
        end
        local filtered_preference = {}
        for _, pref in ipairs(ctx.client_model_preference) do
            if pref.instance_name and healthy_set[pref.instance_name] then
                filtered_preference[#filtered_preference + 1] = pref
            end
        end
        if #filtered_preference == 0 then
            core.log.warn("no healthy instances match client model preference; ",
                          "falling back to balancer selection")
            use_client_preference = false
        else
            ctx.client_model_preference = filtered_preference
        end
    end
    if use_client_preference then
        local name, ai_instance, err = pick_preferred_instance(ctx, conf)
        if err then
            return 503, err
        end
        ctx.picked_ai_instance_name = name
        ctx.picked_ai_instance = ai_instance
        ctx.balancer_ip = name
        ctx.bypass_nginx_upstream = true
        return
    end
    for _, instance in ipairs(instances) do
        if not matched[instance.name] then
            core.table.insert(ordered_names, instance.name)
        end
    end
The fallback order appended for instances not listed by the client is based on the raw `conf.instances` array order. However, the server-driven picker uses priority-based ordering (see `priority_balancer` sorting by `instance.priority`). If `conf.instances` is not already sorted, this can cause lower-priority instances to be tried before higher-priority ones when the client omits them.
Suggestion: when appending unmatched instances, order them consistently with the server-side priority behavior (e.g. sort unmatched instances by `instance.priority` descending, keeping a stable order within the same priority).
Suggested change (replaces the loop above):

    local unmatched = {}
    for idx, instance in ipairs(instances) do
        if not matched[instance.name] then
            core.table.insert(unmatched, {
                name = instance.name,
                priority = instance.priority or 0,
                index = idx,
            })
        end
    end
    table.sort(unmatched, function(a, b)
        if a.priority == b.priority then
            return a.index < b.index
        end
        return a.priority > b.priority
    end)
    for _, item in ipairs(unmatched) do
        core.table.insert(ordered_names, item.name)
    end
    local function retry_on_error(ctx, conf, code)
        if ctx.client_model_preference then
            ctx.client_model_tried = ctx.client_model_tried or {}
            ctx.client_model_tried[ctx.picked_ai_instance_name] = true
            if (code == 429 and fallback_strategy_has(conf.fallback_strategy, "http_429")) or
               (code >= 500 and code < 600 and
                fallback_strategy_has(conf.fallback_strategy, "http_5xx")) then
                local name, ai_instance, err = pick_preferred_instance(ctx, conf)
                if err then
                    core.log.error("all preferred instances failed: ", err)
                    return 502
                end
                ctx.balancer_ip = name
                ctx.picked_ai_instance_name = name
                ctx.picked_ai_instance = ai_instance
                return
            end
            return code
        end
The new client-preference retry path in `retry_on_error()` (falling back through `ctx.client_model_preference` on 429/5xx) isn't covered by the added tests. The current test cases validate selection/reordering and stripping, but not that a 429/5xx from the preferred instance causes the plugin to retry the next preferred instance (and that it stops retrying on non-matching status codes).
Suggestion: add a test that makes the first preferred instance return 429/5xx and asserts that the response comes from the next preferred instance (similar to the existing fallback tests in `ai-proxy-multi.balancer.t`).
    if conf.allow_client_model_preference then
        local body, err = core.request.get_body()
        if body then
            local request_body, decode_err = core.json.decode(body)
            if request_body and not decode_err and request_body.models then
                ctx.client_model_preference = match_client_models(
                    conf.instances, request_body.models)
                core.log.info("client model preference: ",
                              core.json.delay_encode(ctx.client_model_preference))
                request_body.models = nil
                ngx.req.set_body_data(core.json.encode(request_body))
            end
        end
    end
The `models` field is only stripped when `allow_client_model_preference` is enabled. This contradicts the PR description/docs (and the new tests), which state that `models` is always stripped before forwarding upstream. As-is, requests sent with `models` while the feature is disabled will forward an extra `models` field to upstream providers.
Suggestion: always remove `models` from the JSON request body when present, but only apply client-driven reordering when `allow_client_model_preference` is true (i.e., strip unconditionally; reorder conditionally).
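One way to express that split, sketched against the diff above (not the final implementation):

```lua
-- Sketch: strip models from the JSON body whenever it is present,
-- but only use it for reordering when the feature is enabled.
local body, err = core.request.get_body()
if body then
    local request_body, decode_err = core.json.decode(body)
    if request_body and not decode_err and request_body.models ~= nil then
        if conf.allow_client_model_preference then
            ctx.client_model_preference = match_client_models(
                conf.instances, request_body.models)
        end
        request_body.models = nil
        ngx.req.set_body_data(core.json.encode(request_body))
    end
end
```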
Hi @kjprice, please fix the failed CI.
Description

When `allow_client_model_preference` is enabled on the plugin config, clients can include a `models` array in the request body to specify their preferred model/instance ordering. This enables multiple teams sharing a single gateway to express different model preferences without requiring separate routes.

Changes

Schema (`apisix/plugins/ai-proxy/schema.lua`):
- adds an `allow_client_model_preference` boolean field (default: `false`) to `ai_proxy_multi_schema`

Plugin logic (`apisix/plugins/ai-proxy-multi.lua`):
- `match_client_models()` — matches client `models` entries against configured instances by model name and, optionally, provider
- `pick_preferred_instance()` — sequential picker that respects client ordering with rate-limiting awareness
- `access()` — reads the request body, extracts `models`, reorders instances, and strips `models` before forwarding
- `retry_on_error()` — falls back through the client-preferred order on HTTP 429/5xx

Request body

The `models` field supports:
- model name strings: `["gpt-4", "deepseek-chat"]`
- provider/model objects: `[{"provider": "openai", "model": "gpt-4"}]`

Behavior:
- the `models` field is always stripped before forwarding upstream
- when the feature is disabled, the `models` field is ignored — fully backward compatible

Docs (`docs/en/latest/plugins/ai-proxy-multi.md`):
- adds `allow_client_model_preference` to the attributes table
- adds `models` to the request format table

Tests (`t/plugin/ai-proxy-multi.client-model-preference.t`):
- schema validation of the `models` field
- `models` field stripped from the forwarded request
- `models` ignored when `allow_client_model_preference` is false

Resolves #13083