diff --git a/apix/v1alpha2/inferenceobjective_types.go b/apix/v1alpha2/inferenceobjective_types.go index 07f2c4d7a..53869d9ba 100644 --- a/apix/v1alpha2/inferenceobjective_types.go +++ b/apix/v1alpha2/inferenceobjective_types.go @@ -69,7 +69,7 @@ type InferenceObjectiveSpec struct { // The higher the value, the more critical the request is; negative values _are_ allowed. // No default value is set for this field, allowing for future additions of new fields that may 'one of' with this field. // However, implementations that consume this field (such as the Endpoint Picker) will treat an unset value as '0'. - // Priority is used in flow control, primarily in the event of resource scarcity(reqeusts need to be queued). + // Priority is used in flow control, primarily in the event of resource scarcity(requests need to be queued). // All requests will be queued, and flow control will _always_ allow requests of higher priority to be served first. // Fairness is only enforced and tracked between requests of the same priority. // diff --git a/config/crd/bases/inference.networking.x-k8s.io_inferenceobjectives.yaml b/config/crd/bases/inference.networking.x-k8s.io_inferenceobjectives.yaml index 7f0968f98..27ad4e545 100644 --- a/config/crd/bases/inference.networking.x-k8s.io_inferenceobjectives.yaml +++ b/config/crd/bases/inference.networking.x-k8s.io_inferenceobjectives.yaml @@ -100,7 +100,7 @@ spec: The higher the value, the more critical the request is; negative values _are_ allowed. No default value is set for this field, allowing for future additions of new fields that may 'one of' with this field. However, implementations that consume this field (such as the Endpoint Picker) will treat an unset value as '0'. - Priority is used in flow control, primarily in the event of resource scarcity(reqeusts need to be queued). + Priority is used in flow control, primarily in the event of resource scarcity(requests need to be queued). All requests will be queued, and flow control will _always_ allow requests of higher priority to be served first. Fairness is only enforced and tracked between requests of the same priority. diff --git a/site-src/reference/x-spec.md b/site-src/reference/x-spec.md index bb91323db..9151dcda6 100644 --- a/site-src/reference/x-spec.md +++ b/site-src/reference/x-spec.md @@ -136,7 +136,7 @@ _Appears in:_ | Field | Description | Default | Validation | | --- | --- | --- | --- | -| `priority` _integer_ | Priority defines how important it is to serve the request compared to other requests in the same pool.
Priority is an integer value that defines the priority of the request.
The higher the value, the more critical the request is; negative values _are_ allowed.
No default value is set for this field, allowing for future additions of new fields that may 'one of' with this field.
However, implementations that consume this field (such as the Endpoint Picker) will treat an unset value as '0'.
Priority is used in flow control, primarily in the event of resource scarcity(reqeusts need to be queued).
All requests will be queued, and flow control will _always_ allow requests of higher priority to be served first.
Fairness is only enforced and tracked between requests of the same priority.
Example: requests with Priority 10 will always be served before
requests with Priority of 0 (the value used if Priority is unset or no InfereneceObjective is specified).
Similarly requests with a Priority of -10 will always be served after requests with Priority of 0. | | | +| `priority` _integer_ | Priority defines how important it is to serve the request compared to other requests in the same pool.
Priority is an integer value that defines the priority of the request.
The higher the value, the more critical the request is; negative values _are_ allowed.
No default value is set for this field, allowing for future additions of new fields that may 'one of' with this field.
However, implementations that consume this field (such as the Endpoint Picker) will treat an unset value as '0'.
Priority is used in flow control, primarily in the event of resource scarcity(requests need to be queued).
All requests will be queued, and flow control will _always_ allow requests of higher priority to be served first.
Fairness is only enforced and tracked between requests of the same priority.
Example: requests with Priority 10 will always be served before
requests with Priority of 0 (the value used if Priority is unset or no InfereneceObjective is specified).
Similarly requests with a Priority of -10 will always be served after requests with Priority of 0. | | | | `poolRef` _[PoolObjectReference](#poolobjectreference)_ | PoolRef is a reference to the inference pool, the pool must exist in the same namespace. | | Required: \{\}
|