2 changes: 1 addition & 1 deletion config/charts/body-based-routing/values.yaml
@@ -4,7 +4,7 @@ bbr:
image:
name: bbr
hub: registry.k8s.io/gateway-api-inference-extension
- tag: v1.0.1
+ tag: v1.0.2
pullPolicy: Always
port: 9004
healthCheckPort: 9005
2 changes: 1 addition & 1 deletion config/charts/inferencepool/values.yaml
@@ -3,7 +3,7 @@ inferenceExtension:
image:
name: epp
hub: registry.k8s.io/gateway-api-inference-extension
- tag: v1.0.1
+ tag: v1.0.2
pullPolicy: Always
extProcPort: 9002
env: []
@@ -3,7 +3,7 @@ kind: CustomResourceDefinition
metadata:
annotations:
api-approved.kubernetes.io: https://github.com/kubernetes-sigs/gateway-api-inference-extension/pull/1173
- inference.networking.k8s.io/bundle-version: v1.0.1
+ inference.networking.k8s.io/bundle-version: v1.0.2
name: inferencepools.inference.networking.k8s.io
spec:
group: inference.networking.k8s.io
@@ -2,7 +2,7 @@ apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
annotations:
- inference.networking.k8s.io/bundle-version: v1.0.1
+ inference.networking.k8s.io/bundle-version: v1.0.2
name: inferenceobjectives.inference.networking.x-k8s.io
spec:
group: inference.networking.x-k8s.io
@@ -3,7 +3,7 @@ kind: CustomResourceDefinition
metadata:
annotations:
api-approved.kubernetes.io: unapproved, experimental-only
- inference.networking.k8s.io/bundle-version: v1.0.1
+ inference.networking.k8s.io/bundle-version: v1.0.2
name: inferencepools.inference.networking.x-k8s.io
spec:
group: inference.networking.x-k8s.io
2 changes: 1 addition & 1 deletion site-src/reference/spec.md
@@ -217,7 +217,7 @@ _Appears in:_

| Field | Description | Default | Validation |
| --- | --- | --- | --- |
- | `matchLabels` _object (keys:[LabelKey](#labelkey), values:[LabelValue](#labelvalue))_ | MatchLabels contains a set of required \{key,value\} pairs.<br />An object must match every label in this map to be selected.<br />The matching logic is an AND operation on all entries. | | MaxItems: 64 <br />MinItems: 1 <br /> |
+ | `matchLabels` _object (keys:[LabelKey](#labelkey), values:[LabelValue](#labelvalue))_ | MatchLabels contains a set of required \{key,value\} pairs.<br />An object must match every label in this map to be selected.<br />The matching logic is an AND operation on all entries. | | MaxProperties: 64 <br />MinProperties: 1 <br /> |
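
The `matchLabels` row above describes AND semantics over a map constrained to 1 to 64 entries. The following is a minimal Go sketch of that behaviour, not the project's implementation: the helper names `matchesAll` and `validateMatchLabels` and the example labels (`app`, `tier`, `zone`) are hypothetical, and the size check only mirrors the `MinProperties`/`MaxProperties` bounds listed in the table.

```go
package main

import "fmt"

// matchesAll reports whether labels contains every key/value pair in
// matchLabels. This is the AND operation described in the field docs:
// a single missing or differing entry rejects the object.
// (Hypothetical helper, not part of the project's API.)
func matchesAll(matchLabels, labels map[string]string) bool {
	for key, want := range matchLabels {
		if got, ok := labels[key]; !ok || got != want {
			return false
		}
	}
	return true
}

// validateMatchLabels mirrors the documented bounds on the map itself:
// MinProperties: 1 and MaxProperties: 64. (Hypothetical helper.)
func validateMatchLabels(matchLabels map[string]string) error {
	if n := len(matchLabels); n < 1 || n > 64 {
		return fmt.Errorf("matchLabels must have between 1 and 64 entries, got %d", n)
	}
	return nil
}

func main() {
	// Example labels are assumptions made for illustration only.
	selector := map[string]string{"app": "vllm", "tier": "gpu"}
	pod := map[string]string{"app": "vllm", "tier": "gpu", "zone": "a"}

	fmt.Println(validateMatchLabels(selector) == nil) // selector size is within bounds
	fmt.Println(matchesAll(selector, pod))            // extra labels on the object are ignored
}
```

Because the field is a map, the bounds count properties rather than list items, which is consistent with the validation column changing from `MaxItems`/`MinItems` to `MaxProperties`/`MinProperties`.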


#### LabelValue
6 changes: 0 additions & 6 deletions site-src/reference/x-spec.md
@@ -123,12 +123,6 @@ performance and latency goals for the model. These workloads are
expected to operate within an InferencePool sharing compute capacity with other
InferenceObjectives, defined by the Inference Platform Admin.

- InferenceObjective's modelName (not the ObjectMeta name) is unique for a given InferencePool,
- if the name is reused, an error will be shown on the status of a
- InferenceObjective that attempted to reuse. The oldest InferenceObjective, based on
- creation timestamp, will be selected to remain valid. In the event of a race
- condition, one will be selected at random.



_Appears in:_
2 changes: 1 addition & 1 deletion version/version.go
@@ -30,5 +30,5 @@ const (
BundleVersionAnnotation = "inference.networking.k8s.io/bundle-version"

// BundleVersion is the value used for labeling the version of the gateway-api-inference-extension.
- BundleVersion = "v1.0.1"
+ BundleVersion = "v1.0.2"
)
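
The CRD manifests and `version/version.go` in this change carry the same bundle version string, so one plausible use of the two constants is to detect CRDs installed from an older bundle. The sketch below assumes that usage. `checkBundleVersion` is a hypothetical helper, not a function from the repository, and the annotations are modelled as a plain map rather than a real Kubernetes object.

```go
package main

import "fmt"

const (
	// Both constants are copied from version/version.go as shown in this diff.
	BundleVersionAnnotation = "inference.networking.k8s.io/bundle-version"
	BundleVersion           = "v1.0.2"
)

// checkBundleVersion returns an error when the annotation is absent or does
// not equal BundleVersion. (Hypothetical helper for illustration only.)
func checkBundleVersion(annotations map[string]string) error {
	got, ok := annotations[BundleVersionAnnotation]
	if !ok {
		return fmt.Errorf("annotation %q is missing", BundleVersionAnnotation)
	}
	if got != BundleVersion {
		return fmt.Errorf("bundle version mismatch: got %q, want %q", got, BundleVersion)
	}
	return nil
}

func main() {
	// A CRD still annotated with the previous release value.
	staleCRD := map[string]string{BundleVersionAnnotation: "v1.0.1"}
	if err := checkBundleVersion(staleCRD); err != nil {
		fmt.Println("stale CRD:", err)
	}
}
```

A check of this kind only compares strings for equality. It does not attempt semantic-version ordering, so it cannot tell an older bundle from a newer one.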