Commit c6f3df2

fix: change the inferenceobjective to use v1 inferencepool (#1338)
* change the example to use v1 inferencepool
* updated regression test
1 parent fa83453 commit c6f3df2

2 files changed: +19 -0 lines changed

config/manifests/inferenceobjective.yaml

Lines changed: 3 additions & 0 deletions
@@ -5,6 +5,7 @@ metadata:
 spec:
   criticality: 1
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 ---
 apiVersion: inference.networking.x-k8s.io/v1alpha2
@@ -14,6 +15,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 ---
 apiVersion: inference.networking.x-k8s.io/v1alpha2
@@ -23,4 +25,5 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct

config/manifests/regression-testing/inferenceobjective.yaml

Lines changed: 16 additions & 0 deletions
@@ -5,6 +5,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -16,6 +17,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -27,6 +29,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -38,6 +41,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -49,6 +53,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -60,6 +65,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -71,6 +77,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -82,6 +89,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -93,6 +101,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -104,6 +113,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -115,6 +125,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -126,6 +137,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -137,6 +149,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 
@@ -149,6 +162,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 
@@ -161,6 +175,7 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
 
 ---
@@ -173,4 +188,5 @@ metadata:
 spec:
   criticality: 2
   poolRef:
+    group: inference.networking.k8s.io
     name: vllm-llama3-8b-instruct
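
For orientation, here is a minimal sketch of what one patched document might look like once the added line is in place. Only `apiVersion`, `criticality`, and the `poolRef` fields appear in the diff. The `kind` is assumed from the file name and `metadata.name` is a hypothetical placeholder, so treat this as an illustration rather than the actual file contents.

```yaml
# Sketch only: kind and metadata.name are assumptions, not shown in the diff.
apiVersion: inference.networking.x-k8s.io/v1alpha2
kind: InferenceObjective                 # assumed from the file name inferenceobjective.yaml
metadata:
  name: example-objective                # hypothetical name
spec:
  criticality: 2
  poolRef:
    group: inference.networking.k8s.io   # the line this commit adds
    name: vllm-llama3-8b-instruct
```

Going by the commit title, setting `group` explicitly appears intended to make the reference resolve to the v1 InferencePool rather than relying on whatever group the field would otherwise default to.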
