1 file changed: +11 -27 lines changed
@@ -248,33 +248,17 @@ This quickstart guide is intended for engineers familiar with k8s and model serv
Wait until the gateway is ready.
- === "GPU-Based Model Server"
-
- ```bash
- IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
- PORT=80
-
- curl -i ${IP}:${PORT}/v1/completions -H 'Content-Type: application/json' -d '{
- "model": "food-review",
- "prompt": "Write as if you were a critic: San Francisco",
- "max_tokens": 100,
- "temperature": 0
- }'
- ```
-
- === "CPU-Based Model Server"
-
- ```bash
- IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
- PORT=80
-
- curl -i ${IP}:${PORT}/v1/completions -H 'Content-Type: application/json' -d '{
- "model": "Qwen/Qwen2.5-1.5B-Instruct",
- "prompt": "Write as if you were a critic: San Francisco",
- "max_tokens": 100,
- "temperature": 0
- }'
- ```
+ ```bash
+ IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
+ PORT=80
+
+ curl -i ${IP}:${PORT}/v1/completions -H 'Content-Type: application/json' -d '{
+ "model": "food-review",
+ "prompt": "Write as if you were a critic: San Francisco",
+ "max_tokens": 100,
+ "temperature": 0
+ }'
+ ```
### Cleanup
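The change collapses the GPU and CPU tabs into one snippet that always sends `"model": "food-review"`. For anyone who wants to try the request with either model name that appeared in the old tabs, here is a minimal sketch that takes the model as a parameter. The function names `build_payload` and `send_request` are illustrative and not part of the change. Whether the CPU deployment still answers to `Qwen/Qwen2.5-1.5B-Instruct` after this change is not stated in the diff, so treat that as an assumption to verify.

```shell
#!/usr/bin/env bash
# Sketch only: build_payload and send_request are hypothetical helper names,
# not part of the quickstart or of this change.

# Print the JSON request body for the given model name.
build_payload() {
  local model="$1"
  printf '{"model": "%s", "prompt": "Write as if you were a critic: San Francisco", "max_tokens": 100, "temperature": 0}\n' "$model"
}

# Look up the gateway address and send the completion request.
# Requires kubectl access to the cluster and a ready gateway.
send_request() {
  local model="${1:-food-review}"
  local ip port=80
  ip=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
  curl -i "${ip}:${port}/v1/completions" \
    -H 'Content-Type: application/json' \
    -d "$(build_payload "$model")"
}
```

After sourcing the file, `send_request` reproduces the request from the new snippet, and `send_request "Qwen/Qwen2.5-1.5B-Instruct"` sends the body from the removed CPU tab.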