Skip to content

Commit 158e5ae

Browse files
Merge pull request #18020 from hungry1526/main
Update PS command to fix the prompt issue
2 parents ec14065 + 3a0b04d commit 158e5ae

File tree

1 file changed

+5
-9
lines changed

1 file changed

+5
-9
lines changed

AKS-Arc/deploy-ai-model.md

Lines changed: 5 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -4,9 +4,9 @@ description: Learn how to deploy an AI model on AKS enabled by Azure Arc with th
44
author: sethmanheim
55
ms.author: sethm
66
ms.topic: how-to
7-
ms.date: 05/20/2025
7+
ms.date: 05/27/2025
88
ms.reviewer: haojiehang
9-
ms.lastreviewed: 05/20/2025
9+
ms.lastreviewed: 05/27/2025
1010

1111
---
1212

@@ -167,12 +167,7 @@ kubectl run -it --rm --restart=Never curl --image=curlimages/curl -- curl -X POS
167167

168168
```powershell
169169
$CLUSTERIP = $(kubectl get svc workspace-llm -o jsonpath="{.spec.clusterIPs[0]}" )
170-
$jsonContent = '{
171-
"model": "phi-3.5-mini-instruct",
172-
"prompt": "What is kubernetes?",
173-
"max_tokens": 20,
174-
"temperature": 0
175-
}'
170+
$jsonContent = '{"model":"phi-3.5-mini-instruct","prompt":"What is kubernetes","max_tokens":200,"temperature":0}' | ConvertTo-Json
176171
177172
kubectl run -it --rm --restart=Never curl --image=curlimages/curl -- curl -X POST http://$CLUSTERIP/v1/completions -H "accept: application/json" -H "Content-Type: application/json" -d $jsonContent
178173
```
@@ -209,4 +204,5 @@ The following table shows the supported GPU models and their corresponding VM SK
209204

210205
## Next steps
211206

212-
In this article, you learned how to deploy an AI model on AKS enabled by Azure Arc with the Kubernetes AI toolchain operator (KAITO). For more information about the KAITO project, see the [KAITO GitHub repo](https://github.com/kaito-project/kaito).
207+
* [Monitor the inference metrics](/azure/aks/ai-toolchain-operator-monitoring) in Managed Prometheus and Managed Grafana
208+
* For more information about KAITO, see [KAITO GitHub Repo](https://github.com/kaito-project/kaito)

0 commit comments

Comments
 (0)