Skip to content

Commit 46a34fa

Browse files
authored
Update how-to-deploy-models-llama.md
1 parent d8887d5 commit 46a34fa

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

articles/machine-learning/how-to-deploy-models-llama.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -204,7 +204,7 @@ To create a deployment:
204204

205205
---
206206

207-
To learn about billing for Meta Llama models deployed as a serverless API, see [Cost and quota considerations for Meta Llama models deployed as a serverless API](#cost-and-quota-considerations-for-meta-llama-3.1-models-deployed-as-a-serverless-api).
207+
To learn about billing for Meta Llama models deployed as a serverless API, see [Cost and quota considerations for Meta Llama models deployed as a serverless API](#cost-and-quota-considerations-for-meta-llama-31-models-deployed-as-a-serverless-api).
208208

209209
### Consume Meta Llama models as a service
210210

@@ -220,7 +220,7 @@ Models deployed as a service can be consumed using either the chat or the comple
220220
- For completions models, such as `Llama-3-8B`, use the [`<target_url>/v1/completions`](#completions-api) API.
221221
- For chat models, such as `Meta-Llama-3.1-405B-Instruct`, use the [`/chat/completions`](#chat-api) API.
222222

223-
For more information on using the APIs, see the [reference](#reference-for-meta-llama-3.1-models-deployed-a-serverless-api) section.
223+
For more information on using the APIs, see the [reference](#reference-for-meta-llama-31-models-deployed-a-serverless-api) section.
224224

225225
# [Meta Llama 2](#tab/llama-two)
226226

@@ -232,7 +232,7 @@ Models deployed as a service can be consumed using either the chat or the comple
232232
- For completions models, such as `Meta-Llama-2-7B`, use the [`/v1/completions`](#completions-api) API or the [Azure AI Model Inference API](reference-model-inference-api.md) on the route `/completions`.
233233
- For chat models, such as `Meta-Llama-2-7B-Chat`, use the [`/v1/chat/completions`](#chat-api) API or the [Azure AI Model Inference API](reference-model-inference-api.md) on the route `/chat/completions`.
234234

235-
For more information on using the APIs, see the [reference](#reference-for-meta-llama-3.1-models-deployed-a-serverless-api) section.
235+
For more information on using the APIs, see the [reference](#reference-for-meta-llama-31-models-deployed-a-serverless-api) section.
236236

237237
---
238238

0 commit comments

Comments
 (0)