Skip to content

Commit 960fa7b

Browse files
authored
fix(inference): changed to full names everywhere (#3871)
1 parent 21ee82b commit 960fa7b

11 files changed

+13
-23
lines changed

ai-data/managed-inference/how-to/managed-inference-with-private-network.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -91,7 +91,7 @@ Using a Private Network for communications between your Instances hosting your a
9191
import requests
9292

9393
PAYLOAD = {
94-
"model": "<MODEL_DEPLOYED>", # EXAMPLE= meta/llama-3-8b-instruct:bf16
94+
"model": "<MODEL_DEPLOYED>", # EXAMPLE= meta/llama-3.1-8b-instruct:fp8
9595
"messages": [
9696
{"role": "system",
9797
"content": "You are a helpful, respectful and honest assistant."},

ai-data/managed-inference/reference-content/llama-3-70b-instruct.mdx

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,6 @@ categories:
1717
| Attribute | Details |
1818
|-----------------|------------------------------------|
1919
| Provider | [Meta](https://llama.meta.com/llama3/) |
20-
| Model Name | `llama-3-70b-instruct` |
2120
| Compatible Instances | H100 (FP8) |
2221
| Context size | 8192 tokens |
2322

@@ -62,7 +61,7 @@ curl -s \
6261
-H "Content-Type: application/json" \
6362
--request POST \
6463
--url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
65-
--data '{"model":"llama-3-70b-instruct", "messages":[{"role": "user","content": "Sing me a song about Xavier Niel"}], "max_tokens": 500, "top_p": 1, "temperature": 0.7, "stream": false}'
64+
--data '{"model":"meta/llama-3-70b-instruct:fp8", "messages":[{"role": "user","content": "Sing me a song about Xavier Niel"}], "max_tokens": 500, "top_p": 1, "temperature": 0.7, "stream": false}'
6665
```
6766

6867
Make sure to replace `<IAM API key>` and `<Deployment UUID>` with your actual [IAM API key](/identity-and-access-management/iam/how-to/create-api-keys/) and the Deployment UUID you are targeting.

ai-data/managed-inference/reference-content/llama-3-8b-instruct.mdx

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,6 @@ categories:
1717
| Attribute | Details |
1818
|-----------------|------------------------------------|
1919
| Provider | [Meta](https://llama.meta.com/llama3/) |
20-
| Model Name | `llama-3-8b-instruct` |
2120
| Compatible Instances | L4, H100 (FP8, BF16) |
2221
| Context size | 8192 tokens |
2322

@@ -66,7 +65,7 @@ curl -s \
6665
-H "Content-Type: application/json" \
6766
--request POST \
6867
--url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
69-
--data '{"model":"llama-3-8b-instruct", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "top_p": 1, "temperature": 0.7, "stream": false}'
68+
--data '{"model":"meta/llama-3-8b-instruct:fp8", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "top_p": 1, "temperature": 0.7, "stream": false}'
7069
```
7170

7271
Make sure to replace `<IAM API key>` and `<Deployment UUID>` with your actual [IAM API key](/identity-and-access-management/iam/how-to/create-api-keys/) and the Deployment UUID you are targeting.

ai-data/managed-inference/reference-content/llama-3.1-70b-instruct.mdx

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,7 @@ categories:
1717
| Attribute | Details |
1818
|-----------------|------------------------------------|
1919
| Provider | [Meta](https://llama.meta.com/llama3/) |
20-
| License | [Llama 3.1 community](https://llama.meta.com/llama3_1/license/) |
21-
| Model Name | `llama-3.1-70b-instruct` |
20+
| License | [Llama 3.1 community](https://llama.meta.com/llama3_1/license/) | |
2221
| Compatible Instances | H100 (FP8), H100-2 (FP8, BF16) |
2322
| Context Length | up to 128k tokens |
2423

@@ -61,7 +60,7 @@ curl -s \
6160
-H "Content-Type: application/json" \
6261
--request POST \
6362
--url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
64-
--data '{"model":"llama-3.1-70b-instruct", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "temperature": 0.7, "stream": false}'
63+
--data '{"model":"meta/llama-3.1-70b-instruct:fp8", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "temperature": 0.7, "stream": false}'
6564
```
6665

6766
Make sure to replace `<IAM API key>` and `<Deployment UUID>` with your actual [IAM API key](/identity-and-access-management/iam/how-to/create-api-keys/) and the Deployment UUID you are targeting.

ai-data/managed-inference/reference-content/llama-3.1-8b-instruct.mdx

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,6 @@ categories:
1818
|-----------------|------------------------------------|
1919
| Provider | [Meta](https://llama.meta.com/llama3/) |
2020
| License | [Llama 3.1 community](https://llama.meta.com/llama3_1/license/) |
21-
| Model Name | `llama-3.1-8b-instruct` |
2221
| Compatible Instances | L4, H100, H100-2 (FP8, BF16) |
2322
| Context Length | up to 128k tokens |
2423

@@ -62,7 +61,7 @@ curl -s \
6261
-H "Content-Type: application/json" \
6362
--request POST \
6463
--url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
65-
--data '{"model":"llama-3.1-8b-instruct", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "temperature": 0.7, "stream": false}'
64+
--data '{"model":"meta/llama-3.1-8b-instruct:fp8", "messages":[{"role": "user","content": "There is a llama in my garden, what should I do?"}], "max_tokens": 500, "temperature": 0.7, "stream": false}'
6665
```
6766

6867
Make sure to replace `<IAM API key>` and `<Deployment UUID>` with your actual [IAM API key](/identity-and-access-management/iam/how-to/create-api-keys/) and the Deployment UUID you are targeting.

ai-data/managed-inference/reference-content/mistral-7b-instruct-v0.3.mdx

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -17,14 +17,13 @@ categories:
1717
| Attribute | Details |
1818
|-----------------|------------------------------------|
1919
| Provider | [Mistral](https://mistral.ai/technology/#models) |
20-
| Model Name | `mistral-7b-instruct-v0.3` |
2120
| Compatible Instances | L4 (BF16) |
2221
| Context size | 32K tokens |
2322

2423
## Model name
2524

2625
```bash
27-
mistral-7b-instruct-v0.3:bf16
26+
mistral/mistral-7b-instruct-v0.3:bf16
2827
```
2928

3029
## Compatible Instances
@@ -55,7 +54,7 @@ curl -s \
5554
-H "Content-Type: application/json" \
5655
--request POST \
5756
--url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
58-
--data '{"model":"mistral-7b-instruct-v0.3", "messages":[{"role": "user","content": "Explain Public Cloud in a nutshell."}], "top_p": 1, "temperature": 0.7, "stream": false}'
57+
--data '{"model":"mistral/mistral-7b-instruct-v0.3:bf16", "messages":[{"role": "user","content": "Explain Public Cloud in a nutshell."}], "top_p": 1, "temperature": 0.7, "stream": false}'
5958
```
6059

6160
Make sure to replace `<IAM API key>` and `<Deployment UUID>` with your actual [IAM API key](/identity-and-access-management/iam/how-to/create-api-keys/) and the Deployment UUID you are targeting.

ai-data/managed-inference/reference-content/mistral-nemo-instruct-2407.mdx

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -17,14 +17,13 @@ categories:
1717
| Attribute | Details |
1818
|-----------------|------------------------------------|
1919
| Provider | [Mistral](https://mistral.ai/technology/#models) |
20-
| Model Name | `mistral-nemo-instruct-2407` |
2120
| Compatible Instances | H100 (FP8) |
2221
| Context size | 128K tokens |
2322

2423
## Model name
2524

2625
```bash
27-
mistral-nemo-instruct-2407:fp8
26+
mistral/mistral-nemo-instruct-2407:fp8
2827
```
2928

3029
## Compatible Instances
@@ -61,7 +60,7 @@ curl -s \
6160
-H "Content-Type: application/json" \
6261
--request POST \
6362
--url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
64-
--data '{"model":"mistral-nemo-instruct-2407", "messages":[{"role": "user","content": "Sing me a song about Xavier Niel"}], "top_p": 1, "temperature": 0.35, "stream": false}'
63+
--data '{"model":"mistral/mistral-nemo-instruct-2407:fp8", "messages":[{"role": "user","content": "Sing me a song about Xavier Niel"}], "top_p": 1, "temperature": 0.35, "stream": false}'
6564
```
6665

6766
Make sure to replace `<IAM API key>` and `<Deployment UUID>` with your actual [IAM API key](/identity-and-access-management/iam/how-to/create-api-keys/) and the Deployment UUID you are targeting.

ai-data/managed-inference/reference-content/mixtral-8x7b-instruct-v0.1.mdx

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,6 @@ categories:
1717
| Attribute | Details |
1818
|-----------------|------------------------------------|
1919
| Provider | [Mistral](https://mistral.ai/technology/#models) |
20-
| Model Name | `mixtral-8x7b-instruct-v0.1` |
2120
| Compatible Instances | H100 (FP8) - H100-2 (FP16) |
2221
| Context size | 32k tokens |
2322

@@ -57,7 +56,7 @@ curl -s \
5756
-H "Content-Type: application/json" \
5857
--request POST \
5958
--url "https://<Deployment UUID>.ifr.fr-par.scaleway.com/v1/chat/completions" \
60-
--data '{"model":"mixtral-8x7b-instruct-v0.1", "messages":[{"role": "user","content": "Sing me a song about Scaleway"}], "max_tokens": 200, "top_p": 1, "temperature": 1, "stream": false}'
59+
--data '{"model":"mistral/mixtral-8x7b-instruct-v0.1:fp8", "messages":[{"role": "user","content": "Sing me a song about Scaleway"}], "max_tokens": 200, "top_p": 1, "temperature": 1, "stream": false}'
6160
```
6261

6362
Make sure to replace `<IAM API key>` and `<Deployment UUID>` with your actual [IAM API key](/identity-and-access-management/iam/how-to/create-api-keys/) and the Deployment UUID you are targeting.

ai-data/managed-inference/reference-content/pixtral-12b-2409.mdx

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,6 @@ categories:
1717
| Attribute | Details |
1818
|-----------------|------------------------------------|
1919
| Provider | [Mistral](https://mistral.ai/technology/#models) |
20-
| Model Name | `pixtral-12b-2409` |
2120
| Compatible Instances | H100, H100-2 (bf16) |
2221
| Context size | 128k tokens |
2322

ai-data/managed-inference/reference-content/sentence-t5-xxl.mdx

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -15,11 +15,10 @@ categories:
1515
| Attribute | Details |
1616
|-----------------|------------------------------------|
1717
| Provider | [sentence-transformers](https://www.sbert.net/) |
18-
| Model Name | `sentence-t5-xxl` |
1918
| Compatible Instances | L4 (FP32) |
2019
| Context size | 512 tokens |
2120

22-
## Model names
21+
## Model name
2322

2423
```bash
2524
sentence-transformers/sentence-t5-xxl:fp32

0 commit comments

Comments
 (0)