pages/managed-inference/reference-content/supported-models.mdx
6 additions & 6 deletions
@@ -9,7 +9,7 @@ dates:
Scaleway Managed Inference allows you to deploy various AI models, either from:
- * [Scaleway model catalog](#scaleway-model-catalog): A curated set of ready-to-deploy models available through the [Scaleway console](https:/console.scaleway.com/inference/deployments/) or the [Managed Inference models API](https:/www.scaleway.com/en/developers/api/inference/#path-models-list-models)
+ * [Scaleway model catalog](#scaleway-model-catalog): A curated set of ready-to-deploy models available through the [Scaleway console](https://console.scaleway.com/inference/deployments/) or the [Managed Inference models API](https://www.scaleway.com/en/developers/api/inference/#path-models-list-models)
* [Custom models](#custom-models): Models that you import, typically from sources like Hugging Face.
## Scaleway model catalog
@@ -19,14 +19,14 @@ You can find a complete list of all models available in Scaleway's catalog on th
## Custom models
<Message type="note">
- Custom model support is currently in **beta**. If you encounter issues or limitations, please report them via our [Slack community channel](https:/scaleway-community.slack.com/archives/C01SGLGRLEA) or [customer support](https:/console.scaleway.com/support/tickets/create?for=product&productName=inference).
+ Custom model support is currently in **beta**. If you encounter issues or limitations, please report them via our [Slack community channel](https://scaleway-community.slack.com/archives/C01SGLGRLEA) or [customer support](https://console.scaleway.com/support/tickets/create?for=product&productName=inference).
</Message>
### Prerequisites
<Message type="tip">
We recommend starting with a variation of a supported model from the Scaleway catalog.
- For example, you can deploy a [quantized (4-bit) version of Llama 3.3](https:/huggingface.co/unsloth/Llama-3.3-70B-Instruct-bnb-4bit).
+ For example, you can deploy a [quantized (4-bit) version of Llama 3.3](https://huggingface.co/unsloth/Llama-3.3-70B-Instruct-bnb-4bit).
If deploying a fine-tuned version of Llama 3.3, make sure your file structure matches the example linked above.
Examples whose compatibility has been tested are available in [tested models](#known-compatible-models).
</Message>
@@ -37,7 +37,7 @@ To deploy a custom model via Hugging Face, ensure the following:
* You must have access to the model using your Hugging Face credentials.
* For gated models, request access through your Hugging Face account.
- * Credentials are not stored, but we recommend using [read or fine-grained access tokens](https:/huggingface.co/docs/hub/security-tokens).
+ * Credentials are not stored, but we recommend using [read or fine-grained access tokens](https://huggingface.co/docs/hub/security-tokens).
#### Required files
@@ -46,7 +46,7 @@ Your model repository must include:
* A `config.json` file containing:
  * An `architectures` array (see [supported architectures](#supported-models-architecture) for the exact list of supported values).
  * `max_position_embeddings`
- * Model weights in the [`.safetensors`](https:/huggingface.co/docs/safetensors/index) format
+ * Model weights in the [`.safetensors`](https://huggingface.co/docs/safetensors/index) format
* A `tokenizer.json` file
  * If you are fine-tuning an existing model, we recommend you use the same `tokenizer.json` file from the base model.
* A chat template included in either:
@@ -68,7 +68,7 @@ Your model must be one of the following types:
<Message type="important">
**Security Notice**<br />
- Models using formats that allow arbitrary code execution, such as Python [`pickle`](https:/docs.python.org/3/library/pickle.html), are **not supported**.
+ Models using formats that allow arbitrary code execution, such as Python [`pickle`](https://docs.python.org/3/library/pickle.html), are **not supported**.
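
Every change in this diff is the same fix: a link written as `https:/host` (one slash) becomes `https://host`. A check like this could be run mechanically over other pages. The following is a minimal sketch in Python. The helper names `find_single_slash_urls` and `fix_single_slash_urls` are hypothetical and are not part of this PR or of any Scaleway tooling.

```python
import re

# Matches "http:/" or "https:/" only when the slash is NOT followed by a
# second slash, so well-formed "https://" links are left alone.
SINGLE_SLASH_SCHEME = re.compile(r"\b(https?):/(?!/)")


def find_single_slash_urls(text: str) -> list[tuple[int, str]]:
    """Return (line_number, line) pairs for lines with a malformed link."""
    return [
        (number, line)
        for number, line in enumerate(text.splitlines(), start=1)
        if SINGLE_SLASH_SCHEME.search(line)
    ]


def fix_single_slash_urls(text: str) -> str:
    """Rewrite 'https:/host' to 'https://host' everywhere in the text."""
    return SINGLE_SLASH_SCHEME.sub(r"\1://", text)


if __name__ == "__main__":
    sample = (
        "See the [console](https:/console.scaleway.com/inference/deployments/).\n"
        "See the [tokens guide](https://huggingface.co/docs/hub/security-tokens).\n"
    )
    for number, line in find_single_slash_urls(sample):
        print(f"line {number}: {line}")
    print(fix_single_slash_urls(sample), end="")
```

The negative lookahead keeps the substitution idempotent, so running it again on an already-fixed file changes nothing.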