Skip to content

Commit e0f7e72

Browse files
authored
feat(infr): add list of known working custom models
1 parent 0543961 commit e0f7e72

File tree

1 file changed

+15
-1
lines changed

1 file changed

+15
-1
lines changed

pages/managed-inference/reference-content/supported-models.mdx

Lines changed: 15 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,7 @@ You can find a complete list of all models available in Scaleway's catalog on th
3434
We recommend starting with a variation of a supported model from the Scaleway catalog.
3535
For example, you can deploy a [quantized (4-bit) version of Llama 3.3](https:/huggingface.co/unsloth/Llama-3.3-70B-Instruct-bnb-4bit).
3636
If deploying a fine-tuned version of Llama 3.3, make sure your file structure matches the example linked above.
37+
Examples whose compatibility has been tested are available in [tested models](#known-compatible-models).
3738
</Message>
3839

3940
To deploy a custom model via Hugging Face, ensure the following:
@@ -232,4 +233,17 @@ Custom models must conform to one of the architectures listed below. Click to ex
232233
* `EAGLEModel`
233234
* `MedusaModel`
234235
* `MLPSpeculatorPreTrainedModel`
235-
</Concept>
236+
</Concept>
237+
238+
## Known compatible models
239+
240+
Several models have already been verified to work on Managed Inference Custom models. This list is not exhaustive and is updated gradually. Click to expand full list.
241+
242+
<Concept>
243+
## Models verified for compatibility
244+
The following model compatibility has been verified:
245+
* `ibm-granite/granite-vision-3.2-2b`
246+
* `ibm-granite/granite-3.3-2b-instruct`
247+
* `microsoft/phi-4`
248+
* `Qwen/Qwen3-32B`
249+
</Concept>

0 commit comments

Comments
 (0)