From 463c8993ff5f37fb8ca22bc1bb85c0fcf555d138 Mon Sep 17 00:00:00 2001 From: Benedikt Rollik Date: Mon, 12 May 2025 11:39:09 +0200 Subject: [PATCH] docs(infr): update docs managed inference --- .../reference-content/model-catalog.mdx | 4 ++ .../reference-content/supported-models.mdx | 40 ++----------------- 2 files changed, 7 insertions(+), 37 deletions(-) diff --git a/pages/managed-inference/reference-content/model-catalog.mdx b/pages/managed-inference/reference-content/model-catalog.mdx index c141f36c9c..ea0da8a8cd 100644 --- a/pages/managed-inference/reference-content/model-catalog.mdx +++ b/pages/managed-inference/reference-content/model-catalog.mdx @@ -14,6 +14,10 @@ categories: --- A quick overview of available models in Scaleway's catalog and their core attributes. Expand any model below to see usage examples, curl commands, and detailed capabilities. + + For more information about all the models supported in Managed Inference, refer to the [Supported Models in Managed Inference](/managed-inference/reference-content/supported-models/) page. + + ## Models technical summary | Model name | Provider | Maximum Context length (tokens) | Modalities | Instances | License | diff --git a/pages/managed-inference/reference-content/supported-models.mdx b/pages/managed-inference/reference-content/supported-models.mdx index 7cd1cfcb9e..ff784a9568 100644 --- a/pages/managed-inference/reference-content/supported-models.mdx +++ b/pages/managed-inference/reference-content/supported-models.mdx @@ -15,46 +15,12 @@ categories: Scaleway Managed Inference allows you to deploy various AI models, either from: - * [Scaleway catalog](#scaleway-catalog): A curated set of ready-to-deploy models available through the [Scaleway console](https:/console.scaleway.com/inference/deployments/) or the [Managed Inference models API](https:/www.scaleway.com/en/developers/api/inference/#path-models-list-models) + * [Scaleway model catalog](#scaleway-model-catalog): A curated set of ready-to-deploy models available through the [Scaleway console](https:/console.scaleway.com/inference/deployments/) or the [Managed Inference models API](https:/www.scaleway.com/en/developers/api/inference/#path-models-list-models) * [Custom models](#custom-models): Models that you import, typically from sources like Hugging Face. -## Scaleway catalog - -### Multimodal models (chat + vision) - -_More details to be added._ - -### Chat models - -| Provider | Model identifier | Documentation | License | -|------------|-----------------------------------|--------------------------------------------------------------------------|-------------------------------------------------------| -| Allen AI | `molmo-72b-0924` | [View Details](/managed-inference/reference-content/model-catalog/#molmo-72b-0924) | [Apache 2.0 license](https:/www.apache.org/licenses/LICENSE-2.0) | -| Deepseek | `deepseek-r1-distill-llama-70b` | [View Details](/managed-inference/reference-content/model-catalog/#deepseek-r1-distill-llama-70b) | [MIT license](https:/huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md) | -| Deepseek | `deepseek-r1-distill-llama-8b` | [View Details](/managed-inference/reference-content/model-catalog/#deepseek-r1-distill-llama-8b) | [MIT license](https:/huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md) | -| Meta | `llama-3-70b-instruct` | [View Details](/managed-inference/reference-content/model-catalog/#llama-3-70b-instruct) | [Llama 3 license](https:/www.llama.com/llama3/license/) | -| Meta | `llama-3-8b-instruct` | [View Details](/managed-inference/reference-content/model-catalog/#llama-3-8b-instruct) | [Llama 3 license](https:/www.llama.com/llama3/license/) | -| Meta | `llama-3.1-70b-instruct` | [View Details](/managed-inference/reference-content/model-catalog/#llama-31-70b-instruct) | [Llama 3.1 community license](https:/www.llama.com/llama3_1/license/) | -| Meta | `llama-3.1-8b-instruct` | [View Details](/managed-inference/reference-content/model-catalog/#llama-31-8b-instruct) | [Llama 3.1 license](https:/www.llama.com/llama3_1/license/) | -| Meta | `llama-3.3-70b-instruct` | [View Details](/managed-inference/reference-content/model-catalog/#llama-33-70b-instruct) | [Llama 3.3 license](https:/www.llama.com/llama3_3/license/) | -| Nvidia | `llama-3.1-nemotron-70b-instruct` | [View Details](/managed-inference/reference-content/model-catalog/#llama-31-nemotron-70b-instruct)| [Llama 3.1 community license](https:/www.llama.com/llama3_1/license/) | -| Mistral | `mixtral-8x7b-instruct-v0.1` | [View Details](/managed-inference/reference-content/model-catalog/#mixtral-8x7b-instruct-v01/) | [Apache 2.0 license](https:/www.apache.org/licenses/LICENSE-2.0) | -| Mistral | `mistral-7b-instruct-v0.3` | [View Details](/managed-inference/reference-content/model-catalog/#mistral-7b-instruct-v03) | [Apache 2.0 license](https:/www.apache.org/licenses/LICENSE-2.0) | -| Mistral | `mistral-nemo-instruct-2407` | [View Details](/managed-inference/reference-content/model-catalog/#mistral-nemo-instruct-2407) | [Apache 2.0 license](https:/www.apache.org/licenses/LICENSE-2.0) | -| Mistral | `mistral-small-24b-instruct-2501` | [View Details](/managed-inference/reference-content/model-catalog/#mistral-small-24b-instruct-2501)| [Apache 2.0 license](https:/www.apache.org/licenses/LICENSE-2.0) | -| Mistral | `pixtral-12b-2409` | [View Details](/managed-inference/reference-content/model-catalog/#pixtral-12b-2409) | [Apache 2.0 license](https:/www.apache.org/licenses/LICENSE-2.0) | -| Qwen | `qwen2.5-coder-32b-instruct` | [View Details](/managed-inference/reference-content/model-catalog/#qwen2.5-coder-32b-instruct) | [Apache 2.0 license](https:/huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct/blob/main/LICENSE) | - -### Vision models - -_More details to be added._ - -### Embedding models - -| Provider | Model identifier | Documentation | License | -|----------|------------------|----------------|---------| -| BAAI | `bge-multilingual-gemma2` | [View Details](/managed-inference/reference-content/model-catalog/#bge-multilingual-gemma2) | [Gemma Terms of Use](https:/ai.google.dev/gemma/terms) | -| Sentence Transformers | `sentence-t5-xxl` | [View Details](/managed-inference/reference-content/model-catalog/#sentence-t5-xxl) | [Apache 2.0 license](https:/www.apache.org/licenses/LICENSE-2.0) | +## Scaleway model catalog +You can find a complete list of all models available in Scaleway's catalog on the [Managed Inference model catalog page](/managed-inference/reference-content/model-catalog/). ## Custom models