feat: deepseek

santiagxf · santiagxf · commit 6c4791a6bfde · 2025-01-31T15:53:44.000-05:00
diff --git a/articles/ai-foundry/model-inference/concepts/models.md b/articles/ai-foundry/model-inference/concepts/models.md
@@ -17,9 +17,6 @@ ms.custom: references_regions, tool_generated
 
 Azure AI model inference in Azure AI Foundry gives you access to flagship models in Azure AI to consume them as APIs without hosting them on your infrastructure.
 
-> [!TIP]
-> DeepSeek-R1 (preview) is available for deployment as [Serverless API endpoint](../../../ai-studio/how-to/deploy-models-deepseek.md).
-
 :::image type="content" source="../media/models/models-catalog.gif" alt-text="An animation showing Azure AI studio model catalog section and the models available." lightbox="../media/models/models-catalog.gif":::
 
 Model availability varies by model provider, deployment SKU, and cloud. All models available in Azure AI Model Inference support the [Global standard](deployment-types.md#global-standard) deployment type which uses global capacity to guarantee throughput. [Azure OpenAI models](#azure-openai) also support regional deployments and [sovereign clouds](/entra/identity-platform/authentication-national-cloud)—Azure Government, Azure Germany, and Azure China 21Vianet.
@@ -93,6 +90,16 @@ Core42 includes autoregressive bi-lingual LLMs for Arabic & English with state-o
 
 See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=core42).
 
+### DeepSeek
+
+DeepSeek family of models include DeepSeek-R1 excels at reasoning tasks using a step-by-step training process, such as language, scientific reasoning, and coding tasks.
+
+| Model  | Type | Tier | Capabilities |
+| ------ | ---- | --- | ------------ |
+| [DeekSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:**  (4,096 tokens) <br /> - **Languages:** `en` and `zh` <br />  - **Tool calling:** No <br /> - **Response formats:** Text (with reasoning content). |
+
+See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
+
 ### Meta
 
 Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range is scale to include:
diff --git a/articles/ai-foundry/model-inference/overview.md b/articles/ai-foundry/model-inference/overview.md
@@ -19,7 +19,7 @@ recommendations: false
 Azure AI model inference provides access to the most powerful models available in the Azure AI model catalog. The models come from key model providers in the industry, including OpenAI, Microsoft, Meta, Mistral, Cohere, G42, and AI21 Labs. These models can be integrated with software solutions to deliver a wide range of tasks that include content generation, summarization, image understanding, semantic search, and code generation.
 
 > [!TIP]
-> DeepSeek-R1 is available for deployment as [Serverless API endpoint](../../ai-studio/how-to/deploy-models-deepseek.md).
+> To deploy DeepSeek-R1 in Azure AI model inference, follow the steps at [Add and configure models](how-to/create-model-deployments.md).
 
 Azure AI model inference provides a way to **consume models as APIs without hosting them on your infrastructure**. Models are hosted in a Microsoft-managed infrastructure, which enables API-based access to the model provider's model. API-based access can dramatically reduce the cost of accessing a model and simplify the provisioning experience.
 
@@ -38,6 +38,7 @@ You can get access to the key model providers in the industry including OpenAI,
 | [Azure OpenAI](concepts/models.md#azure-openai) | - o1 <br /> - gpt-4o <br /> - o1-preview <br /> - o1-mini <br /> - gpt-4o-mini <br /> - text-embedding-3-large <br /> - text-embedding-3-small <br /> |
 | [Cohere](concepts/models.md#cohere) | - Cohere-embed-v3-english <br /> - Cohere-embed-v3-multilingual <br /> - Cohere-command-r-plus-08-2024 <br /> - Cohere-command-r-08-2024 <br /> - Cohere-command-r-plus <br /> - Cohere-command-r <br /> |
 | [Core42](concepts/models.md#core42) | - jais-30b-chat <br /> |
+| [DeepSeek](concepts/models.md#deepseek) | - DeepSeek-R1 |
 | [Meta](concepts/models.md#meta) | - Llama-3.3-70B-Instruct <br /> - Llama-3.2-11B-Vision-Instruct <br /> - Llama-3.2-90B-Vision-Instruct <br /> - Meta-Llama-3.1-405B-Instruct <br /> - Meta-Llama-3-8B-Instruct <br /> - Meta-Llama-3.1-70B-Instruct <br /> - Meta-Llama-3.1-8B-Instruct <br /> - Meta-Llama-3-70B-Instruct <br /> |
 | [Microsoft](concepts/models.md#microsoft) | - Phi-3-mini-128k-instruct <br /> - Phi-3-mini-4k-instruct <br /> - Phi-3-small-8k-instruct <br /> - Phi-3-medium-128k-instruct <br /> - Phi-3-medium-4k-instruct <br /> - Phi-3.5-vision-instruct <br /> - Phi-3.5-MoE-instruct <br /> - Phi-3-small-128k-instruct <br /> - Phi-3.5-mini-instruct <br /> - Phi-4 <br /> |
 | [Mistral AI](concepts/models.md#mistral-ai) | - Ministral-3B <br /> - Mistral-large <br /> - Mistral-small <br /> - Mistral-Nemo <br /> - Mistral-large-2407 <br /> - Mistral-Large-2411 <br /> - Codestral-2501 <br /> |