Skip to content

Commit 32a986e

Browse files
committed
fixing merge conflict
2 parents 77666d1 + c63d54c commit 32a986e

File tree

87 files changed

+2825
-583
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

87 files changed

+2825
-583
lines changed

articles/ai-foundry/model-inference/breadcrumb/toc.yml

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,10 +2,10 @@
22
tocHref: /azure/
33
topicHref: /azure/index
44
items:
5-
- name: Azure AI services
6-
tocHref: /azure/ai-services/
7-
topicHref: /azure/ai-services/index
5+
- name: AI Foundry
6+
tocHref: /azure/ai-foundry/
7+
topicHref: /azure/ai-studio/index
88
items:
9-
- name: Azure AI Model Inference
10-
tocHref: /azure/ai-foundry/
9+
- name: Model Inference
10+
tocHref: /azure/ai-foundry/model-inference/
1111
topicHref: /azure/ai-foundry/model-inference/index

articles/ai-foundry/model-inference/concepts/models.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,9 @@ ms.custom: references_regions, tool_generated
1717

1818
Azure AI model inference in Azure AI Foundry gives you access to flagship models in Azure AI to consume them as APIs without hosting them on your infrastructure.
1919

20+
> [!TIP]
21+
> DeepSeek-R1 is available for deployment as [Serverless API endpoint](../../../ai-studio/how-to/deploy-models-deepseek.md).
22+
2023
:::image type="content" source="../media/models/models-catalog.gif" alt-text="An animation showing Azure AI studio model catalog section and the models available." lightbox="../media/models/models-catalog.gif":::
2124

2225
Model availability varies by model provider, deployment SKU, and cloud. All models available in Azure AI Model Inference support the [Global standard](deployment-types.md#global-standard) deployment type which uses global capacity to guarantee throughput. [Azure OpenAI models](#azure-openai) also support regional deployments and [sovereign clouds](/entra/identity-platform/authentication-national-cloud)—Azure Government, Azure Germany, and Azure China 21Vianet.

articles/ai-foundry/model-inference/how-to/quickstart-ai-project.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -141,7 +141,7 @@ Use the parameter `model="<deployment-name>` to route your request to this deplo
141141

142142
## Move from Serverless API Endpoints to Azure AI model inference
143143

144-
Although you configured the project to use the Azure AI model inference, existing model deployments continue to exit within the project as Serverless API Endpoints. Those deployments aren't moved for you. Hence, you can progressively upgrade any existing code that reference previous model deployments. To start moving the model deployments, we recommend the following workflow:
144+
Although you configured the project to use the Azure AI model inference, existing model deployments continue to exist within the project as Serverless API Endpoints. Those deployments aren't moved for you. Hence, you can progressively upgrade any existing code that reference previous model deployments. To start moving the model deployments, we recommend the following workflow:
145145

146146
1. Recreate the model deployment in Azure AI model inference. This model deployment is accessible under the **Azure AI model inference endpoint**.
147147

articles/ai-foundry/model-inference/overview.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,9 @@ recommendations: false
1818

1919
Azure AI model inference provides access to the most powerful models available in the Azure AI model catalog. The models come from key model providers in the industry, including OpenAI, Microsoft, Meta, Mistral, Cohere, G42, and AI21 Labs. These models can be integrated with software solutions to deliver a wide range of tasks that include content generation, summarization, image understanding, semantic search, and code generation.
2020

21+
> [!TIP]
22+
> DeepSeek-R1 is available for deployment as [Serverless API endpoint](../../ai-studio/how-to/deploy-models-deepseek.md).
23+
2124
Azure AI model inference provides a way to **consume models as APIs without hosting them on your infrastructure**. Models are hosted in a Microsoft-managed infrastructure, which enables API-based access to the model provider's model. API-based access can dramatically reduce the cost of accessing a model and simplify the provisioning experience.
2225

2326
Azure AI model inference is part of Azure AI Services, and users can access the service through [REST APIs](./reference/reference-model-inference-api.md), [SDKs in several languages](supported-languages.md) such as Python, C#, JavaScript, and Java. You can also use the Azure AI model inference from [Azure AI Foundry by configuring a connection](how-to/configure-project-connection.md).

articles/ai-foundry/model-inference/reference/reference-model-inference-api.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,8 @@ Models deployed to [serverless API endpoints](../../../ai-studio/how-to/deploy-m
4949
> * [Mistral-Large](../../../ai-studio/how-to/deploy-models-mistral.md)
5050
> * [Jais](../../../ai-studio/how-to/deploy-jais-models.md) family of models
5151
> * [Jamba](../../../ai-studio/how-to/deploy-models-jamba.md) family of models
52-
> * [Phi-3](../../../ai-studio/how-to/deploy-models-phi-3.md) family of models
52+
> * [Phi-3](../../../ai-studio/how-to/deploy-models-phi-3.md) and [Phi-4](../../../ai-studio/how-to/deploy-models-phi-4.md) family of models
53+
> * [DeepSeek-R1](../../../ai-studio/how-to/deploy-models-deepseek.md) family of models.
5354
5455
Models deployed to [managed inference](../../../ai-studio/concepts/deployments-overview.md):
5556

articles/ai-services/agents/concepts/model-region-support.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ author: aahill
77
ms.author: aahi
88
ms.service: azure-ai-agent-service
99
ms.topic: conceptual
10-
ms.date: 01/27/2025
10+
ms.date: 01/29/2025
1111
ms.custom: azure-ai-agents
1212
---
1313

@@ -27,6 +27,7 @@ Azure AI Agent Service supports the same models as the chat completions API in A
2727
| francecentral | - | - | - ||| - | - |||| - ||
2828
| japaneast | - | - | - | - | - | - | - | - || - |||
2929
| norwayeast |- | - | - | - || - |- | - | - | - | - | - |
30+
| southindia | - | - | - | - || - | - | - | - ||| - |
3031
| swedencentral |||||| - ||||| - ||
3132
| uksouth | - | - | - | - ||| - | - |||||
3233
| westus |||| - || - || - | - ||| - |
@@ -37,9 +38,10 @@ Azure AI Agent Service supports the same models as the chat completions API in A
3738

3839
The Azure AI Agent Service also supports the following models from the Azure AI Foundry model catalog.
3940

40-
* Llama 3.1-70B-instruct
41+
* Meta-Llama-405B-Instruct
4142
* Mistral-large-2407
42-
* Cohere command R+
43+
* Cohere-command-r-plus
44+
* Cohere-command-r
4345

4446
To use these models, you can use Azure AI Foundry portal to make a deployment, and then reference the deployment name in your agent. For example:
4547

articles/ai-services/agents/how-to/tools/azure-functions.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ services: azure-ai-agent-service
66
manager: nitinme
77
ms.service: azure-ai-agent-service
88
ms.topic: how-to
9-
ms.date: 12/11/2024
9+
ms.date: 01/30/2025
1010
author: aahill
1111
ms.author: aahi
1212
ms.custom: azure-ai-agents

articles/ai-services/agents/how-to/tools/function-calling.md

Lines changed: 1 addition & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ services: cognitive-services
66
manager: nitinme
77
ms.service: azure-ai-agent-service
88
ms.topic: how-to
9-
ms.date: 12/11/2024
9+
ms.date: 01/30/2025
1010
author: aahill
1111
ms.author: aahi
1212
zone_pivot_groups: selection-function-calling
@@ -22,12 +22,6 @@ Azure AI Agents supports function calling, which allows you to describe the stru
2222
> [!NOTE]
2323
> Runs expire ten minutes after creation. Be sure to submit your tool outputs before the expiration.
2424
25-
### Supported models
26-
27-
The [models page](../../concepts/model-region-support.md) contains the most up-to-date information on regions/models where Agents are supported.
28-
29-
To use all features of function calling including parallel functions, you need to use a model that was released after November 6, 2023.
30-
3125
### Usage support
3226

3327
|Azure AI foundry support | Python SDK | C# SDK | Basic agent setup | Standard agent setup |

articles/ai-services/agents/includes/quickstart-csharp.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,8 +16,6 @@ ms.date: 01/15/2025
1616
* Make sure you have the **Azure AI Developer** [RBAC role](../../../ai-studio/concepts/rbac-ai-studio.md) assigned at the appropriate level.
1717
* Install [the Azure CLI and the machine learning extension](/azure/machine-learning/how-to-configure-cli). If you have the CLI already installed, make sure it's updated to the latest version.
1818

19-
[!INCLUDE [rbac-roles](rbac-roles.md)]
20-
2119
[!INCLUDE [bicep-setup](bicep-setup.md)]
2220

2321
## Configure and run an agent

articles/ai-services/agents/includes/quickstart-python-openai.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,8 +18,6 @@ ms.date: 11/13/2024
1818
* You need the **Cognitive Services OpenAI User** role assigned to use the Azure AI Services resource.
1919
* Install [the Azure CLI and the machine learning extension](/azure/machine-learning/how-to-configure-cli). If you have the CLI already installed, make sure it's updated to the latest version.
2020

21-
[!INCLUDE [rbac-roles](rbac-roles.md)]
22-
2321
[!INCLUDE [bicep-setup](bicep-setup.md)]
2422

2523
## Configure and run an agent

0 commit comments

Comments
 (0)