Skip to content

Commit 24f0d57

Browse files
committed
Merge branch 'main' of https://github.com/MicrosoftDocs/azure-ai-docs-pr into comvis-updates
2 parents a845728 + eacdd79 commit 24f0d57

File tree

683 files changed

+9906
-8475
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

683 files changed

+9906
-8475
lines changed

.openpublishing.redirection.json

Lines changed: 135 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,11 @@
1515
"redirect_url": "/azure/search/search-how-to-dotnet-sdk",
1616
"redirect_document_id": false
1717
},
18+
{
19+
"source_path_from_root": "/articles/ai-services/agents/how-to/tools/overview.md",
20+
"redirect_url": "/azure/ai-services/agents/overview",
21+
"redirect_document_id": false
22+
},
1823
{
1924
"source_path_from_root": "/articles/search/search-howto-index-csv-blobs.md",
2025
"redirect_url": "/azure/search/search-how-to-index-csv-blobs",
@@ -25,10 +30,140 @@
2530
"redirect_url": "/azure/search/search-how-to-large-index",
2631
"redirect_document_id": false
2732
},
33+
{
34+
"source_path_from_root": "/articles/ai-services/agents/concepts/agents.md",
35+
"redirect_url": "/azure/ai-services/agents/overview",
36+
"redirect_document_id": false
37+
},
2838
{
2939
"source_path_from_root": "/articles/ai-services/openai/how-to/use-your-data-securely.md",
3040
"redirect_url": "/azure/ai-services/openai/how-to/on-your-data-configuration",
3141
"redirect_document_id": false
42+
},
43+
{
44+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/concepts/data-formats.md",
45+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
46+
"redirect_document_id": false
47+
},
48+
{
49+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/concepts/entity-components.md",
50+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
51+
"redirect_document_id": false
52+
},
53+
{
54+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/concepts/evaluation-metrics.md",
55+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
56+
"redirect_document_id": false
57+
},
58+
{
59+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/call-api.md",
60+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
61+
"redirect_document_id": false
62+
},
63+
{
64+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/create-project.md",
65+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
66+
"redirect_document_id": false
67+
},
68+
{
69+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/deploy-model.md",
70+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
71+
"redirect_document_id": false
72+
},
73+
{
74+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/design-schema.md",
75+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
76+
"redirect_document_id": false
77+
},
78+
{
79+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/fail-over.md",
80+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
81+
"redirect_document_id": false
82+
},
83+
{
84+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/label-data.md",
85+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
86+
"redirect_document_id": false
87+
},
88+
{
89+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/train-model.md",
90+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
91+
"redirect_document_id": false
92+
},
93+
{
94+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/how-to/view-model-evaluation.md",
95+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
96+
"redirect_document_id": false
97+
},
98+
{
99+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/language-support.md",
100+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
101+
"redirect_document_id": false
102+
},
103+
{
104+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/overview.md",
105+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
106+
"redirect_document_id": false
107+
},
108+
{
109+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/quickstart.md",
110+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
111+
"redirect_document_id": false
112+
},
113+
{
114+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/reference/glossary.md",
115+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
116+
"redirect_document_id": false
117+
},
118+
{
119+
"source_path_from_root": "/articles/ai-services/language-service/custom-text-analytics-for-health/reference/service-limits.md",
120+
"redirect_url": "/azure/ai-services/language-service/text-analytics-for-health/overview",
121+
"redirect_document_id": false
122+
},
123+
{
124+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/concepts/data-formats.md",
125+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
126+
"redirect_document_id": false
127+
},
128+
{
129+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/how-to/call-api.md",
130+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
131+
"redirect_document_id": false
132+
},
133+
{
134+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/how-to/create-project.md",
135+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
136+
"redirect_document_id": false
137+
},
138+
{
139+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/how-to/deploy-model.md",
140+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
141+
"redirect_document_id": false
142+
},
143+
{
144+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/how-to/design-schema.md",
145+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
146+
"redirect_document_id": false
147+
},
148+
{
149+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/how-to/label-data.md",
150+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
151+
"redirect_document_id": false
152+
},
153+
{
154+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/how-to/train-model.md",
155+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
156+
"redirect_document_id": false
157+
},
158+
{
159+
"source_path_from_root": "/articles/ai-services/language-service/sentiment-opinion-mining/custom/quickstart.md",
160+
"redirect_url": "/azure/ai-services/language-service/sentiment-opinion-mining/overview",
161+
"redirect_document_id": false
162+
},
163+
{
164+
"source_path_from_root": "/articles/ai-services/openai/references/azure-machine-learning.md",
165+
"redirect_url": "/azure/ai-services/openai/concepts/use-your-data",
166+
"redirect_document_id": false
32167
}
33168
]
34169
}

articles/ai-foundry/model-inference/breadcrumb/toc.yml

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,10 +2,10 @@
22
tocHref: /azure/
33
topicHref: /azure/index
44
items:
5-
- name: Azure AI services
6-
tocHref: /azure/ai-services/
7-
topicHref: /azure/ai-services/index
5+
- name: AI Foundry
6+
tocHref: /azure/ai-foundry/
7+
topicHref: /azure/ai-studio/index
88
items:
9-
- name: Azure AI Model Inference
10-
tocHref: /azure/ai-foundry/
9+
- name: Model Inference
10+
tocHref: /azure/ai-foundry/model-inference/
1111
topicHref: /azure/ai-foundry/model-inference/index

articles/ai-foundry/model-inference/concepts/models.md

Lines changed: 15 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -49,10 +49,11 @@ Azure OpenAI Service offers a diverse set of models with different capabilities
4949
- Models that can transcribe and translate speech to text
5050

5151
| Model | Type | Tier | Capabilities |
52-
| ------ | ---- | --- | ------------ |
52+
| ------ | ---- | ---- | ------------ |
53+
| [o3-mini](https://ai.azure.com/explore/models/o3-mini/version/2025-01-31/registry/azure-openai) | chat-completion | Global standard | - **Input:** text and image (200,000 tokens) <br /> - **Output:** text (100,000 tokens) <br /> - **Languages:** `en`, `it`, `af`, `es`, `de`, `fr`, `id`, `ru`, `pl`, `uk`, `el`, `lv`, `zh`, `ar`, `tr`, `ja`, `sw`, `cy`, `ko`, `is`, `bn`, `ur`, `ne`, `th`, `pa`, `mr`, and `te`. <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON, structured outputs |
5354
| [o1](https://ai.azure.com/explore/models/o1/version/2024-12-17/registry/azure-openai) | chat-completion | Global standard | - **Input:** text and image (200,000 tokens) <br /> - **Output:** text (100,000 tokens) <br /> - **Languages:** `en`, `it`, `af`, `es`, `de`, `fr`, `id`, `ru`, `pl`, `uk`, `el`, `lv`, `zh`, `ar`, `tr`, `ja`, `sw`, `cy`, `ko`, `is`, `bn`, `ur`, `ne`, `th`, `pa`, `mr`, and `te`. <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON, structured outputs |
5455
| [o1-preview](https://ai.azure.com/explore/models/o1-preview/version/1/registry/azure-openai) | chat-completion | Global standard <br />Standard<br /> | - **Input:** text (128,000 tokens) <br /> - **Output:** (32,768 tokens) <br /> - **Languages:** `en`, `it`, `af`, `es`, `de`, `fr`, `id`, `ru`, `pl`, `uk`, `el`, `lv`, `zh`, `ar`, `tr`, `ja`, `sw`, `cy`, `ko`, `is`, `bn`, `ur`, `ne`, `th`, `pa`, `mr`, and `te`. <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON, structured outputs |
55-
| [o1-mini](https://ai.azure.com/explore/models/o1-mini/version/1/registry/azure-openai) | chat-completion | Global standard <br />Standard | - **Input:** text (128,000 tokens) <br /> - **Output:** (65,536 tokens) <br /> - **Languages:** `en`, `it`, `af`, `es`, `de`, `fr`, `id`, `ru`, `pl`, `uk`, `el`, `lv`, `zh`, `ar`, `tr`, `ja`, `sw`, `cy`, `ko`, `is`, `bn`, `ur`, `ne`, `th`, `pa`, `mr`, and `te`. <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON, structured outputs |
56+
| [o1-mini](https://ai.azure.com/explore/models/o1-mini/version/1/registry/azure-openai) | chat-completion | Global standard <br />Standard | - **Input:** text (128,000 tokens) <br /> - **Output:** (65,536 tokens) <br /> - **Languages:** `en`, `it`, `af`, `es`, `de`, `fr`, `id`, `ru`, `pl`, `uk`, `el`, `lv`, `zh`, `ar`, `tr`, `ja`, `sw`, `cy`, `ko`, `is`, `bn`, `ur`, `ne`, `th`, `pa`, `mr`, and `te`. <br /> - **Tool calling:** No <br /> - **Response formats:** Text |
5657
| [gpt-4o-realtime-preview](https://ai.azure.com/explore/models/gpt-4o-realtime-preview/version/2024-10-01/registry/azure-openai) | real-time | Global standard | - **Input:** control, text, and audio (131,072 tokens) <br /> - **Output:** text and audio (16,384 tokens) <br /> - **Languages:** en <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
5758
| [gpt-4o](https://ai.azure.com/explore/models/gpt-4o/version/2024-11-20/registry/azure-openai) | chat-completion | Global standard <br />Standard<br />Batch<br />Provisioned<br />Global provisioned<br />Data Zone | - **Input:** text and image (131,072 tokens) <br /> - **Output:** text (16,384 tokens) <br /> - **Languages:** `en`, `it`, `af`, `es`, `de`, `fr`, `id`, `ru`, `pl`, `uk`, `el`, `lv`, `zh`, `ar`, `tr`, `ja`, `sw`, `cy`, `ko`, `is`, `bn`, `ur`, `ne`, `th`, `pa`, `mr`, and `te`. <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON, structured outputs |
5859
| [gpt-4o-mini](https://ai.azure.com/explore/models/gpt-4o-mini/version/2024-07-18/registry/azure-openai) | chat-completion | Global standard <br />Standard<br />Batch<br />Provisioned<br />Global provisioned<br />Data Zone | - **Input:** text, image, and audio (131,072 tokens) <br /> - **Output:** (16,384 tokens) <br /> - **Languages:** `en`, `it`, `af`, `es`, `de`, `fr`, `id`, `ru`, `pl`, `uk`, `el`, `lv`, `zh`, `ar`, `tr`, `ja`, `sw`, `cy`, `ko`, `is`, `bn`, `ur`, `ne`, `th`, `pa`, `mr`, and `te`. <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON, structured outputs |
@@ -90,6 +91,16 @@ Core42 includes autoregressive bi-lingual LLMs for Arabic & English with state-o
9091

9192
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=core42).
9293

94+
### DeepSeek
95+
96+
DeepSeek family of models include DeepSeek-R1, which excels at reasoning tasks using a step-by-step training process, such as language, scientific reasoning, and coding tasks.
97+
98+
| Model | Type | Tier | Capabilities |
99+
| ------ | ---- | --- | ------------ |
100+
| [DeekSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | Global standard | - **Input:** text (16,384 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. |
101+
102+
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
103+
93104
### Meta
94105

95106
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range is scale to include:
@@ -140,10 +151,10 @@ Mistral AI offers two categories of models: premium models including Mistral Lar
140151
| Model | Type | Tier | Capabilities |
141152
| ------ | ---- | --- | ------------ |
142153
| [Ministral-3B](https://ai.azure.com/explore/models/Ministral-3B/version/1/registry/azureml-mistral) | chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Languages:** fr, de, es, it, and en <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
143-
| [Mistral-large](https://ai.azure.com/explore/models/Mistral-large/version/1/registry/azureml-mistral) | chat-completion | Global standard | - **Input:** text (32,768 tokens) <br /> - **Output:** (4,096 tokens) <br /> - **Languages:** fr, de, es, it, and en <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
154+
| [Mistral-large](https://ai.azure.com/explore/models/Mistral-large/version/1/registry/azureml-mistral) <br /> (deprecated) | chat-completion | Global standard | - **Input:** text (32,768 tokens) <br /> - **Output:** (4,096 tokens) <br /> - **Languages:** fr, de, es, it, and en <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
144155
| [Mistral-small](https://ai.azure.com/explore/models/Mistral-small/version/1/registry/azureml-mistral) | chat-completion | Global standard | - **Input:** text (32,768 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Languages:** fr, de, es, it, and en <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
145156
| [Mistral-Nemo](https://ai.azure.com/explore/models/Mistral-Nemo/version/1/registry/azureml-mistral) | chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Languages:** en, fr, de, es, it, zh, ja, ko, pt, nl, and pl <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
146-
| [Mistral-large-2407](https://ai.azure.com/explore/models/Mistral-large-2407/version/1/registry/azureml-mistral) | chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** (4,096 tokens) <br /> - **Languages:** en, fr, de, es, it, zh, ja, ko, pt, nl, and pl <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
157+
| [Mistral-large-2407](https://ai.azure.com/explore/models/Mistral-large-2407/version/1/registry/azureml-mistral) <br /> (legacy) | chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** (4,096 tokens) <br /> - **Languages:** en, fr, de, es, it, zh, ja, ko, pt, nl, and pl <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
147158
| [Mistral-Large-2411](https://ai.azure.com/explore/models/Mistral-Large-2411/version/2/registry/azureml-mistral) | chat-completion | Global standard | - **Input:** text (128,000 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Languages:** en, fr, de, es, it, zh, ja, ko, pt, nl, and pl <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON |
148159
| [Codestral-2501](https://ai.azure.com/explore/models/Codestral-2501/version/2/registry/azureml-mistral) | chat-completion | Global standard | - **Input:** text (262,144 tokens) <br /> - **Output:** text (4,096 tokens) <br /> - **Languages:** en <br /> - **Tool calling:** No <br /> - **Response formats:** Text |
149160

articles/ai-foundry/model-inference/how-to/inference.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,6 +48,14 @@ For a chat model, you can create a request as follows:
4848

4949
If you specify a model name that doesn't match any given model deployment, you get an error that the model doesn't exist. You can control which models are available for users by creating model deployments as explained at [add and configure model deployments](create-model-deployments.md).
5050

51+
## Key-less authentication
52+
53+
Models deployed to Azure AI model inference in Azure AI Services support key-less authorization using Microsoft Entra ID. Key-less authorization enhances security, simplifies the user experience, reduces operational complexity, and provides robust compliance support for modern development. It makes it a strong choice for organizations adopting secure and scalable identity management solutions.
54+
55+
To use key-less authentication, [configure your resource and grant access to users](configure-entra-id.md) to perform inference. Once configured, then you can authenticate as follows:
56+
57+
[!INCLUDE [code-create-chat-client-entra](../includes/code-create-chat-client-entra.md)]
58+
5159
## Limitations
5260

5361
* Azure OpenAI Batch can't be used with the Azure AI model inference endpoint. You have to use the dedicated deployment URL as explained at [Batch API support in Azure OpenAI documentation](../../../ai-services/openai/how-to/batch.md#api-support).
@@ -56,4 +64,4 @@ If you specify a model name that doesn't match any given model deployment, you g
5664
## Next steps
5765

5866
* [Use embedding models](use-embeddings.md)
59-
* [Use chat completion models](use-chat-completions.md)
67+
* [Use chat completion models](use-chat-completions.md)

0 commit comments

Comments
 (0)