Commit 2dd40d2

Merge pull request #4097 from MicrosoftDocs/main
OOB Publish - 04/14 - ASAP
2 parents 3ccbd78 + 53958ec commit 2dd40d2

10 files changed, 80 additions and 54 deletions

articles/ai-foundry/model-inference/includes/create-model-deployments/cli.md

Lines changed: 4 additions & 6 deletions
@@ -48,15 +48,16 @@ To add a model, you first need to identify the model that you want to deploy. Yo
 ```azurecli
 accountName="<ai-services-resource-name>"
 resourceGroupName="<resource-group>"
+location="eastus2"
 ```
 
 3. If you don't have an Azure AI Services account create yet, you can create one as follows:
 
 ```azurecli
-az cognitiveservices account create -n $accountName -g $resourceGroupName --custom-domain $accountName
+az cognitiveservices account create -n $accountName -g $resourceGroupName --custom-domain $accountName --location $location --kind AIServices --sku S0
 ```
 
-4. Let's see first which models are available to you and under which SKU. The following command list all the model definitions available:
+4. Let's see first which models are available to you and under which SKU. SKUs, also known as [deployment types](../../concepts/deployment-types.md), define how Azure infrastructure is used to process requests. Models may offer different deployment types. The following command list all the model definitions available:
 
 ```azurecli
 az cognitiveservices account list-models \
@@ -77,10 +78,7 @@ To add a model, you first need to identify the model that you want to deploy. Yo
 }
 ```
 
-6. Identify the model you want to deploy. You need the properties `name`, `format`, `version`, and `sku`. Capacity might also be needed depending on the type of deployment.
-
-> [!TIP]
-> Notice that not all the models are available in all the SKUs.
+6. Identify the model you want to deploy. You need the properties `name`, `format`, `version`, and `sku`. The property `format` indicates the provider offering the model. Capacity might also be needed depending on the type of deployment.
 
 7. Add the model deployment to the resource. The following example adds `Phi-3.5-vision-instruct`:
 
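The updated `az cognitiveservices account create` line in this file now passes `--location`, `--kind AIServices`, and `--sku S0`. As a rough illustration for anyone scripting that step, the sketch below assembles the same argument list in Python. The helper name `build_account_create_command` and the sample resource names are hypothetical. Only flags that appear in the diff are used, and the sketch builds the command without running it.

```python
import shlex


def build_account_create_command(account_name, resource_group, location="eastus2"):
    """Return the argv list for the account-creation command shown in the diff.

    Hypothetical helper: it only assembles the flags that appear in the
    updated cli.md line and does not invoke the Azure CLI itself.
    """
    return [
        "az", "cognitiveservices", "account", "create",
        "-n", account_name,
        "-g", resource_group,
        "--custom-domain", account_name,
        "--location", location,
        "--kind", "AIServices",
        "--sku", "S0",
    ]


if __name__ == "__main__":
    # Sample names are placeholders, not real resources.
    argv = build_account_create_command("my-ai-services", "my-resource-group")
    # Print a shell-quoted form for review before running it anywhere.
    print(shlex.join(argv))
```

Returning a list rather than a single string keeps each value as its own argument, so names with spaces or shell metacharacters would not need manual quoting if the list were later handed to `subprocess.run`.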

articles/ai-services/language-service/whats-new.md

Lines changed: 8 additions & 8 deletions
@@ -17,14 +17,14 @@ Azure AI Language is updated on an ongoing basis. To stay up-to-date with recent
 ## March 2025
 
 * Azure AI Language resource now can be deployed to 3 new regions, Jio India Central, UK West and Canada East, for the following capabilities:
-* Language detection
-* Sentiment analysis
-* Key phrase extraction
-* Named entity recognition (NER)
-* Personally identifiable information (PII) entity recognition
-* Entity linking
-* Text analytics for health
-* Extractive text summarization
+* Language detection
+* Sentiment analysis
+* Key phrase extraction
+* Named entity recognition (NER)
+* Personally identifiable information (PII) entity recognition
+* Entity linking
+* Text analytics for health
+* Extractive text summarization
 
 * Back-end infrastructure for the Named entity recognition (NER) and Text Personally identifiable information (PII) entity recognition models is now updated with extended context window limits.
 

articles/ai-services/openai/concepts/models.md

Lines changed: 18 additions & 3 deletions
@@ -4,11 +4,11 @@ titleSuffix: Azure OpenAI
 description: Learn about the different model capabilities that are available with Azure OpenAI.
 ms.service: azure-ai-openai
 ms.topic: conceptual
-ms.date: 04/01/2025
+ms.date: 04/14/2025
 ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
 manager: nitinme
 author: mrbullwinkle #ChrisHMSFT
-ms.author: mbullwin #chrhoder
+ms.author: mbullwin #chrhoder#
 recommendations: false
 ---
 
@@ -18,6 +18,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
 
 | Models | Description |
 |--|--|
+| [GPT-4.1 series](#gpt-41-series) | Latest model release from Azure OpenAI |
 | [computer-use-preview](#computer-use-preview) | An experimental model trained for use with the Responses API computer use tool. |
 | [GPT-4.5 Preview](#gpt-45-preview) |The latest GPT model that excels at diverse text and image tasks. |
 | [o-series models](#o-series-models) |[Reasoning models](../how-to/reasoning.md) with advanced problem-solving and increased focus and capability. |
@@ -30,6 +31,20 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
 | [Whisper](#whisper-models) | A series of models in preview that can transcribe and translate speech to text. |
 | [Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
 
+## GPT 4.1 series
+
+### Region Availability
+
+| Model | Region |
+|---|---|
+| `gpt-4.1` (2025-04-14) | East US2 (Global Standard), Sweden Central (Global Standard) |
+
+### Capabilities
+
+| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
+| --- | :--- |:--- |:---|:---: |
+| `gpt-4.1` (2025-04-14) <br> <br> **Latest model from Azure OpenAI** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
+
 ## computer-use-preview
 
 An experimental model trained for use with the [Responses API](../how-to/responses.md) computer use tool. It can be used in conjunction with 3rd-party libraries to allow the model to control mouse & keyboard input while getting context from screenshots of the current environment.
@@ -70,7 +85,7 @@ Once access has been granted, you will need to create a deployment for the model
 
 | Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
 | --- | :--- |:--- |:---|:---: |
-| `gpt-4.5-preview` (2025-02-27) <br> **GPT-4.5 Preview** | The **latest GPT model** that excels at diverse text and image tasks. <br>-Structured outputs <br>-Prompt caching <br>-Tools <br>-Streaming<br>-Text(input/output)<br>- Image(input) | 128,000 | 16,384 | Oct 2023 |
+| `gpt-4.5-preview` (2025-02-27) <br> **GPT-4.5 Preview** | [GPT 4.1](#gpt-41-series) is the recommended replacement for this model. Excels at diverse text and image tasks. <br>-Structured outputs <br>-Prompt caching <br>-Tools <br>-Streaming<br>-Text(input/output)<br>- Image(input) | 128,000 | 16,384 | Oct 2023 |
 
 > [!NOTE]
 > It is expected behavior that the model cannot answer questions about itself. If you want to know when the knowledge cutoff for the model's training data is, or other details about the model you should refer to the model documentation above.
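The `gpt-4.1` capabilities row added in this file lists a context window of 1,047,576 tokens and a maximum of 32,768 output tokens. The sketch below shows one way a client might budget a prompt against those two numbers. It assumes, which the diff does not state, that prompt and output tokens share the same context window. The function name `max_prompt_tokens` is hypothetical.

```python
# Limits for gpt-4.1 (2025-04-14), copied from the capabilities row in the diff.
GPT_41_CONTEXT_WINDOW = 1_047_576
GPT_41_MAX_OUTPUT_TOKENS = 32_768


def max_prompt_tokens(requested_output_tokens):
    """Return the largest prompt size that still leaves room for the output.

    Assumption (not stated in the diff): prompt and output tokens are
    counted against the same context window.
    """
    if not 0 < requested_output_tokens <= GPT_41_MAX_OUTPUT_TOKENS:
        raise ValueError(
            "requested output must be between 1 and "
            f"{GPT_41_MAX_OUTPUT_TOKENS} tokens"
        )
    return GPT_41_CONTEXT_WINDOW - requested_output_tokens


if __name__ == "__main__":
    # Reserve the full output allowance and report what is left for the prompt.
    print(max_prompt_tokens(GPT_41_MAX_OUTPUT_TOKENS))
```

Under that assumption, reserving the full 32,768 output tokens leaves 1,014,808 tokens for the prompt.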

articles/ai-services/openai/quotas-limits.md

Lines changed: 12 additions & 4 deletions
@@ -9,7 +9,7 @@ ms.custom:
 - ignite-2023
 - references_regions
 ms.topic: conceptual
-ms.date: 4/09/2025
+ms.date: 4/14/2025
 ms.author: mbullwin
 ---
 
@@ -60,6 +60,14 @@ The following sections provide you with a quick guide to the default quotas and
 
 [!INCLUDE [Quota](./includes/global-batch-limits.md)]
 
+## GPT 4.1 series
+
+| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
+|---|---|:---:|:---:|
+| `gpt-4.1` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
+| `gpt-4.1` (2025-04-14) | Default | 1 M | 1 K |
+
+
 ## computer-use-preview global standard
 
 | Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
@@ -199,9 +207,9 @@ If your Azure subscription is linked to certain [offer types](https://azure.micr
 
 |Tier| Quota Limit in tokens per minute (TPM) |
 |---|:---|
-|`Azure for Students` | 1 K (all models) <br>Exception o-series & GPT 4.5 Preview: 0|
-| `MSDN` | GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K <br>computer-use-preview: 8 K <br> gpt-4o-realtime-preview: 1 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 |
-|`Pay-as-you-go` | GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K <br>computer-use-preview: 30 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 |
+|`Azure for Students` | 1 K (all models) <br>Exception o-series & GPT-4.1 & GPT 4.5 Preview: 0|
+| `MSDN` | GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K <br>computer-use-preview: 8 K <br> gpt-4o-realtime-preview: 1 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 <br> GPT-4.1: 0 |
+|`Pay-as-you-go` | GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K <br>computer-use-preview: 30 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 <br> GPT-4.1: 0 |
 | `Azure_MS-AZR-0111P` <br> `Azure_MS-AZR-0035P` <br> `Azure_MS-AZR-0025P` <br> `Azure_MS-AZR-0052P` <br>| GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K |
 | `CSP Integration Sandbox` <sup>*</sup> | All models: 0 |
 | `Lightweight trial`<br>`Free Trials`<br>`Azure Pass` | All models: 0 |
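The new GPT 4.1 quota table in this file pairs a tokens-per-minute (TPM) limit with a requests-per-minute limit for each tier. The sketch below shows how a planned workload might be checked against both limits at once. It reads "5 M" and "1 K" as 5,000,000 and 1,000, and it takes the average request size as a caller-supplied estimate. The names `GPT_41_QUOTAS` and `fits_quota` are hypothetical, and the offer-type overrides in the later table (for example `GPT-4.1: 0` on Pay-as-you-go) are not modeled.

```python
# (tokens per minute, requests per minute) for gpt-4.1 (2025-04-14),
# transcribed from the GPT 4.1 series quota table in the diff.
GPT_41_QUOTAS = {
    "Enterprise Tier": (5_000_000, 5_000),
    "Default": (1_000_000, 1_000),
}


def fits_quota(tier, requests_per_minute, avg_tokens_per_request):
    """Return True if the workload stays within both limits for the tier.

    avg_tokens_per_request is the caller's own estimate of tokens
    consumed per request; how the service counts tokens is not modeled.
    """
    tpm_limit, rpm_limit = GPT_41_QUOTAS[tier]
    tokens_per_minute = requests_per_minute * avg_tokens_per_request
    return requests_per_minute <= rpm_limit and tokens_per_minute <= tpm_limit


if __name__ == "__main__":
    # 500 requests/minute at roughly 2,500 tokens each is 1.25 M TPM.
    for tier in GPT_41_QUOTAS:
        print(tier, fits_quota(tier, 500, 2_500))
```

A workload can stay within the request limit and still exceed the token limit, which is why both are checked.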

articles/ai-services/openai/whats-new.md

Lines changed: 8 additions & 2 deletions
@@ -4,21 +4,27 @@ titleSuffix: Azure AI services
 description: Learn about the latest news and features updates for Azure OpenAI.
 manager: nitinme
 author: mrbullwinkle
-ms.author: mbullwin
+ms.author: mbullwin #
 ms.service: azure-ai-openai
 ms.custom:
 - ignite-2023
 - references_regions
 - ignite-2024
 ms.topic: whats-new
-ms.date: 03/05/2025
+ms.date: 04/14/2025
 recommendations: false
 ---
 
 # What's new in Azure OpenAI Service
 
 This article provides a summary of the latest releases and major documentation updates for Azure OpenAI Service.
 
+## April 2025
+
+### GPT-4.1 released
+
+The latest model from Azure OpenAI with a 1 million token context limit. For more information, see the [models page](./concepts/models.md#gpt-41-series).
+
 ## March 2025
 
 ### Responses API & computer-use-preview model

articles/ai-services/speech-service/batch-transcription-audio-data.md

Lines changed: 0 additions & 1 deletion
@@ -39,7 +39,6 @@ The batch transcription API (and [fast transcription API](./fast-transcription-c
 - MULAW in WAV container
 - AMR
 - WebM
-- M4A
 - SPEEX
 
 

articles/ai-services/speech-service/fast-transcription-create.md

Lines changed: 1 addition & 1 deletion
@@ -24,7 +24,7 @@ Unlike the batch transcription API, fast transcription API only produces transcr
 
 - An Azure AI Speech resource in one of the regions where the fast transcription API is available. The supported regions are: **Australia East**, **Brazil South**, **Central India**, **East US**, **East US 2**, **French Central**, **Japan East**, **North Central US**, **North Europe**, **South Central US**, **Southeast Asia**, **Sweden Central**, **UK South**, **West Europe**, **West US**, **West US 2**, **West US 3**. For more information about regions supported for other Speech service features, see [Speech service regions](./regions.md).
 
-- An audio file (less than 2 hours long and less than 200 MB in size) in one of the formats and codecs supported by the batch transcription API: WAV, MP3, OPUS/OGG, FLAC, WMA, AAC, ALAW in WAV container, MULAW in WAV container, AMR, WebM, M4A, and SPEEX. For more information about supported audio formats, see [supported audio formats](./batch-transcription-audio-data.md#supported-audio-formats-and-codecs).
+- An audio file (less than 2 hours long and less than 200 MB in size) in one of the formats and codecs supported by the batch transcription API: WAV, MP3, OPUS/OGG, FLAC, WMA, AAC, ALAW in WAV container, MULAW in WAV container, AMR, WebM, and SPEEX. For more information about supported audio formats, see [supported audio formats](./batch-transcription-audio-data.md#supported-audio-formats-and-codecs).
 
 ## Use the fast transcription API
 
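The revised prerequisite in this file drops M4A and keeps the limits of less than 2 hours and less than 200 MB. The sketch below shows a pre-upload check a client might run against those three conditions. It treats the format as a label supplied by the caller rather than detecting it from the file, and it assumes 1 MB means 1024 × 1024 bytes, which the diff does not specify. The name `is_eligible_for_fast_transcription` is hypothetical.

```python
# Formats listed in the updated prerequisite (M4A was removed by this commit).
SUPPORTED_FORMATS = frozenset({
    "WAV", "MP3", "OPUS/OGG", "FLAC", "WMA", "AAC",
    "ALAW in WAV container", "MULAW in WAV container",
    "AMR", "WebM", "SPEEX",
})

MAX_DURATION_SECONDS = 2 * 60 * 60   # "less than 2 hours long"
MAX_SIZE_BYTES = 200 * 1024 * 1024   # "less than 200 MB"; assumes MB = 1024 * 1024 bytes


def is_eligible_for_fast_transcription(audio_format, duration_seconds, size_bytes):
    """Check a file's declared format, duration, and size against the stated limits.

    audio_format must match one of the names in SUPPORTED_FORMATS exactly;
    detecting the real container or codec is outside this sketch.
    """
    return (
        audio_format in SUPPORTED_FORMATS
        and duration_seconds < MAX_DURATION_SECONDS
        and size_bytes < MAX_SIZE_BYTES
    )


if __name__ == "__main__":
    one_hour = 60 * 60
    fifty_mb = 50 * 1024 * 1024
    print(is_eligible_for_fast_transcription("MP3", one_hour, fifty_mb))
    print(is_eligible_for_fast_transcription("M4A", one_hour, fifty_mb))
```

Both limits are strict, so a file of exactly 2 hours or exactly 200 MB is rejected by this check.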
