Skip to content

Commit 49e543f

Browse files
committed
update
1 parent b02c280 commit 49e543f

File tree

6 files changed

+9
-1
lines changed

6 files changed

+9
-1
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -38,12 +38,16 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
3838
| Model | Region |
3939
|---|---|
4040
| `gpt-4.1` (2025-04-14) | East US2 (Global Standard), Sweden Central (Global Standard) |
41+
| `gpt-4.1-nano` (2025-04-14) | East US2 (Global Standard), Sweden Central (Global Standard)|
4142

4243
### Capabilities
4344

4445
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
4546
| --- | :--- |:--- |:---|:---: |
4647
| `gpt-4.1` (2025-04-14) <br> <br> **Latest model from Azure OpenAI** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
48+
| `gpt-4.1-nano` (2025-04-14) <br><br> **Fastest 4.1 model** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 128,000<sup>*</sup> | 32,768 | May 31, 2024 |
49+
50+
<sup>*</sup> It is expected behavior that the context window for this model on Azure OpenAI is currently smaller than the context limit for OpenAI. We will update this page once any changes are made to this value.
4751

4852
## computer-use-preview
4953

articles/ai-services/openai/how-to/function-calling.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,6 +41,7 @@ At a high level you can break down working with functions into three steps:
4141
* `gpt-4o-mini` (`2024-07-18`)
4242
* `gpt-4.5-preview` (`2025-02-27`)
4343
* `gpt-4.1` (`2025-14-2025`)
44+
* `gpt-4.1-nano` (`2025-14-2025`)
4445

4546
Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)
4647

articles/ai-services/openai/how-to/prompt-caching.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,7 @@ Currently only the following models support prompt caching with Azure OpenAI:
3232
- `gpt-4o-realtime-preview` (version 2024-12-17)
3333
- `gpt-4o-mini-realtime-preview` (version 2024-12-17)
3434
- `gpt-4.1-2025-04-14`
35+
- `gpt-4.1-nano-2025-04-14`
3536

3637
> [!NOTE]
3738
> Prompt caching is now also available as part of model fine-tuning for `gpt-4o` and `gpt-4o-mini`. Refer to the fine-tuning section of the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) for details.

articles/ai-services/openai/how-to/responses.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,7 @@ The responses API is currently available in the following regions:
4444
- `gpt-4o-mini` (Version: `2024-07-18`)
4545
- `computer-use-preview`
4646
- `gpt-4.1` (Version: `2025-04-14`)
47+
- `gpt-4.1-nano` (Version: `2025-04-14`)
4748

4849
Not every model is available in the regions supported by the responses API. Check the [models page](../concepts/models.md) for model region availability.
4950

articles/ai-services/openai/how-to/structured-outputs.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,7 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
3232
- `gpt-4o` version: `2024-08-06`
3333
- `gpt-4o` version: `2024-11-20`
3434
- `gpt-4.1` version `2025-04-14`
35+
- `gpt-4.1-nano` version (`2025-14-2025`)
3536

3637
## API support
3738

articles/ai-services/openai/whats-new.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ This article provides a summary of the latest releases and major documentation u
2323

2424
### GPT-4.1 released
2525

26-
The latest model from Azure OpenAI with a 1 million token context limit. For more information, see the [models page](./concepts/models.md#gpt-41-series).
26+
GPT 4.1 and GPT 4.1-nano are now available. These are the latest models from Azure OpenAI. GPT 4.1 has a 1 million token context limit. For more information, see the [models page](./concepts/models.md#gpt-41-series).
2727

2828
## March 2025
2929

0 commit comments

Comments
 (0)