Skip to content

Commit d9a76ea

Browse files
Merge pull request #4140 from mrbullwinkle/mrb_04_15_2025_nano
[Azure OpenAI] GPT 4.1 nano
2 parents 8378d0d + f5d50e6 commit d9a76ea

File tree

6 files changed

+8
-1
lines changed

6 files changed

+8
-1
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -38,12 +38,15 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
3838
| Model | Region |
3939
|---|---|
4040
| `gpt-4.1` (2025-04-14) | East US2 (Global Standard), Sweden Central (Global Standard) |
41+
| `gpt-4.1-nano` (2025-04-14) | East US2 (Global Standard), Sweden Central (Global Standard)|
4142

4243
### Capabilities
4344

4445
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
4546
| --- | :--- |:--- |:---|:---: |
4647
| `gpt-4.1` (2025-04-14) <br> <br> **Latest model from Azure OpenAI** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
48+
| `gpt-4.1-nano` (2025-04-14) <br><br> **Fastest 4.1 model** | - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
49+
4750

4851
## computer-use-preview
4952

articles/ai-services/openai/how-to/function-calling.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,6 +41,7 @@ At a high level you can break down working with functions into three steps:
4141
* `gpt-4o-mini` (`2024-07-18`)
4242
* `gpt-4.5-preview` (`2025-02-27`)
4343
* `gpt-4.1` (`2025-14-2025`)
44+
* `gpt-4.1-nano` (`2025-14-2025`)
4445

4546
Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)
4647

articles/ai-services/openai/how-to/prompt-caching.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,7 @@ Currently only the following models support prompt caching with Azure OpenAI:
3232
- `gpt-4o-realtime-preview` (version 2024-12-17)
3333
- `gpt-4o-mini-realtime-preview` (version 2024-12-17)
3434
- `gpt-4.1-2025-04-14`
35+
- `gpt-4.1-nano-2025-04-14`
3536

3637
> [!NOTE]
3738
> Prompt caching is now also available as part of model fine-tuning for `gpt-4o` and `gpt-4o-mini`. Refer to the fine-tuning section of the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) for details.

articles/ai-services/openai/how-to/responses.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,7 @@ The responses API is currently available in the following regions:
4444
- `gpt-4o-mini` (Version: `2024-07-18`)
4545
- `computer-use-preview`
4646
- `gpt-4.1` (Version: `2025-04-14`)
47+
- `gpt-4.1-nano` (Version: `2025-04-14`)
4748

4849
Not every model is available in the regions supported by the responses API. Check the [models page](../concepts/models.md) for model region availability.
4950

articles/ai-services/openai/how-to/structured-outputs.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,7 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
3232
- `gpt-4o` version: `2024-08-06`
3333
- `gpt-4o` version: `2024-11-20`
3434
- `gpt-4.1` version `2025-04-14`
35+
- `gpt-4.1-nano` version (`2025-14-2025`)
3536

3637
## API support
3738

articles/ai-services/openai/whats-new.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ This article provides a summary of the latest releases and major documentation u
2323

2424
### GPT-4.1 released
2525

26-
The latest model from Azure OpenAI with a 1 million token context limit. For more information, see the [models page](./concepts/models.md#gpt-41-series).
26+
GPT 4.1 and GPT 4.1-nano are now available. These are the latest models from Azure OpenAI. GPT 4.1 has a 1 million token context limit. For more information, see the [models page](./concepts/models.md#gpt-41-series).
2727

2828
## March 2025
2929

0 commit comments

Comments
 (0)