Commit 982447c

Merge pull request #4228 from voutilad/ft-4.1
Add initial details for 4.1 and 4.1-mini
2 parents 46b8d30 + 935872e commit 982447c

2 files changed: +10 -10 lines changed
articles/ai-services/openai/how-to/fine-tuning-deploy.md

Lines changed: 2 additions & 4 deletions
@@ -370,15 +370,15 @@ Azure OpenAI fine-tuning supports the following deployment types.
 
 | Models | Region |
 |--|--|
+|GPT-4.1-finetune|East US2, North Central US, and Sweden Central|
+|GPT-4.1-mini-finetune|East US2, North Central US, and Sweden Central|
 |GPT-4o-finetune|East US2, North Central US, and Sweden Central|
 |GPT-4o-mini-finetune|East US2, North Central US, and Sweden Central|
 
 [Global standard](./deployment-types.md#global-standard) fine-tuned deployments offer [cost savings](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/), but custom model weights may temporarily be stored outside the geography of your Azure OpenAI resource.
 
 :::image type="content" source="../media/fine-tuning/global-standard.png" alt-text="Screenshot of the global standard deployment user experience with a fine-tuned model." lightbox="../media/fine-tuning/global-standard.png":::
 
-Global Standard fine-tuned deployments currently support structured outputs only on GPT-4o.
-
 ### Provisioned Managed (preview)
 
 | Models | Region |
@@ -391,8 +391,6 @@ Global Standard fine-tuned deployments currently support structured outputs only
 
 [Provisioned managed](./deployment-types.md#provisioned) fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md) for fine-tuned deployments. As part of public preview, provisioned managed deployments may be created regionally via the data-plane [REST API](../reference.md#data-plane-inference) version `2024-10-01` or newer. See below for examples.
 
-Provisioned Managed fine-tuned deployments currently support structured outputs only on GPT-4o.
-
 #### Creating a Provisioned Managed deployment
 
 To create a new deployment, make an HTTP PUT call via the [Deployments - Create or Update REST API](/rest/api/aiservices/accountmanagement/deployments/create-or-update?view=rest-aiservices-accountmanagement-2024-10-01&tabs=HTTP&preserve-view=true). The approach is similar to performing [cross region deployment](#cross-region-deployment) with the following exceptions:
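
Reviewer note: the context lines above describe an HTTP PUT against the Deployments - Create or Update REST API at version `2024-10-01`. A minimal Python sketch of how that request might be assembled follows. The URL layout, the `ProvisionedManaged` SKU name, the body shape, and the default capacity are assumptions for illustration and are not taken from this diff; the helper name `build_deployment_request` and all argument values are hypothetical. The sketch only builds the request and sends nothing.

```python
import json

# Assumption: management endpoint and path layout for the
# "Deployments - Create or Update" call; verify against the REST reference.
ARM_BASE = "https://management.azure.com"
API_VERSION = "2024-10-01"  # version named in the documentation text above


def build_deployment_request(subscription_id, resource_group, account_name,
                             deployment_name, model_name,
                             model_version="1", capacity=25):
    """Return (url, body) for the PUT call. Nothing is sent."""
    url = (
        f"{ARM_BASE}/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}"
        f"/providers/Microsoft.CognitiveServices/accounts/{account_name}"
        f"/deployments/{deployment_name}?api-version={API_VERSION}"
    )
    # Assumption: SKU name and capacity value are illustrative placeholders.
    body = {
        "sku": {"name": "ProvisionedManaged", "capacity": capacity},
        "properties": {
            "model": {
                "format": "OpenAI",
                "name": model_name,
                "version": model_version,
            }
        },
    }
    return url, body


if __name__ == "__main__":
    # Hypothetical identifiers; replace with real values before use.
    url, body = build_deployment_request(
        "<SUBSCRIPTION_ID>", "<RESOURCE_GROUP>", "<RESOURCE_NAME>",
        "<DEPLOYMENT_NAME>", "<FINE_TUNED_MODEL_NAME>",
    )
    print("PUT", url)
    print(json.dumps(body, indent=2))
```

The returned pair could be passed to any HTTP client together with a bearer token; authentication is left out here because the diff does not describe it.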

articles/ai-services/openai/includes/fine-tune-models.md

Lines changed: 8 additions & 6 deletions
@@ -15,9 +15,11 @@ manager: nitinme
 >
 > The supported regions for fine-tuning might vary if you use Azure OpenAI models in an Azure AI Foundry project versus outside a project.
 
-| Model ID | Fine-tuning regions | Max request (tokens) | Training Data (up to) |
-| --- | --- | :---: | :---: |
-| `gpt-35-turbo` (1106) | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | Input: 16,385<br> Output: 4,096 | Sep 2021|
-| `gpt-35-turbo` (0125) | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | 16,385 | Sep 2021 |
-| `gpt-4o-mini` (2024-07-18) | North Central US <br> Sweden Central | Input: 128,000 <br> Output: 16,384 <br> Training example context length: 64,536 | Oct 2023 |
-| `gpt-4o` (2024-08-06) | East US2 <br> North Central US <br> Sweden Central | Input: 128,000 <br> Output: 16,384 <br> Training example context length: 64,536 | Oct 2023 |
+| Model ID | Fine-tuning regions | Max request (tokens) | Training Data (up to) | Modality |
+| --- | --- | :---: | :---: | --- |
+| `gpt-35-turbo` (1106) | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | Input: 16,385<br> Output: 4,096 | Sep 2021 | Text to Text |
+| `gpt-35-turbo` (0125) | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | 16,385 | Sep 2021 | Text to Text |
+| `gpt-4o-mini` (2024-07-18) | North Central US <br> Sweden Central | Input: 128,000 <br> Output: 16,384 <br> Training example context length: 65,536 | Oct 2023 | Text to Text |
+| `gpt-4o` (2024-08-06) | East US2 <br> North Central US <br> Sweden Central | Input: 128,000 <br> Output: 16,384 <br> Training example context length: 65,536 | Oct 2023 | Text & Vision to Text |
+| `gpt-4.1` (2025-04-14) | North Central US <br> Sweden Central | Input: 128,000 <br> Output: 16,384 <br> Training example context length: 65,536 | May 2024 | Text to Text |
+| `gpt-4.1-mini` (2025-04-14) | North Central US <br> Sweden Central | Input: 128,000 <br> Output: 16,384 <br> Training example context length: 65,536 | May 2024 | Text to Text |
