Skip to content

Commit 3b89b6e

Browse files
committed
Update fine-tuned AOAI deployment details.
1 parent 75a2075 commit 3b89b6e

File tree

1 file changed

+36
-21
lines changed

1 file changed

+36
-21
lines changed

articles/ai-foundry/openai/how-to/fine-tuning-deploy.md

Lines changed: 36 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -361,37 +361,52 @@ Azure OpenAI fine-tuning supports the following deployment types.
361361

362362
### Standard
363363

364-
[Standard deployments](./deployment-types.md#standard) provides a pay-per-call billing model, and the model available in each region as well as throughput may be limited.
365-
366-
| Models | Region |
367-
|--|--|
368-
|GPT-4o-finetune|East US2, North Central US, Sweden Central|
369-
|gpt-4o-mini-2024-07-18|North Central US, Sweden Central|
370-
|GPT-4-finetune|North Central US, Sweden Central|
371-
|GPT-35-Turbo-finetune|East US2, North Central US, Sweden Central, Switzerland West|
372-
|GPT-35-Turbo-1106-finetune|East US2, North Central US, Sweden Central, Switzerland West|
373-
|GPT-35-Turbo-0125-finetune|East US2, North Central US, Sweden Central, Switzerland West|
364+
[Standard deployments](./deployment-types.md#standard) provide a pay-per-token billing model with data residency confined to the deployed region.
365+
366+
| Models | East US2 | North Central US | Sweden Central | Switzerland West |
367+
|--------------------|:--------:|:----------------:|:--------------:|:----------------:|
368+
|o4-mini || || |
369+
|GPT-4.1 | ||| |
370+
|GPT-4.1-mini | ||| |
371+
|GPT-4.1-nano | ||| |
372+
|GPT-4o || || |
373+
|GPT-4o-mini | ||| |
374+
|GPT-35-Turbo (1106) |||||
375+
|GPT-35-Turbo (0125) |||||
374376

375377
### Global Standard
376378

377379
[Global standard](./deployment-types.md#global-standard) fine-tuned deployments offer [cost savings](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/), but custom model weights may temporarily be stored outside the geography of your Azure OpenAI resource.
378380

379-
| Models | Region |
380-
|--|--|
381-
|GPT-4.1-finetune|East US2, North Central US, and Sweden Central|
382-
|GPT-4.1-mini-finetune|East US2, North Central US, and Sweden Central|
383-
|GPT-4.1-nano-finetune|East US2, North Central US, and Sweden Central|
384-
|GPT-4o-finetune|East US2, North Central US, and Sweden Central|
385-
|GPT-4o-mini-finetune|East US2, North Central US, and Sweden Central|
381+
Global standard deployments are available from all Azure OpenAI regions for the following models:
382+
383+
* o4-mini
384+
* GPT-4.1
385+
* GPT-4.1-mini
386+
* GPT-4.1-nano
387+
* GPT-4o
388+
* GPT-4o-mini
386389

387390
:::image type="content" source="../media/fine-tuning/global-standard.png" alt-text="Screenshot of the global standard deployment user experience with a fine-tuned model." lightbox="../media/fine-tuning/global-standard.png":::
388391

392+
### Developer Tier
393+
394+
[Developer](./deployment-types.md#developer-for-fine-tuned-models) fine-tuned deployments offer a similar experience as [Global Standard](#global-standard) without an hourly hosting fee, but do not offer an availability SLA. Developer deployments are designed for model candidate evaluation and not for production use.
395+
396+
Developer deployments are available from all Azure OpenAI regions for the following models:
397+
398+
* GPT-4.1
399+
* GPT-4.1-mini
400+
* GPT-4.1-nano
401+
402+
389403
### Provisioned Throughput
390404

391-
| Models | Region |
392-
|--|--|
393-
|GPT-4o-finetune|North Central US, Sweden Central|
394-
|GPT-4o-mini-finetune|North Central US, Sweden Central|
405+
| Models | North Central US | Sweden Central |
406+
|--------------|:----------------:|:--------------:|
407+
| GPT-4.1 | ||
408+
| GPT-4o |||
409+
| GPT-4o-mini |||
395410

396411
[Provisioned throughput](./deployment-types.md#regional-provisioned) fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md) for latency-sensitive agents and applications. They use the same regional provisioned throughput (PTU) capacity as base models, so if you already have regional PTU quota you can deploy your fine-tuned model in support regions.
397412

0 commit comments

Comments
 (0)