Commit 1609299

update
1 parent 6c8d406 commit 1609299

6 files changed, +173 -23 lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 28 additions & 23 deletions
Original file line number | Diff line number | Diff line change
@@ -85,6 +85,14 @@ You can also use the OpenAI text to speech voices via Azure AI Speech. To learn
8585

8686
## Model summary table and region availability
8787

88+
### Standard deployment model availability
89+
90+
[!INCLUDE [Standard Models](includes/model-matrix/standard-models.md)]
91+
92+
### Standard deployment model quota
93+
94+
[!INCLUDE [Quota](includes/model-matrix/quota.md)]
95+
8896
### GPT-4 and GPT-4 Turbo Preview models
8997

9098
GPT-4, GPT-4-32k, and GPT-4 Turbo with Vision are now available to all Azure OpenAI Service customers. Availability varies by region. If you don't see GPT-4 in your region, please check back later.
@@ -128,13 +136,7 @@ GPT-4 version 0125-preview is an updated version of the GPT-4 Turbo preview prev
128136

129137
#### Public cloud regions
130138

131-
| Model | Regions where model is available to all subscriptions with Azure OpenAI access | Regions where model is available only to subscriptions with previous access to that model/region |
132-
|---|:---|:---|
133-
| gpt-4 (0314) | | East US <br> France Central <br> South Central US <br> UK South |
134-
| gpt-4 (0613) | Australia East <br> Canada East <br> France Central <br> Sweden Central <br> Switzerland North | East US <br> East US 2 <br> Japan East <br> UK South |
135-
| gpt-4 (1106-Preview) | Australia East <br> Canada East <br> East US 2 <br> France Central <br> Norway East <br> South India <br> Sweden Central <br> UK South <br> West US | |
136-
| gpt-4 (0125-Preview) | East US <br> North Central US <br> South Central US <br> |
137-
| gpt-4 (vision-preview) | Sweden Central <br> West US <br> Japan East <br> Switzerland North <br> Australia East| |
139+
[!INCLUDE [GPT-4](includes/model-matrix/standard-gpt-4.md)]
138140

139141
#### Azure Government regions
140142

@@ -144,7 +146,6 @@ The following GPT-4 models are available with [Azure Government](/azure/azure-go
144146
|--|--|
145147
| `gpt-4` (1106-Preview) | US Gov Virginia<br>US Gov Arizona |
146148

147-
148149
### GPT-3.5 models
149150

150151
> [!IMPORTANT]
@@ -159,19 +160,20 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
159160
> [!NOTE]
160161
> Version `0613` of `gpt-35-turbo` and `gpt-35-turbo-16k` will be retired no earlier than June 13, 2024. Version `0301` of `gpt-35-turbo` will be retired no earlier than July 5, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
161162
162-
### GPT-3.5-Turbo model availability
163+
| Model ID | Max Request (tokens) | Training Data (up to) |
164+
| --------- |:------:|:----:|
165+
| `gpt-35-turbo`**<sup>1</sup>** (0301) | 4,096 | Sep 2021 |
166+
| `gpt-35-turbo` (0613) | 4,096 | Sep 2021 |
167+
| `gpt-35-turbo-16k` (0613) | 16,384 | Sep 2021 |
168+
| `gpt-35-turbo-instruct` (0914) | 4,097 |Sep 2021 |
169+
| `gpt-35-turbo` (1106) | Input: 16,385<br> Output: 4,096 | Sep 2021|
170+
|`gpt-35-turbo` (0125) **NEW** | 16,385 | Sep 2021 |
163171

172+
### GPT-3.5-Turbo model availability
164173

165174
#### Public cloud regions
166175

167-
| Model ID | Model Availability | Max Request (tokens) | Training Data (up to) |
168-
| --------- | -------------------- |:------:|:----:|
169-
| `gpt-35-turbo`**<sup>1</sup>** (0301) | East US <br> France Central <br> South Central US <br> UK South <br> West Europe | 4,096 | Sep 2021 |
170-
| `gpt-35-turbo` (0613) | Australia East <br> Canada East <br> East US <br> East US 2 <br> France Central <br> Japan East <br> North Central US <br> Sweden Central <br> Switzerland North <br> UK South | 4,096 | Sep 2021 |
171-
| `gpt-35-turbo-16k` (0613) | Australia East <br> Canada East <br> East US <br> East US 2 <br> France Central <br> Japan East <br> North Central US <br> Sweden Central <br> Switzerland North<br> UK South | 16,384 | Sep 2021 |
172-
| `gpt-35-turbo-instruct` (0914) | East US <br> Sweden Central | 4,097 |Sep 2021 |
173-
| `gpt-35-turbo` (1106) | Australia East <br> Canada East <br> France Central <br> South India <br> Sweden Central<br> UK South <br> West US | Input: 16,385<br> Output: 4,096 | Sep 2021|
174-
|`gpt-35-turbo` (0125) **NEW** | Canada East <br> North Central US <br> South Central US | 16,385 | Sep 2021 |
176+
[!INCLUDE [GPT-35-Turbo](includes/model-matrix/standard-gpt-35-turbo.md)]
175177

176178
**<sup>1</sup>** This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
177179

@@ -182,16 +184,20 @@ These models can only be used with Embedding API requests.
182184
> [!NOTE]
183185
> `text-embedding-3-large` is the latest and most capable embedding model. Upgrading between embedding models is not possible. In order to migrate from using `text-embedding-ada-002` to `text-embedding-3-large` you would need to generate new embeddings.
184186
185-
| Model ID | Model Availability | Max Request (tokens) | Output Dimensions |Training Data (up-to)
187+
| Model ID | Max Request (tokens) | Output Dimensions |Training Data (up-to)
186188
|---|---| :---:|:---:|:---:|
187-
| `text-embedding-ada-002` (version 2) | Australia East <br> Canada East <br> East US <br> East US2 <br> France Central <br> Japan East <br> North Central US <br> Norway East <br> South Central US <br> Sweden Central <br> Switzerland North <br> UK South <br> West Europe <br> West US |8,191 | 1,536 | Sep 2021 |
188-
| `text-embedding-ada-002` (version 1) | East US <br> South Central US <br> West Europe |2,046 | 1,536 | Sep 2021 |
189-
| `text-embedding-3-large` | Canada East, East US, East US 2 | 8,191 | 3,072 |Sep 2021 |
190-
| `text-embedding-3-small` | Canada East, East US, East US 2 | 8,191| 1,536 | Sep 2021 |
189+
| `text-embedding-ada-002` (version 2) |8,191 | 1,536 | Sep 2021 |
190+
| `text-embedding-ada-002` (version 1) |2,046 | 1,536 | Sep 2021 |
191+
| `text-embedding-3-large` | 8,191 | 3,072 |Sep 2021 |
192+
| `text-embedding-3-small` | 8,191| 1,536 | Sep 2021 |
191193

192194
> [!NOTE]
193195
> When sending an array of inputs for embedding, the max number of input items in the array per call to the embedding endpoint is 2048.
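
The note above caps an embedding request at 2048 input items per call. A minimal sketch of how a caller might split a larger list into compliant batches follows. The names `chunk_inputs` and `MAX_INPUTS_PER_CALL` are illustrative and not part of any Azure OpenAI SDK, and sending each batch to the embeddings endpoint is deliberately left out.

```python
from typing import Iterator, List

# Limit stated in the note above: at most 2048 input items per embeddings call.
MAX_INPUTS_PER_CALL = 2048


def chunk_inputs(
    inputs: List[str], batch_size: int = MAX_INPUTS_PER_CALL
) -> Iterator[List[str]]:
    """Yield consecutive slices of `inputs`, each with at most `batch_size` items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(inputs), batch_size):
        yield inputs[start:start + batch_size]


if __name__ == "__main__":
    # Hypothetical data: 5000 short strings split into batches for separate calls.
    documents = [f"document {i}" for i in range(5000)]
    for batch in chunk_inputs(documents):
        print(len(batch))
```

Each yielded batch would be passed as the input array of one request. The per-request token limit in the table above still applies to every item in a batch.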
194196
197+
#### Public cloud regions
198+
199+
[!INCLUDE [Embeddings](includes/model-matrix/standard-embeddings.md)]
200+
195201
#### Azure Government regions
196202

197203
The following Embeddings models are available with [Azure Government](/azure/azure-government/documentation-government-welcome):
@@ -221,7 +227,6 @@ The following Embeddings models are available with [Azure Government](/azure/azu
221227
| `gpt-35-turbo` (1106) | North Central US <br> Sweden Central | Input: 16,385<br> Output: 4,096 | Sep 2021|
222228
| `gpt-35-turbo` (0125) | North Central US <br> Sweden Central | 16,385 | Sep 2021 |
223229

224-
225230
### Whisper models
226231

227232
| Model ID | Model Availability | Max Request (audio file size) |
Lines changed: 34 additions & 0 deletions
@@ -0,0 +1,34 @@
1+
---
2+
title: 'Quota regional limits'
3+
titleSuffix: Azure OpenAI Service
4+
description: Quota and limits for Azure OpenAI by region.
5+
manager: nitinme
6+
ms.service: azure-ai-openai
7+
ms.topic: include
8+
ms.date: 03/13/2024
9+
---
10+
11+
## Regional quota limits
12+
13+
The default quota for models varies by model and region. Default quota limits are subject to change.
14+
15+
16+
| Region | Text-Embedding-Ada-002 | text-embedding-3-small | text-embedding-3-large | GPT-35-Turbo | GPT-35-Turbo-Instruct | GPT-4 | GPT-4-32K | GPT-4-Turbo | GPT-4-Turbo-V | Babbage-002 | Babbage-002 - finetune | Davinci-002 | Davinci-002 - finetune | GPT-35-Turbo - finetune | GPT-35-Turbo-1106 - finetune | GPT-35-Turbo-0125 - finetune |
17+
|:-----------------|:-------------------------|:-------------------------|:-------------------------|:---------------|:------------------------|:--------|:------------|:--------------|:----------------|:--------------|:-------------------------|:--------------|:-------------------------|:--------------------------|:-------------------------------|:-------------------------------|
18+
| australiaeast | 350 K | - | - | 300 K | - | 40 K | 80 K | 80 K | 30 K | - | - | - | - | - | - | - |
19+
| brazilsouth | 350 K | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - |
20+
| canadaeast | 350 K | 350 K | 350 K | 300 K | - | 40 K | 80 K | 80 K | - | - | - | - | - | - | - | - |
21+
| eastus | 240 K | 350 K | 350 K | 240 K | 240 K | - | - | 80 K | - | - | - | - | - | - | - | - |
22+
| eastus2 | 350 K | 350 K | 350 K | 300 K | - | 40 K | 80 K | 80 K | - | - | - | - | - | - | - | - |
23+
| francecentral | 240 K | - | - | 240 K | - | 20 K | 60 K | 80 K | - | - | - | - | - | - | - | - |
24+
| japaneast | 350 K | - | - | 313 K | - | 40 K | 80 K | - | 30 K | - | - | - | - | - | - | - |
25+
| northcentralus | 350 K | - | - | 300 K | - | - | - | 80 K | - | 240 K | 250 K | 240 K | 250 K | 250 K | 250 K | 250 K |
26+
| norwayeast | 350 K | - | - | - | - | - | - | 150 K | - | - | - | - | - | - | - | - |
27+
| southafricanorth | 350 K | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - |
28+
| southcentralus | 240 K | - | - | 240 K | - | - | - | 80 K | - | - | - | - | - | - | - | - |
29+
| southindia | 350 K | - | - | 300 K | - | - | - | 150 K | - | - | - | - | - | - | - | - |
30+
| swedencentral | 350 K | - | - | 305 K | 240 K | 40 K | 80 K | 150 K | 30 K | 240 K | 250 K | 240 K | 250 K | 250 K | 250 K | 250 K |
31+
| switzerlandnorth | 350 K | - | - | 300 K | - | 40 K | 80 K | - | 30 K | - | - | - | - | - | - | - |
32+
| uksouth | 350 K | - | - | 240 K | - | 40 K | 80 K | 80 K | - | - | - | - | - | - | - | - |
33+
| westeurope | 240 K | - | - | 240 K | - | - | - | - | - | - | - | - | - | - | - | - |
34+
| westus | 350 K | - | - | 300 K | - | - | - | 80 K | 30 K | - | - | - | - | - | - | - |
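
To read the quota table programmatically, one option is to parse each row into numbers. The sketch below assumes that `K` denotes thousands and that `-` means no default quota is listed for that model and region. The include does not state the unit, so both points are assumptions, and the helper names are hypothetical.

```python
from typing import Dict, List, Optional


def parse_quota_cell(cell: str) -> Optional[int]:
    """Convert a cell such as '350 K' to 350000; '-' (nothing listed) becomes None."""
    text = cell.strip()
    if text == "-":
        return None
    number, unit = text.split()
    if unit != "K":
        raise ValueError(f"unexpected unit in quota cell: {cell!r}")
    # Assumption: 'K' is a thousands multiplier.
    return int(number) * 1000


def parse_quota_row(header: List[str], row: str) -> Dict[str, Optional[int]]:
    """Map each model column name to its parsed quota for one region row."""
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    # cells[0] is the region name; the remaining cells line up with header[1:].
    return {name: parse_quota_cell(value) for name, value in zip(header[1:], cells[1:])}


if __name__ == "__main__":
    # Shortened header and row, taken from the first columns of the table above.
    header = ["Region", "Text-Embedding-Ada-002", "text-embedding-3-small"]
    row = "| australiaeast | 350 K | - |"
    print(parse_quota_row(header, row))
```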
Lines changed: 29 additions & 0 deletions
@@ -0,0 +1,29 @@
1+
---
2+
title: Standard Embeddings models
3+
titleSuffix: Azure OpenAI Service
4+
description: embedding model regional availability
5+
manager: nitinme
6+
ms.service: azure-ai-openai
7+
ms.topic: include
8+
ms.date: 03/13/2024
9+
---
10+
11+
| (`Region`) | (`text-embedding-ada-002`, `1`) | (`text-embedding-ada-002`, `2`) | (`text-embedding-3-small`, `1`) | (`text-embedding-3-large`, `1`) |
12+
|:-----------------|:----------------------------------|:----------------------------------|:----------------------------------|:----------------------------------|
13+
| australiaeast | - || - | - |
14+
| brazilsouth | - || - | - |
15+
| canadaeast | - ||||
16+
| eastus |||||
17+
| eastus2 | - ||||
18+
| francecentral | - || - | - |
19+
| japaneast | - || - | - |
20+
| northcentralus | - || - | - |
21+
| norwayeast | - || - | - |
22+
| southafricanorth | - || - | - |
23+
| southcentralus ||| - | - |
24+
| southindia | - || - | - |
25+
| swedencentral | - || - | - |
26+
| switzerlandnorth | - || - | - |
27+
| uksouth | - || - | - |
28+
| westeurope | - || - | - |
29+
| westus | - || - | - |
