You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/models.md
+42-31Lines changed: 42 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -46,7 +46,7 @@ You can see the token context length supported by each model in the [model summa
46
46
47
47
To learn more about how to interact with GPT-3.5 Turbo and the Chat Completions API check out our [in-depth how-to](../how-to/chatgpt.md).
48
48
49
-
## Embeddings models
49
+
## Embeddings
50
50
51
51
> [!IMPORTANT]
52
52
> We strongly recommend using `text-embedding-ada-002 (Version 2)`. This model/version provides parity with OpenAI's `text-embedding-ada-002`. To learn more about the improvements offered by this model, please refer to [OpenAI's blog post](https://openai.com/blog/new-and-improved-embedding-model). Even if you are currently using Version 1 you should migrate to Version 2 to take advantage of the latest weights/updated token limit. Version 1 and Version 2 are not interchangeable, so document embedding and document search must be done using the same version of the model.
@@ -80,31 +80,42 @@ These models can only be used with the Chat Completion API.
80
80
81
81
GPT-4 version 0314 is the first version of the model released. Version 0613 is the second version of the model and adds function calling support.
82
82
83
-
| Model ID |Base model Regions | Fine-Tuning Regions |Max Request (tokens) | Training Data (up to) |
84
-
| --- | --- | --- | --- | --- |
85
-
|`gpt-4`<sup>2</sup> (0314) | East US<sup>1</sup>, France Central<sup>1</sup> | N/A<sup>3</sup> |8,192| September 2021 |
86
-
|`gpt-4-32k` <sup>2</sup> (0314)| East US<sup>1</sup>, France Central<sup>1</sup> | N/A<sup>3</sup> | 32,768 | September 2021 |
87
-
|`gpt-4` (0613) | Australia East<sup>1</sup>, Canada East, East US<sup>1</sup>, East US 2<sup>1</sup>, France Central<sup>1</sup>, Japan East<sup>1</sup>, Sweden Central, Switzerland North, UK South<sup>1</sup> | N/A<sup>3</sup> |8,192 | September 2021 |
88
-
|`gpt-4-32k` (0613) | Australia East<sup>1</sup>, Canada East, East US<sup>1</sup>, East US 2<sup>1</sup>, France Central<sup>1</sup>, Japan East<sup>1</sup>, Sweden Central, Switzerland North, UK South<sup>1</sup> | N/A<sup>3</sup> | 32,768 | September 2021 |
83
+
| Model ID | Max Request (tokens) | Training Data (up to) |
84
+
| --- | --- | --- |
85
+
|`gpt-4` (0314) |8,192 | September 2021 |
86
+
|`gpt-4-32k`(0314) | 32,768 | September 2021 |
87
+
|`gpt-4` (0613) | 8,192 | September 2021 |
88
+
|`gpt-4-32k` (0613) | 32,768 | September 2021 |
89
89
90
-
<sup>1</sup> Due to high demand, availability is limited in the region<br>
91
-
<sup>2</sup> Version `0314` of gpt-4 and gpt-4-32k will be retired no earlier than July 5, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.<br>
92
-
<sup>3</sup> Fine-tuning is not supported for GPT-4 models.
90
+
> [!NOTE]
91
+
> Any region where GPT-4 is listed as available will always have access to both the 4K and 32K versions of the model
92
+
93
+
### GPT-4 model availability
94
+
95
+
| Model Availability | gpt-4 (0314) | gpt-4 (0613) |
96
+
|---|---|---|
97
+
| Available to all subscriptions with Azure OpenAI access || Canada East <br> Sweden Central <br> Switzerland North |
98
+
| Available to subscriptions with current access in the region | East US <br> France Central <br> South Central US <br> UK South | Australia East <br> East US <br> East US 2 <br> France Central <br> Japan East <br> UK South |
93
99
94
100
### GPT-3.5 models
95
101
96
102
GPT-3.5 Turbo is used with the Chat Completion API. GPT-3.5 Turbo (0301) can also be used with the Completions API. GPT3.5 Turbo (0613) only supports the Chat Completions API.
97
103
98
104
GPT-3.5 Turbo version 0301 is the first version of the model released. Version 0613 is the second version of the model and adds function calling support.
99
105
100
-
| Model ID | Base model Regions | Fine-Tuning Regions | Max Request (tokens) | Training Data (up to) |
|`gpt-35-turbo`<sup>1</sup> (0301) | East US, France Central, South Central US, UK South, West Europe | N/A | 4,096 | Sep 2021 |
103
-
|`gpt-35-turbo` (0613) | Australia East, Canada East, East US, East US 2, France Central, Japan East, North Central US, Sweden Central, Switzerland North, UK South | North Central US, Sweden Central | 4,096 | Sep 2021 |
104
-
|`gpt-35-turbo-16k` (0613) | Australia East, Canada East, East US, East US 2, France Central, Japan East, North Central US, Sweden Central, Switzerland North, UK South | N/A | 16,384 | Sep 2021 |
105
-
|`gpt-35-turbo-instruct` (0914) | East US, Sweden Central | N/A | 4,097 | Sep 2021 |
106
+
> [!NOTE]
107
+
> Version `0301` of `gpt-35-turbo` will be retired no earlier than July 5, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
108
+
109
+
### GPT-3.5-Turbo model availability
110
+
111
+
| Model ID | Model Availability | Max Request (tokens) | Training Data (up to) |
112
+
| --------- | -------------------- |------|----|
113
+
|`gpt-35-turbo`<sup>1</sup> (0301) | East US <br> France Central <br> South Central US <br> UK South <br> West Europe | 4096 | Sep 2021 |
114
+
|`gpt-35-turbo` (0613) | Australia East <br> Canada East <br> East US <br> East US 2 <br> France Central <br> Japan East <br> North Central US <br> Sweden Central <br> Switzerland North <br> UK South | 4096 | Sep 2021 |
115
+
|`gpt-35-turbo-16k` (0613) | Australia East <br> Canada East <br> East US <br> East US 2 <br> France Central, Japan East <br> North Central US <br> Sweden Central <br> Switzerland North<br> UK South | 16,384 | Sep 2021 |
116
+
|`gpt-35-turbo-instruct` (0914) | East US <br> Sweden Central | 4097 |Sep 2021 |
106
117
107
-
<sup>1</sup> Version `0301` of gpt-35-turbo will be retired no earlier than July 5, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
118
+
<sup>1</sup> This model will accept requests > 4096 tokens. It is not recommended to exceed the 4096 input token limit as the newer version of the model are capped at 4096 tokens. If you encounter issues when exceeding 4096 input tokens with this model this configuration is not officially supported.
108
119
109
120
### Embeddings models
110
121
@@ -113,16 +124,16 @@ These models can only be used with Embedding API requests.
113
124
> [!NOTE]
114
125
> We strongly recommend using `text-embedding-ada-002 (Version 2)`. This model/version provides parity with OpenAI's `text-embedding-ada-002`. To learn more about the improvements offered by this model, please refer to [OpenAI's blog post](https://openai.com/blog/new-and-improved-embedding-model). Even if you are currently using Version 1 you should migrate to Version 2 to take advantage of the latest weights/updated token limit. Version 1 and Version 2 are not interchangeable, so document embedding and document search must be done using the same version of the model.
115
126
116
-
| Model ID |Base model Regions | Fine-Tuning Regions| Max Request (tokens) | Training Data (up to) | Output dimensions |
117
-
|---|---| ---|---|---|
118
-
| text-embedding-ada-002 (version 2) | Australia East, Canada East, East US, East US2, France Central, Japan East, North Central US, South Central US, Switzerland North, UK South, West Europe| N/A|8,191 | Sep 2021 | 1536 |
119
-
| text-embedding-ada-002 (version 1) | East US, South Central US, West Europe| N/A|2,046 | Sep 2021 | 1536 |
127
+
| Model ID |Model Availability | Max Request (tokens) | Training Data (up to) | Output dimensions |
128
+
|---|---| ---|---|---|
129
+
|`text-embedding-ada-002`` (version 2) | Australia East <br> Canada East <br> East US <br> East US2 <br> France Central <br> Japan East <br> North Central US <br> South Central US <br> Switzerland North <br> UK South <br> West Europe |8,191 | Sep 2021 | 1536 |
130
+
|`text-embedding-ada-002` (version 1) | East US <br> South Central US <br> West Europe |2,046 | Sep 2021 | 1536 |
120
131
121
132
### DALL-E models (Preview)
122
133
123
-
| Model ID |Base model Regions | Fine-Tuning Regions |Max Request (characters)| Training Data (up to) |
124
-
| --- | --- | --- | --- | --- |
125
-
| dalle2 | East US |N/A |1000| N/A|
134
+
| Model ID |Feature Availability |Max Request (characters) |
135
+
| --- | --- | --- |
136
+
| dalle2 | East US | 1000 |
126
137
127
138
### Fine-tuning models (Preview)
128
139
@@ -131,16 +142,16 @@ These models can only be used with Embedding API requests.
131
142
`gpt-35-turbo-0613` - fine-tuning of this model is limited to a subset of regions, and is not available in every region the base model is available.
132
143
133
144
| Model ID | Fine-Tuning Regions | Max Request (tokens) | Training Data (up to) |
134
-
| --- | --- | --- | --- | --- |
135
-
|`babbage-002`| North Central US, Sweden Central | 16,384 | Sep 2021 |
136
-
|`davinci-002`| North Central US, Sweden Central | 16,384 | Sep 2021 |
137
-
|`gpt-35-turbo` (0613) | North Central US, Sweden Central | 4096 | Sep 2021 |
145
+
| --- | --- | --- | --- |
146
+
|`babbage-002`| North Central US <br> Sweden Central | 16,384 | Sep 2021 |
147
+
|`davinci-002`| North Central US <br> Sweden Central | 16,384 | Sep 2021 |
148
+
|`gpt-35-turbo` (0613) | North Central US <br> Sweden Central | 4096 | Sep 2021 |
138
149
139
150
### Whisper models (Preview)
140
151
141
-
| Model ID |Base model Regions | Fine-Tuning Regions |Max Request (audio file size)| Training Data (up to) |
142
-
| --- | --- | --- | --- | --- |
143
-
| whisper | North Central US, West Europe |N/A |25 MB| N/A|
152
+
| Model ID |Model Availability |Max Request (audio file size) |
153
+
| --- | --- | --- |
154
+
|`whisper`| North Central US <br> West Europe | 25 MB |
0 commit comments