You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -19,7 +19,8 @@ Azure OpenAI provides access to many different models, grouped by family and cap
19
19
20
20
| Model family | Description |
21
21
|--|--|
22
-
|[GPT-3](#gpt-3-models)| A series of models that can understand and generate natural language. This includes the new [ChatGPT model](#chatgpt-gpt-35-turbo). |
22
+
|[GPT-4](#gpt-4-models)| A set of models that improve on GPT-3.5 and can understand as well as generate natural language and code. **These models are currently in preview.**|
23
+
|[GPT-3](#gpt-3-models)| A series of models that can understand and generate natural language. This includes the new [ChatGPT model (preview)](#chatgpt-gpt-35-turbo-preview). |
23
24
|[Codex](#codex-models)| A series of models that can understand and generate code, including translating natural language to code. |
24
25
|[Embeddings](#embeddings-models)| A set of models that can understand and use embeddings. An embedding is a special format of data representation that can be easily utilized by machine learning models and algorithms. The embedding is an information dense representation of the semantic meaning of a piece of text. Currently, we offer three families of Embeddings models for different functionalities: similarity, text search, and code search. |
25
26
@@ -53,7 +54,17 @@ You can get a list of models that are available for both inference and fine-tuni
53
54
54
55
## Finding the right model
55
56
56
-
We recommend starting with the most capable model in a model family to confirm whether the model capabilities meet your requirements. Then you can stay with that model or move to a model with lower capability and cost, optimizing around that model's capabilities.
57
+
We recommend starting with the most capable model in a model family to confirm whether the model capabilities meet your requirements. Then you can stay with that model or move to a model with lower capability and cost, optimizing around that model's capabilities.
58
+
59
+
## GPT-4 models (preview)
60
+
61
+
GPT-4 can solve difficult problems with greater accuracy than any of OpenAI's previous models. Like gpt-35-turbo, GPT-4 is optimized for chat but works well for traditional completions tasks.
62
+
63
+
These models are currently in preview. For access, existing Azure OpenAI customers can [apply by filling out this form](https://aka.ms/oai/get-gpt4).
64
+
-`gpt-4`
65
+
-`gpt-4-32k`
66
+
67
+
The `gpt-4` supports 8192 max input tokens and the `gpt-4-32k` supports up to 32,768 tokens.
57
68
58
69
## GPT-3 models
59
70
@@ -92,11 +103,11 @@ Ada is usually the fastest model and can perform tasks like parsing text, addres
The ChatGPT model (gpt-35-turbo) is a language model designed for conversational interfaces and the model behaves differently than previous GPT-3 models. Previous models were text-in and text-out, meaning they accepted a prompt string and returned a completion to append to the prompt. However, the ChatGPT model is conversation-in and message-out. The model expects a prompt string formatted in a specific chat-like transcript format, and returns a completion that represents a model-written message in the chat.
98
109
99
-
The ChatGPT model uses the same completion API that you use for other models like text-davinci-002, but it requires a unique prompt format. It's important to use the new prompt format to get the best results. Without the right prompts, the model tends to be verbose and provides less useful responses. To learn more check out our [in-depth how-to](../how-to/chatgpt.md).
110
+
To learn more about the ChatGPT model and how to interact with the Chat API check out our [in-depth how-to](../how-to/chatgpt.md).
100
111
101
112
## Codex models
102
113
@@ -173,13 +184,23 @@ When using our embeddings models, keep in mind their limitations and risks.
173
184
| davinci<sup>1</sup> | Yes | No | N/A | East US<sup>2</sup>, South Central US, West Europe<sup>2</sup> | 2,049 | Oct 2019|
174
185
| text-davinci-001 | Yes | No | South Central US, West Europe | N/A |||
175
186
| text-davinci-002 | Yes | No | East US, South Central US, West Europe | N/A | 4,097 | Jun 2021 |
176
-
| text-davinci-003 | Yes | No | East US | N/A | 4,097 | Jun 2021 |
187
+
| text-davinci-003 | Yes | No | East US, West Europe| N/A | 4,097 | Jun 2021 |
177
188
| text-davinci-fine-tune-002<sup>1</sup> | Yes | No | N/A | East US, West Europe<sup>2</sup> |||
178
-
| gpt-35-turbo<sup>3</sup> (ChatGPT) | Yes | No | N/A | East US, South Central US | 4,096 | Sep 2021
189
+
| gpt-35-turbo<sup>3</sup> (ChatGPT) (preview) | Yes | No | East US, South Central US | N/A | 4,096 | Sep 2021
179
190
180
191
<sup>1</sup> The model is available by request only. Currently we aren't accepting new requests to use the model.
181
192
<br><sup>2</sup> East US and West Europe are currently unavailable for new customers to fine-tune due to high demand. Please use US South Central region for fine-tuning.
182
-
<br><sup>3</sup> Currently, only version `"0301"` of this model is available. This version of the model will be deprecated on 8/1/2023 in favor of newer version of the gpt-35-model. See [ChatGPT model versioning](../how-to/chatgpt.md#model-versioning) for more details.
193
+
<br><sup>3</sup> Currently, only version `0301` of this model is available. This version of the model will be deprecated on 8/1/2023 in favor of newer version of the gpt-35-model. See [ChatGPT model versioning](../how-to/chatgpt.md#model-versioning) for more details.
194
+
195
+
### GPT-4 Models
196
+
197
+
| Model ID | Supports Completions | Supports Embeddings | Base model Regions | Fine-Tuning Regions | Max Request (tokens) | Training Data (up to) |
0 commit comments