You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/models.md
+11-3Lines changed: 11 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,6 +18,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
18
18
19
19
| Models | Description |
20
20
|--|--|
21
+
|[o1-preview and o1-mini](#o1-preview-and-o1-mini-models-limited-access)| Limited access models, specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. |
21
22
|[GPT-4o & GPT-4o mini & GPT-4 Turbo](#gpt-4o-and-gpt-4-turbo)| The latest most capable Azure OpenAI models with multimodal versions, which can accept both text and images as input. |
22
23
|[GPT-4o audio](#gpt-4o-audio)| A GPT-4o model that supports low-latency, "speech in, speech out" conversational interactions. |
23
24
|[GPT-4](#gpt-4)| A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
@@ -31,6 +32,11 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
31
32
32
33
The Azure OpenAI `o1-preview` and `o1-mini` models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
33
34
35
+
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
36
+
| --- | :--- |:--- |:---: |
37
+
|`o1-preview` (2024-09-12) | The most capable model in the o1 series, offering enhanced reasoning abilities.| Input: 128,000 <br> Output: 32,768 | Oct 2023 |
38
+
|`o1-mini` (2024-09-12) | A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption.| Input: 128,000 <br> Output: 65,536 | Oct 2023 |
39
+
34
40
### Availability
35
41
36
42
The `o1-preview` and `o1-mini` models are now available for API access and model deployment. **Registration is required, and access will be granted based on Microsoft's eligibility criteria**.
@@ -43,7 +49,11 @@ Once access has been granted, you will need to create a deployment for each mode
43
49
44
50
Support for the **o1 series** models was added in API version `2024-09-01-preview`.
45
51
46
-
The `max_tokens` parameter has been deprecated and replaced with the new `max_completion_tokens` parameter. **o1 series** models will only work with the `max_completions_tokens` parameter.
52
+
The `max_tokens` parameter has been deprecated and replaced with the new `max_completion_tokens` parameter. **o1 series** models will only work with the `max_completions_tokens` parameter. `max_completions_tokens` is backwards compatible with `max_tokens`.
53
+
54
+
### Region availability
55
+
56
+
Available for standard and global standard deployment in East US2 and Sweden Central for approved customers.
47
57
48
58
## GPT-4o audio
49
59
@@ -99,8 +109,6 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
99
109
100
110
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
101
111
| --- | :--- |:--- |:---: |
102
-
|`o1-preview` (2024-09-12) | The most capable model in the o1 series, offering enhanced reasoning abilities.| Input: 128,000 <br> Output: 32,768 | Oct 2023 |
103
-
|`o1-mini` (2024-09-12) | A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption.| Input: 128,000 <br> Output: 65,536 | Oct 2023 |
104
112
|`gpt-4o` (2024-08-06) <br> **GPT-4o (Omni)**|**Latest large GA model** <br> - Structured outputs<br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 16,384 | Oct 2023 |
105
113
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini**|**Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384 | Oct 2023 |
106
114
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)**| Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
Copy file name to clipboardExpand all lines: articles/ai-services/openai/whats-new.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -32,7 +32,7 @@ Customers who were already approved and have access to the model through the ear
32
32
33
33
Support for the **o1 series** models was added in API version `2024-09-01-preview`.
34
34
35
-
The `max_tokens` parameter has been deprecated and replaced with the new `max_completion_tokens` parameter. **o1 series** models will only work with the `max_completions_tokens` parameter.
35
+
The `max_tokens` parameter has been deprecated and replaced with the new `max_completion_tokens` parameter. **o1 series** models will only work with the `max_completions_tokens` parameter.`max_completions_tokens` is backwards compatible with `max_tokens`.
0 commit comments