You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -49,12 +49,22 @@ To learn more about how to interact with GPT-3.5 Turbo and the Chat Completions
49
49
50
50
## Embeddings
51
51
52
-
> [!IMPORTANT]
53
-
> We strongly recommend using `text-embedding-ada-002 (Version 2)`. This model/version provides parity with OpenAI's `text-embedding-ada-002`. To learn more about the improvements offered by this model, please refer to [OpenAI's blog post](https://openai.com/blog/new-and-improved-embedding-model). Even if you are currently using Version 1 you should migrate to Version 2 to take advantage of the latest weights/updated token limit. Version 1 and Version 2 are not interchangeable, so document embedding and document search must be done using the same version of the model.
52
+
`text-embedding-3-large` is the latest and most capable embedding model. Upgrading between embeddings models is not possible. In order to move from using `text-embedding-ada-002` to `text-embedding-3-large` you would need to generate new embeddings.
53
+
54
+
-`text-embedding-3-large`
55
+
-`text-embedding-3-small`
56
+
-`text-embedding-ada-002`
57
+
58
+
In testing, OpenAI reports both the large and small third generation embeddings models offer better average multi-language retrieval performance with the [MIRACL](https://github.com/project-miracl/miracl) benchmark while still maintaining performance for English tasks with the [MTEB](https://github.com/embeddings-benchmark/mteb) benchmark.
The previous embeddings models have been consolidated into the following new replacement model:
65
+
The third generation embeddings models support reducing the size of the embedding via a new `dimensions` parameter. Typically larger embeddings are more expensive from a compute, memory, and storage perspective. Being able to adjust the number of dimensions allows more control over overall cost and performance. Official support for the dimensions parameter was added to the OpenAI Python library in version `1.10.0`. If you are running an earlier version of the 1.x library you will need to upgrade `pip install openai --upgrade`.
56
66
57
-
`text-embedding-ada-002`
67
+
OpenAI's MTEB benchmark testing found that even when the third generation model's dimensions are reduced to less than `text-embeddings-ada-002` 1,536 dimensions performance remains slightly better.
58
68
59
69
## DALL-E (Preview)
60
70
@@ -92,7 +102,7 @@ GPT-4 version 0125-preview is an updated version of the GPT-4 Turbo preview prev
92
102
93
103
> [!IMPORTANT]
94
104
>
95
-
> -`gpt-4` version 0125-preview replaces version 1106-preview. Deployments of `gpt-4` version 1106-preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded on February 20, 2024 and will complete upgrades within 2 weeks. Deployments of `gpt-4` version 1106-preview set to "No autoupgrade" will stop working starting February 20, 2024. If you have a deployment of `gpt-4` version 1106-preview, you can test version `0125-preview` in the available regions below.
105
+
> -`gpt-4` version 0125-preview replaces version 1106-preview. Deployments of `gpt-4` version 1106-preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded on March 8th, 2024 and will complete upgrades within 2 weeks. Deployments of `gpt-4` version 1106-preview set to "No autoupgrade" will stop working starting March 8th, 2024. If you have a deployment of `gpt-4` version 1106-preview, you can test version `0125-preview` in the available regions below.
96
106
97
107
| Model ID | Max Request (tokens) | Training Data (up to) |
98
108
| --- | :--- | :---: |
@@ -101,7 +111,7 @@ GPT-4 version 0125-preview is an updated version of the GPT-4 Turbo preview prev
**<sup>1</sup>** GPT-4 Turbo Preview = `gpt-4` (0125-preview). To deploy this model, under **Deployments** select model **gpt-4**. For **Model version** select **0125-preview**.
@@ -137,6 +147,9 @@ The following GPT-4 models are available with [Azure Government](/azure/azure-go
137
147
138
148
### GPT-3.5 models
139
149
150
+
> [!IMPORTANT]
151
+
> The NEW `gpt-35-turbo (0125)` model has various improvements, including higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.
152
+
140
153
GPT-3.5 Turbo is used with the Chat Completion API. GPT-3.5 Turbo version 0301 can also be used with the Completions API. GPT-3.5 Turbo versions 0613 and 1106 only support the Chat Completions API.
141
154
142
155
GPT-3.5 Turbo version 0301 is the first version of the model released. Version 0613 is the second version of the model and adds function calling support.
@@ -148,6 +161,7 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
148
161
149
162
### GPT-3.5-Turbo model availability
150
163
164
+
151
165
#### Public cloud regions
152
166
153
167
| Model ID | Model Availability | Max Request (tokens) | Training Data (up to) |
@@ -157,6 +171,7 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
157
171
|`gpt-35-turbo-16k` (0613) | Australia East <br> Canada East <br> East US <br> East US 2 <br> France Central <br> Japan East <br> North Central US <br> Sweden Central <br> Switzerland North<br> UK South | 16,384 | Sep 2021 |
158
172
|`gpt-35-turbo-instruct` (0914) | East US <br> Sweden Central | 4,097 |Sep 2021 |
159
173
|`gpt-35-turbo` (1106) | Australia East <br> Canada East <br> France Central <br> South India <br> Sweden Central<br> UK South <br> West US | Input: 16,385<br> Output: 4,096 | Sep 2021|
174
+
|`gpt-35-turbo` (0125) **NEW**| Canada East <br> North Central US <br> South Central US | 16,385 | Sep 2021 |
160
175
161
176
**<sup>1</sup>** This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
162
177
@@ -165,12 +180,14 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
165
180
These models can only be used with Embedding API requests.
166
181
167
182
> [!NOTE]
168
-
> We strongly recommend using `text-embedding-ada-002 (Version 2)`. This model/version provides parity with OpenAI's `text-embedding-ada-002`. To learn more about the improvements offered by this model, please refer to [OpenAI's blog post](https://openai.com/blog/new-and-improved-embedding-model). Even if you are currently using Version 1 you should migrate to Version 2 to take advantage of the latest weights/updated token limit. Version 1 and Version 2 are not interchangeable, so document embedding and document search must be done using the same version of the model.
183
+
> `text-embedding-3-large` is the latest and most capable embedding model. Upgrading between embedding models is not possible. In order to migrate from using `text-embedding-ada-002` to `text-embedding-3-large`you would need to generate new embeddings.
169
184
170
-
| Model ID | Model Availability | Max Request (tokens) | Training Data (upto)| Output Dimensions |
185
+
| Model ID | Model Availability | Max Request (tokens) | Output Dimensions |Training Data (up-to)
171
186
|---|---| :---:|:---:|:---:|
172
-
|`text-embedding-ada-002` (version 2) | Australia East <br> Canada East <br> East US <br> East US2 <br> France Central <br> Japan East <br> North Central US <br> Norway East <br> South Central US <br> Sweden Central <br> Switzerland North <br> UK South <br> West Europe <br> West US |8,191 | Sep 2021 | 1,536 |
173
-
|`text-embedding-ada-002` (version 1) | East US <br> South Central US <br> West Europe |2,046 | Sep 2021 | 1,536 |
187
+
|`text-embedding-ada-002` (version 2) | Australia East <br> Canada East <br> East US <br> East US2 <br> France Central <br> Japan East <br> North Central US <br> Norway East <br> South Central US <br> Sweden Central <br> Switzerland North <br> UK South <br> West Europe <br> West US |8,191 | 1,536 | Sep 2021 |
188
+
|`text-embedding-ada-002` (version 1) | East US <br> South Central US <br> West Europe |2,046 | 1,536 | Sep 2021 |
189
+
|`text-embedding-3-large`| Canada East, East US, East US 2 | 8,191 | 3,072 |Sep 2021 |
190
+
|`text-embedding-3-small`| Canada East, East US, East US 2 | 8,191| 1,536 | Sep 2021 |
174
191
175
192
> [!NOTE]
176
193
> When sending an array of inputs for embedding, the max number of input items in the array per call to the embedding endpoint is 2048.
Copy file name to clipboardExpand all lines: articles/ai-services/openai/tutorials/embeddings.md
+1-4Lines changed: 1 addition & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -26,10 +26,7 @@ In this tutorial, you learn how to:
26
26
> * Create environment variables for your resources endpoint and API key.
27
27
> * Use the **text-embedding-ada-002 (Version 2)** model
28
28
> * Use [cosine similarity](../concepts/understand-embeddings.md) to rank search results.
29
-
30
-
> [!IMPORTANT]
31
-
> We strongly recommend using `text-embedding-ada-002 (Version 2)`. This model/version provides parity with OpenAI's `text-embedding-ada-002`. To learn more about the improvements offered by this model, please refer to [OpenAI's blog post](https://openai.com/blog/new-and-improved-embedding-model). Even if you are currently using Version 1 you should migrate to Version 2 to take advantage of the latest weights/updated token limit. Version 1 and Version 2 are not interchangeable, so document embedding and document search must be done using the same version of the model.
Copy file name to clipboardExpand all lines: articles/ai-services/openai/whats-new.md
+25-2Lines changed: 25 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
---
2
2
title: What's new in Azure OpenAI Service?
3
3
titleSuffix: Azure AI services
4
-
description: Learn about the latest news and features updates for Azure OpenAI
4
+
description: Learn about the latest news and features updates for Azure OpenAI.
5
5
manager: nitinme
6
6
author: mrbullwinkle
7
7
ms.author: mbullwin
@@ -10,14 +10,37 @@ ms.custom:
10
10
- ignite-2023
11
11
- references_regions
12
12
ms.topic: whats-new
13
-
ms.date: 02/15/2024
13
+
ms.date: 02/21/2024
14
14
recommendations: false
15
15
---
16
16
17
17
# What's new in Azure OpenAI Service
18
18
19
19
## February 2024
20
20
21
+
### GPT-3.5-turbo-0125 model available
22
+
23
+
This model has various improvements, including higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.
24
+
25
+
For information on model regional availability and upgrades refer to the [models page](./concepts/models.md).
26
+
27
+
### Third generation embeddings models available
28
+
29
+
-`text-embedding-3-large`
30
+
-`text-embedding-3-small`
31
+
32
+
In testing, OpenAI reports both the large and small third generation embeddings models offer better average multi-language retrieval performance with the [MIRACL](https://github.com/project-miracl/miracl) benchmark while still maintaining better performance for English tasks with the [MTEB](https://github.com/embeddings-benchmark/mteb) benchmark than the second generation text-embedding-ada-002 model.
33
+
34
+
For information on model regional availability and upgrades refer to the [models page](./concepts/models.md).
35
+
36
+
### GPT-3.5 Turbo quota consolidation
37
+
38
+
To simplify migration between different versions of the GPT-3.5-Turbo models (including 16k), we will be consolidating all GPT-3.5-Turbo quota into a single quota value.
39
+
40
+
- Any customers who have increased quota approved will have combined total quota that reflects the previous increases.
41
+
42
+
- Any customer whose current total usage across model versions is less than the default will get a new combined total quota by default.
43
+
21
44
### GPT-4-0125-preview model available
22
45
23
46
The `gpt-4` model version `0125-preview` is now available on Azure OpenAI Service in the East US, North Central US, and South Central US regions. Customers with deployments of `gpt-4` version `1106-preview` will be automatically upgraded to `0125-preview` in the coming weeks.
0 commit comments