You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to either future preview versions or to the latest stable/GA version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
@@ -76,14 +76,20 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
76
76
77
77
GPT-3.5 models can understand and generate natural language or code. The most capable and cost effective model in the GPT-3.5 family is GPT-3.5 Turbo, which has been optimized for chat and works well for traditional completions tasks as well. GPT-3.5 Turbo is available for use with the Chat Completions API. GPT-3.5 Turbo Instruct has similar capabilities to `text-davinci-003` using the Completions API instead of the Chat Completions API. We recommend using GPT-3.5 Turbo and GPT-3.5 Turbo Instruct over [legacy GPT-3.5 and GPT-3 models](./legacy-models.md).
78
78
79
-
-`gpt-35-turbo`
80
-
-`gpt-35-turbo-16k`
81
-
-`gpt-35-turbo-instruct`
82
79
83
-
You can see the token context length supported by each model in the [model summary table](#model-summary-table-and-region-availability).
80
+
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
81
+
| --------- |:---|:------:|:----:|
82
+
|`gpt-35-turbo` (0125) **NEW**|**Latest GA Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) <br> - Higher accuracy at responding in requested formats. <br> - Fix for a bug which caused a text encoding issue for non-English language function calls. | Input: 16,385<br> Output: 4,096 | Sep 2021 |
To learn more about how to interact with GPT-3.5 Turbo and the Chat Completions API check out our [in-depth how-to](../how-to/chatgpt.md).
86
90
91
+
**<sup>1</sup>** This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
92
+
87
93
## Embeddings
88
94
89
95
`text-embedding-3-large` is the latest and most capable embedding model. Upgrading between embeddings models is not possible. In order to move from using `text-embedding-ada-002` to `text-embedding-3-large` you would need to generate new embeddings.
@@ -184,23 +190,12 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
184
190
> [!NOTE]
185
191
> Version `0613` of `gpt-35-turbo` and `gpt-35-turbo-16k` will be retired no earlier than August 1, 2024. Version `0301` of `gpt-35-turbo` will be retired no earlier than August 1, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
186
192
187
-
| Model ID | Max Request (tokens) | Training Data (up to) |
**<sup>1</sup>** This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
203
-
204
199
#### Azure Government regions
205
200
206
201
The following GPT-3.5 turbo models are available with [Azure Government](/azure/azure-government/documentation-government-welcome):
0 commit comments