Skip to content

Commit 3f27a1c

Browse files
Merge pull request #275046 from mrbullwinkle/mrb_05_10_2024_mod_002
[Azure OpenAI] table cleanup
2 parents 7ccea80 + 95e10b0 commit 3f27a1c

File tree

1 file changed

+13
-18
lines changed
  • articles/ai-services/openai/concepts

1 file changed

+13
-18
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 13 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -54,10 +54,10 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
5454
| `gpt-4` (0125-Preview)*<br>**GPT-4 Turbo Preview** | **Preview Model** <br> -Replaces 1106-Preview <br>- Better code generation performance <br> - Reduces cases where the model doesn't complete a task <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
5555
| `gpt-4` (vision-preview)<br>**GPT-4 Turbo with Vision Preview** | **Preview model** <br> - Accepts text and image input. <br> - Supports enhancements <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
5656
| `gpt-4` (1106-Preview)<br>**GPT-4 Turbo Preview** | **Preview Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
57-
| `gpt-4-32k` (0613) | **Older GA model** <br> - Basic function calling with tools | 32,768 | Sep 2021 |
57+
| `gpt-4-32k` (0613) | **Older GA model** <br> - Basic function calling with tools | 32,768 | Sep 2021 |
5858
| `gpt-4` (0613) | **Older GA model** <br> - Basic function calling with tools | 8,192 | Sep 2021 |
59-
| `gpt-4-32k`(0314) | **Older GA model** <br> - Deprecated function calling | 32,768 | Sep 2021 |
60-
| `gpt-4` (0314) | **Older GA model** <br> - Deprecated function calling | 8,192 | Sep 2021 |
59+
| `gpt-4-32k`(0314) | **Older GA model** <br> - [Retirement information](./model-retirements.md#current-models) | 32,768 | Sep 2021 |
60+
| `gpt-4` (0314) | **Older GA model** <br> - [Retirement information](./model-retirements.md#current-models) | 8,192 | Sep 2021 |
6161

6262
> [!CAUTION]
6363
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to either future preview versions or to the latest stable/GA version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
@@ -76,14 +76,20 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
7676

7777
GPT-3.5 models can understand and generate natural language or code. The most capable and cost effective model in the GPT-3.5 family is GPT-3.5 Turbo, which has been optimized for chat and works well for traditional completions tasks as well. GPT-3.5 Turbo is available for use with the Chat Completions API. GPT-3.5 Turbo Instruct has similar capabilities to `text-davinci-003` using the Completions API instead of the Chat Completions API. We recommend using GPT-3.5 Turbo and GPT-3.5 Turbo Instruct over [legacy GPT-3.5 and GPT-3 models](./legacy-models.md).
7878

79-
- `gpt-35-turbo`
80-
- `gpt-35-turbo-16k`
81-
- `gpt-35-turbo-instruct`
8279

83-
You can see the token context length supported by each model in the [model summary table](#model-summary-table-and-region-availability).
80+
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
81+
| --------- |:---|:------:|:----:|
82+
| `gpt-35-turbo` (0125) **NEW** | **Latest GA Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) <br> - Higher accuracy at responding in requested formats. <br> - Fix for a bug which caused a text encoding issue for non-English language function calls. | Input: 16,385<br> Output: 4,096 | Sep 2021 |
83+
| `gpt-35-turbo` (1106) | **Older GA Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 16,385<br> Output: 4,096 | Sep 2021|
84+
| `gpt-35-turbo-instruct` (0914) | **Completions endpoint only** | 4,097 |Sep 2021 |
85+
| `gpt-35-turbo-16k` (0613) | **Older GA Model** <br> - Basic function calling with tools | 16,384 | Sep 2021 |
86+
| `gpt-35-turbo` (0613) | **Older GA Model** <br> - Basic function calling with tools | 4,096 | Sep 2021 |
87+
| `gpt-35-turbo`**<sup>1</sup>** (0301) | **Older GA Model** <br> - [Retirement information](./model-retirements.md#current-models) | 4,096 | Sep 2021 |
8488

8589
To learn more about how to interact with GPT-3.5 Turbo and the Chat Completions API check out our [in-depth how-to](../how-to/chatgpt.md).
8690

91+
**<sup>1</sup>** This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
92+
8793
## Embeddings
8894

8995
`text-embedding-3-large` is the latest and most capable embedding model. Upgrading between embeddings models is not possible. In order to move from using `text-embedding-ada-002` to `text-embedding-3-large` you would need to generate new embeddings.
@@ -184,23 +190,12 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
184190
> [!NOTE]
185191
> Version `0613` of `gpt-35-turbo` and `gpt-35-turbo-16k` will be retired no earlier than August 1, 2024. Version `0301` of `gpt-35-turbo` will be retired no earlier than August 1, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
186192
187-
| Model ID | Max Request (tokens) | Training Data (up to) |
188-
| --------- |:------:|:----:|
189-
| `gpt-35-turbo`**<sup>1</sup>** (0301) | 4,096 | Sep 2021 |
190-
| `gpt-35-turbo` (0613) | 4,096 | Sep 2021 |
191-
| `gpt-35-turbo-16k` (0613) | 16,384 | Sep 2021 |
192-
| `gpt-35-turbo-instruct` (0914) | 4,097 |Sep 2021 |
193-
| `gpt-35-turbo` (1106) | Input: 16,385<br> Output: 4,096 | Sep 2021|
194-
| `gpt-35-turbo` (0125) **NEW** | Input: 16,385<br> Output: 4,096 | Sep 2021 |
195-
196193
### GPT-3.5-Turbo model availability
197194

198195
#### Public cloud regions
199196

200197
[!INCLUDE [GPT-35-Turbo](../includes/model-matrix/standard-gpt-35-turbo.md)]
201198

202-
**<sup>1</sup>** This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
203-
204199
#### Azure Government regions
205200

206201
The following GPT-3.5 turbo models are available with [Azure Government](/azure/azure-government/documentation-government-welcome):

0 commit comments

Comments
 (0)