Skip to content

Commit 611aeb4

Browse files
Merge pull request #258867 from mrbullwinkle/mrb_11_16_2023_models
Azure OpenAI [Release branch] Models and What's New update
2 parents d773bc9 + 7c28830 commit 611aeb4

File tree

2 files changed

+23
-9
lines changed

2 files changed

+23
-9
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 12 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 10/04/2023
7+
ms.date: 11/17/2023
88
ms.custom: event-tier1-build-2022, references_regions, build-2023, build-2023-dataai
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -86,20 +86,23 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
8686
> Version `0314` of `gpt-4` and `gpt-4-32k` will be retired no earlier than July 5, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
8787
8888
| Model ID | Max Request (tokens) | Training Data (up to) |
89-
| --- | :---: | :---: |
89+
| --- | :--- | :---: |
9090
| `gpt-4` (0314) | 8,192 | Sep 2021 |
9191
| `gpt-4-32k`(0314) | 32,768 | Sep 2021 |
9292
| `gpt-4` (0613) | 8,192 | Sep 2021 |
9393
| `gpt-4-32k` (0613) | 32,768 | Sep 2021 |
94+
| `gpt-4` (1106-preview)**<sup>1</sup>** | Input: 128,000 <br> Output: 4096 | Apr 2023 |
95+
96+
**<sup>1</sup>** We don't recommend using this model in production. We will upgrade all deployments of this model to a future stable version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
9497

9598
> [!NOTE]
96-
> Regions where GPT-4 is listed as available have access to both the 8K and 32K versions of the model
99+
> Regions where GPT-4 (0314) & (0613) are listed as available have access to both the 8K and 32K versions of the model
97100
98101
### GPT-4 model availability
99102

100-
| Model Availability | gpt-4 (0314) | gpt-4 (0613) |
101-
|---|:---|:---|
102-
| Available to all subscriptions with Azure OpenAI access | | Australia East <br> Canada East <br> France Central <br> Sweden Central <br> Switzerland North |
103+
| Model Availability | gpt-4 (0314) | gpt-4 (0613) | gpt-4 (1106-preview) |
104+
|---|:---|:---|:---|
105+
| Available to all subscriptions with Azure OpenAI access | | Australia East <br> Canada East <br> France Central <br> Sweden Central <br> Switzerland North | Australia East <br> Canada East <br> East US 2 <br> France Central <br> Norway East <br> South India <br> Sweden Central <br> UK South <br> West US |
103106
| Available to subscriptions with current access to the model version in the region | East US <br> France Central <br> South Central US <br> UK South | East US <br> East US 2 <br> Japan East <br> UK South |
104107

105108
### GPT-3.5 models
@@ -117,12 +120,13 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
117120

118121
| Model ID | Model Availability | Max Request (tokens) | Training Data (up to) |
119122
| --------- | -------------------- |:------:|:----:|
120-
| `gpt-35-turbo`<sup>1</sup> (0301) | East US <br> France Central <br> South Central US <br> UK South <br> West Europe | 4096 | Sep 2021 |
123+
| `gpt-35-turbo`**<sup>1</sup>** (0301) | East US <br> France Central <br> South Central US <br> UK South <br> West Europe | 4096 | Sep 2021 |
121124
| `gpt-35-turbo` (0613) | Australia East <br> Canada East <br> East US <br> East US 2 <br> France Central <br> Japan East <br> North Central US <br> Sweden Central <br> Switzerland North <br> UK South | 4096 | Sep 2021 |
122125
| `gpt-35-turbo-16k` (0613) | Australia East <br> Canada East <br> East US <br> East US 2 <br> France Central <br> Japan East <br> North Central US <br> Sweden Central <br> Switzerland North<br> UK South | 16,384 | Sep 2021 |
123126
| `gpt-35-turbo-instruct` (0914) | East US <br> Sweden Central | 4097 |Sep 2021 |
127+
| `gpt-35-turbo` (1106) | Australia East <br> Canada East <br> France Central <br> South India <br> Sweden Central<br> UK South<br> West US | Input: 16,385<br> Output: 4,096 | Sep 2021|
124128

125-
<sup>1</sup> This model will accept requests > 4096 tokens. It is not recommended to exceed the 4096 input token limit as the newer version of the model are capped at 4096 tokens. If you encounter issues when exceeding 4096 input tokens with this model this configuration is not officially supported.
129+
**<sup>1</sup>** This model will accept requests > 4096 tokens. It is not recommended to exceed the 4096 input token limit as the newer version of the model are capped at 4096 tokens. If you encounter issues when exceeding 4096 input tokens with this model this configuration is not officially supported.
126130

127131
### Embeddings models
128132

articles/ai-services/openai/whats-new.md

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.service: azure-ai-openai
99
ms.custom:
1010
- ignite-2023
1111
ms.topic: whats-new
12-
ms.date: 10/16/2023
12+
ms.date: 10/17/2023
1313
recommendations: false
1414
keywords:
1515
---
@@ -18,6 +18,16 @@ keywords:
1818

1919
## November 2023
2020

21+
### GPT-4-1106-preview & GPT-3.5-Turbo-1106 released
22+
23+
Both models are the latest release from OpenAI with improved instruction following, [JSON mode](./how-to/json-mode.md), [reproducible output](./how-to/reproducible-output.md), and parallel function calling.
24+
25+
- **GPT-4-1106-preview (also known as GPT-4 Turbo)** has a max context window of 128,000 tokens and can generate 4,096 output tokens. It has the latest training data with knowledge up to April 2023. This model is in preview and is not recommended for production use. All deployments of this preview model will be automatically updated in place once the stable release becomes available.
26+
27+
- **GPT-3.5-Turbo-1106** has a max context window of 16,385 tokens and can generate 4,096 output tokens.
28+
29+
For information on model regional availability consult the [models page](./concepts/models.md).
30+
2131
### DALL-E 3 public preview
2232

2333
DALL-E 3 is the latest image generation model from OpenAI. It features enhanced image quality, more complex scenes, and improved performance when rendering text in images. It also comes with more aspect ratio options. DALL-E 3 is available through OpenAI Studio and through the REST API. Your OpenAI resource must be in the `SwedenCentral` Azure region.

0 commit comments

Comments
 (0)