Commit 56853c4

Learn Build Service GitHub App authored and committed
Merging changes synced from https://github.com/MicrosoftDocs/azure-docs-pr (branch live)
2 parents 10ed9c1 + 4ecf6a5 commit 56853c4

112 files changed

+3322
-1579
lines changed

.openpublishing.redirection.json

Lines changed: 10 additions & 6 deletions
Original file line number | Diff line number | Diff line change
@@ -1,5 +1,15 @@
11
{
22
"redirections": [
3+
{
4+
"source_path": "articles/network-watcher/network-watcher-security-group-view-powershell.md",
5+
"redirect_url": "/previous-versions/azure/network-watcher/network-watcher-security-group-view-powershell",
6+
"redirect_document_id": false
7+
},
8+
{
9+
"source_path": "articles/network-watcher/network-watcher-security-group-view-cli.md",
10+
"redirect_url": "/previous-versions/azure/network-watcher/network-watcher-security-group-view-cli",
11+
"redirect_document_id": false
12+
},
313
{
414
"source_path": "articles/storage/blobs/blob-v11-samples-dotnet.md",
515
"redirect_url": "/previous-versions/azure/storage/blobs/blob-v11-samples-dotnet",
@@ -3430,12 +3440,6 @@
34303440
"redirect_url": "/azure/ai-services/anomaly-detector/quickstarts/detect-data-anomalies-python",
34313441
"redirect_document_id": false
34323442
},
3433-
{
3434-
"source_path_from_root": "/articles/ai-studio/how-to/model-catalog.md",
3435-
"redirect_url": "/azure/ai-studio/how-to/model-catalog-overview",
3436-
"redirect_document_id": false
3437-
},
3438-
34393443
{
34403444
"source_path_from_root": "/articles/service-fabric/upgrade-managed-disks.md",
34413445
"redirect_url": "/azure/service-fabric/service-fabric-scale-up-primary-node-type",
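
The hunk above removes the `model-catalog.md` redirect from the root redirection file, and a later hunk in this commit declares the same source in the ai-studio redirection file. The following is a minimal Python sketch of how one might flag a source path declared more than once across redirection files. It assumes that `source_path` and `source_path_from_root` both resolve relative to the repository root, so that stripping the leading slash makes them comparable; the function names are hypothetical, and this commit does not state how the publishing build treats duplicates.

```python
from collections import Counter


def source_key(entry):
    """Return the entry's source path without a leading slash.

    Assumption: both key styles are relative to the repository root.
    """
    path = entry.get("source_path_from_root") or entry.get("source_path", "")
    return path.lstrip("/")


def duplicate_sources(*redirection_docs):
    """List source paths that appear more than once across the documents."""
    counts = Counter(
        source_key(entry)
        for doc in redirection_docs
        for entry in doc.get("redirections", [])
    )
    return sorted(path for path, n in counts.items() if n > 1)


# Illustrative data: the root file as it looked before this commit,
# plus the ai-studio file after it.
root = {"redirections": [
    {"source_path": "articles/network-watcher/network-watcher-security-group-view-cli.md",
     "redirect_url": "/previous-versions/azure/network-watcher/network-watcher-security-group-view-cli",
     "redirect_document_id": False},
    {"source_path_from_root": "/articles/ai-studio/how-to/model-catalog.md",
     "redirect_url": "/azure/ai-studio/how-to/model-catalog-overview",
     "redirect_document_id": False},
]}
ai_studio = {"redirections": [
    {"source_path_from_root": "/articles/ai-studio/how-to/model-catalog.md",
     "redirect_url": "/azure/ai-studio/how-to/model-catalog-overview",
     "redirect_document_id": False},
]}

print(duplicate_sources(root, ai_studio))
```

With the entry still present in both files, the check reports `articles/ai-studio/how-to/model-catalog.md`; once it is removed from the root file, as this hunk does, the list is empty.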

articles/ai-services/openai/concepts/models.md

Lines changed: 24 additions & 33 deletions
@@ -14,11 +14,11 @@ recommendations: false
1414

1515
# Azure OpenAI Service models
1616

17-
Azure OpenAI Service is powered by a diverse set of models with different capabilities and price points. Model availability varies by region. For GPT-3 and other models retiring in July 2024, see [Azure OpenAI Service legacy models](./legacy-models.md).
17+
Azure OpenAI Service is powered by a diverse set of models with different capabilities and price points. Model availability varies by region. For GPT-3 and other models retiring in July 2024, see [Azure OpenAI Service legacy models](./legacy-models.md).
1818

1919
| Models | Description |
2020
|--|--|
21-
| [GPT-4 Turbo 🆕](#gpt-4-turbo) | The latest most capable Azure OpenAI models with multimodal versions which can accept both text and images as input. |
21+
| [GPT-4 Turbo **NEW**](#gpt-4-turbo) | The latest most capable Azure OpenAI models with multimodal versions, which can accept both text and images as input. |
2222
| [GPT-4](#gpt-4) | A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
2323
| [GPT-3.5](#gpt-35) | A set of models that improve on GPT-3 and can understand and generate natural language and code. |
2424
| [Embeddings](#embeddings-models) | A set of models that can convert text into numerical vector form to facilitate text similarity. |
@@ -48,29 +48,25 @@ You can see the token context length supported by each model in the [model summa
4848

4949
See [model versions](../concepts/model-versions.md) to learn about how Azure OpenAI Service handles model version upgrades, and [working with models](../how-to/working-with-models.md) to learn how to view and configure the model version settings of your GPT-4 deployments.
5050

51-
| Model ID | Max Request (tokens) | Training Data (up to) |
52-
| --- | :--- | :---: |
53-
| `gpt-4` (0314) | 8,192 | Sep 2021 |
54-
| `gpt-4-32k`(0314) | 32,768 | Sep 2021 |
55-
| `gpt-4` (0613) | 8,192 | Sep 2021 |
56-
| `gpt-4-32k` (0613) | 32,768 | Sep 2021 |
57-
| `gpt-4` (1106-Preview)**<sup>1</sup>**<br>**GPT-4 Turbo Preview** | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
58-
| `gpt-4` (0125-Preview)**<sup>1</sup>**<br>**GPT-4 Turbo Preview** | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
59-
| `gpt-4` (vision-preview)**<sup>2</sup>**<br>**GPT-4 Turbo with Vision Preview** | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
60-
| `gpt-4` (turbo-2024-04-09) 🆕 <br>**GPT-4 Turbo with Vision GA** | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
61-
62-
**<sup>1</sup>** GPT-4 Turbo Preview = `gpt-4` (0125-Preview) or `gpt-4` (1106-Preview). To deploy this model, under **Deployments** select model **gpt-4**. Under version select (0125-Preview) or (1106-Preview).
63-
64-
**<sup>2</sup>** GPT-4 Turbo with Vision Preview = `gpt-4` (vision-preview). To deploy this model, under **Deployments** select model **gpt-4**. For **Model version** select **vision-preview**.
51+
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
52+
| --- | :--- |:--- |:---: |
53+
| `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **Latest GA model** <br> - Replacement for all GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4-turbo) is currently different depending on method of input, and deployment type. <br> - Does **not support** enhancements. | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
54+
| `gpt-4` (0125-Preview)*<br>**GPT-4 Turbo Preview** | **Preview Model** <br> - Replaces 1106-Preview <br> - Better code generation performance <br> - Reduces cases where the model doesn't complete a task <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
55+
| `gpt-4` (vision-preview)<br>**GPT-4 Turbo with Vision Preview** | **Preview model** <br> - Accepts text and image input. <br> - Supports enhancements <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
56+
| `gpt-4` (1106-Preview)<br>**GPT-4 Turbo Preview** | **Preview Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
57+
| `gpt-4-32k` (0613) | **Older GA model** <br> - Basic function calling with tools | 32,768 | Sep 2021 |
58+
| `gpt-4` (0613) | **Older GA model** <br> - Basic function calling with tools | 8,192 | Sep 2021 |
59+
| `gpt-4-32k`(0314) | **Older GA model** <br> - [Retirement information](./model-retirements.md#current-models) | 32,768 | Sep 2021 |
60+
| `gpt-4` (0314) | **Older GA model** <br> - [Retirement information](./model-retirements.md#current-models) | 8,192 | Sep 2021 |
6561

6662
> [!CAUTION]
67-
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to future preview versions and a stable version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
63+
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to either future preview versions or to the latest stable/GA version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
6864
6965
> [!NOTE]
7066
> Version `0314` of `gpt-4` and `gpt-4-32k` will be retired no earlier than July 5, 2024. Version `0613` of `gpt-4` and `gpt-4-32k` will be retired no earlier than September 30, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
7167
7268
- GPT-4 version 0125-preview is an updated version of the GPT-4 Turbo preview previously released as version 1106-preview.
73-
- GPT-4 version 0125-preview completes tasks such as code generation more completely compared to gpt-4-1106-preview. Because of this, depending on the task, customers may find that GPT-4-0125-preview generates more output compared to the gpt-4-1106-preview. We recommend customers compare the outputs of the new model. GPT-4-0125-preview also addresses bugs in gpt-4-1106-preview with UTF-8 handling for non-English languages. GPT-4 version `turbo-2024-04-09` is the latest GA release and replaces `0125-Preview`, `1106-preview`, and `vision-preview`.
69+
- GPT-4 version 0125-preview completes tasks such as code generation more completely compared to gpt-4-1106-preview. Because of this, depending on the task, customers may find that GPT-4-0125-preview generates more output compared to the gpt-4-1106-preview. We recommend customers compare the outputs of the new model. GPT-4-0125-preview also addresses bugs in gpt-4-1106-preview with UTF-8 handling for non-English languages. GPT-4 version `turbo-2024-04-09` is the latest GA release and replaces `0125-Preview`, `1106-preview`, and `vision-preview`.
7470

7571
> [!IMPORTANT]
7672
>
@@ -80,14 +76,20 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
8076

8177
GPT-3.5 models can understand and generate natural language or code. The most capable and cost effective model in the GPT-3.5 family is GPT-3.5 Turbo, which has been optimized for chat and works well for traditional completions tasks as well. GPT-3.5 Turbo is available for use with the Chat Completions API. GPT-3.5 Turbo Instruct has similar capabilities to `text-davinci-003` using the Completions API instead of the Chat Completions API. We recommend using GPT-3.5 Turbo and GPT-3.5 Turbo Instruct over [legacy GPT-3.5 and GPT-3 models](./legacy-models.md).
8278

83-
- `gpt-35-turbo`
84-
- `gpt-35-turbo-16k`
85-
- `gpt-35-turbo-instruct`
8679

87-
You can see the token context length supported by each model in the [model summary table](#model-summary-table-and-region-availability).
80+
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
81+
| --------- |:---|:------:|:----:|
82+
| `gpt-35-turbo` (0125) **NEW** | **Latest GA Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) <br> - Higher accuracy at responding in requested formats. <br> - Fix for a bug which caused a text encoding issue for non-English language function calls. | Input: 16,385<br> Output: 4,096 | Sep 2021 |
83+
| `gpt-35-turbo` (1106) | **Older GA Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 16,385<br> Output: 4,096 | Sep 2021|
84+
| `gpt-35-turbo-instruct` (0914) | **Completions endpoint only** | 4,097 |Sep 2021 |
85+
| `gpt-35-turbo-16k` (0613) | **Older GA Model** <br> - Basic function calling with tools | 16,384 | Sep 2021 |
86+
| `gpt-35-turbo` (0613) | **Older GA Model** <br> - Basic function calling with tools | 4,096 | Sep 2021 |
87+
| `gpt-35-turbo`**<sup>1</sup>** (0301) | **Older GA Model** <br> - [Retirement information](./model-retirements.md#current-models) | 4,096 | Sep 2021 |
8888

8989
To learn more about how to interact with GPT-3.5 Turbo and the Chat Completions API check out our [in-depth how-to](../how-to/chatgpt.md).
9090

91+
**<sup>1</sup>** This model will accept requests > 4,096 tokens. Exceeding the 4,096 input token limit is not recommended, because newer versions of the model are capped at 4,096 tokens. Exceeding 4,096 input tokens with this model is not an officially supported configuration.
92+
9193
## Embeddings
9294

9395
`text-embedding-3-large` is the latest and most capable embedding model. Upgrading between embeddings models is not possible. In order to move from using `text-embedding-ada-002` to `text-embedding-3-large` you would need to generate new embeddings.
@@ -188,23 +190,12 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
188190
> [!NOTE]
189191
> Version `0613` of `gpt-35-turbo` and `gpt-35-turbo-16k` will be retired no earlier than August 1, 2024. Version `0301` of `gpt-35-turbo` will be retired no earlier than August 1, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
190192
191-
| Model ID | Max Request (tokens) | Training Data (up to) |
192-
| --------- |:------:|:----:|
193-
| `gpt-35-turbo`**<sup>1</sup>** (0301) | 4,096 | Sep 2021 |
194-
| `gpt-35-turbo` (0613) | 4,096 | Sep 2021 |
195-
| `gpt-35-turbo-16k` (0613) | 16,384 | Sep 2021 |
196-
| `gpt-35-turbo-instruct` (0914) | 4,097 |Sep 2021 |
197-
| `gpt-35-turbo` (1106) | Input: 16,385<br> Output: 4,096 | Sep 2021|
198-
| `gpt-35-turbo` (0125) **NEW** | Input: 16,385<br> Output: 4,096 | Sep 2021 |
199-
200193
### GPT-3.5-Turbo model availability
201194

202195
#### Public cloud regions
203196

204197
[!INCLUDE [GPT-35-Turbo](../includes/model-matrix/standard-gpt-35-turbo.md)]
205198

206-
**<sup>1</sup>** This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
207-
208199
#### Azure Government regions
209200

210201
The following GPT-3.5 turbo models are available with [Azure Government](/azure/azure-government/documentation-government-welcome):
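
The reworked tables in this file list either a single "Max Request (tokens)" figure or separate input and output limits per model version. As a rough illustration of how those figures might be applied, here is a small Python sketch that checks a planned request against them. The figures are transcribed from the tables in this diff. The `fits` helper is hypothetical, and treating a single figure as a shared budget for prompt plus output is an assumption, not something the diff states.

```python
# Limits transcribed from the tables in this diff. A single integer is the
# "Max Request (tokens)" figure; a tuple is (input limit, output limit).
LIMITS = {
    ("gpt-4", "turbo-2024-04-09"): (128_000, 4_096),
    ("gpt-4", "0125-Preview"): (128_000, 4_096),
    ("gpt-4-32k", "0613"): 32_768,
    ("gpt-4", "0613"): 8_192,
    ("gpt-35-turbo", "0125"): (16_385, 4_096),
    ("gpt-35-turbo", "1106"): (16_385, 4_096),
    ("gpt-35-turbo-instruct", "0914"): 4_097,
    ("gpt-35-turbo-16k", "0613"): 16_384,
    ("gpt-35-turbo", "0613"): 4_096,
}


def fits(model, version, prompt_tokens, max_output_tokens):
    """Return True if the planned request stays within the listed limits."""
    limit = LIMITS[(model, version)]
    if isinstance(limit, tuple):
        max_in, max_out = limit
        return prompt_tokens <= max_in and max_output_tokens <= max_out
    # Assumption: a single figure is one budget shared by prompt and output.
    return prompt_tokens + max_output_tokens <= limit


print(fits("gpt-4", "turbo-2024-04-09", 100_000, 4_096))
print(fits("gpt-4", "0613", 8_000, 500))
```

Token counts would have to come from a tokenizer that matches the deployed model; the sketch takes them as given.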

articles/ai-services/openai/concepts/use-your-data.md

Lines changed: 1 addition & 2 deletions
@@ -169,8 +169,7 @@ You might want to use Azure Blob Storage as a data source if you want to connect
169169
## Schedule automatic index refreshes
170170

171171
> [!NOTE]
172-
> * Automatic index refreshing is supported for Azure Blob Storage only.
173-
> * If a document is deleted from input blob container, the corresponding chunk index records won't be removed by the scheduled refresh.
172+
> Automatic index refreshing is supported for Azure Blob Storage only.
174173
175174
To keep your Azure AI Search index up-to-date with your latest data, you can schedule an automatic index refresh rather than manually updating it every time your data is updated. Automatic index refresh is only available when you choose **Azure Blob Storage** as the data source. To enable an automatic index refresh:
176175

articles/ai-studio/.openpublishing.redirection.ai-studio.json

Lines changed: 5 additions & 0 deletions
@@ -24,6 +24,11 @@
2424
"source_path_from_root": "/articles/ai-studio/quickstarts/assistants.md",
2525
"redirect_url": "/azure/ai-services/openai/assistants-quickstart",
2626
"redirect_document_id": true
27+
},
28+
{
29+
"source_path_from_root": "/articles/ai-studio/how-to/model-catalog.md",
30+
"redirect_url": "/azure/ai-studio/how-to/model-catalog-overview",
31+
"redirect_document_id": false
2732
}
2833
]
2934
}
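
Taken together with the removal in `.openpublishing.redirection.json`, this hunk moves the `model-catalog.md` redirect rather than dropping it. A reviewer could confirm that kind of move with a sketch like the one below, which lists redirects removed from one file that no other file redeclares with the same target. The helper names are hypothetical, and the sketch assumes both source-key styles are relative to the repository root.

```python
def source_map(doc):
    """Map each normalized source path to its redirect_url."""
    result = {}
    for entry in doc.get("redirections", []):
        path = entry.get("source_path_from_root") or entry.get("source_path", "")
        result[path.lstrip("/")] = entry["redirect_url"]
    return result


def dropped_redirects(before, after, *other_files):
    """Sources removed between `before` and `after` that no other file
    redeclares with the same redirect_url."""
    old, new = source_map(before), source_map(after)
    elsewhere = {}
    for doc in other_files:
        elsewhere.update(source_map(doc))
    return sorted(
        path for path, url in old.items()
        if path not in new and elsewhere.get(path) != url
    )


# Illustrative excerpts built from entries shown in this commit.
root_before = {"redirections": [
    {"source_path_from_root": "/articles/ai-studio/how-to/model-catalog.md",
     "redirect_url": "/azure/ai-studio/how-to/model-catalog-overview"},
    {"source_path_from_root": "/articles/service-fabric/upgrade-managed-disks.md",
     "redirect_url": "/azure/service-fabric/service-fabric-scale-up-primary-node-type"},
]}
root_after = {"redirections": [
    {"source_path_from_root": "/articles/service-fabric/upgrade-managed-disks.md",
     "redirect_url": "/azure/service-fabric/service-fabric-scale-up-primary-node-type"},
]}
ai_studio_after = {"redirections": [
    {"source_path_from_root": "/articles/ai-studio/quickstarts/assistants.md",
     "redirect_url": "/azure/ai-services/openai/assistants-quickstart"},
    {"source_path_from_root": "/articles/ai-studio/how-to/model-catalog.md",
     "redirect_url": "/azure/ai-studio/how-to/model-catalog-overview"},
]}

print(dropped_redirects(root_before, root_after, ai_studio_after))
```

An empty result means every removed redirect reappears elsewhere with the same target; omitting the ai-studio file from the call would report the model-catalog path as dropped.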
