Commit 776ad9d

Merge pull request #279829 from MicrosoftDocs/main
7/1/2024 PM Publish
2 parents e27dd17 + d53e669 commit 776ad9d

97 files changed

+2719
-1346
lines changed

.openpublishing.redirection.json

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4193,11 +4193,17 @@
41934193
"redirect_url": "/azure",
41944194
"redirect_document_id": false
41954195
},
4196+
{
4197+
4198+
"source_path_from_root": "/articles/site-recovery/move-vaults-across-regions.md",
4199+
"redirect_url": "/azure/operational-excellence/relocation-site-recovery",
4200+
"redirect_document_id": false
4201+
},
41964202
{
41974203
"source_path_from_root": "/articles/defender-for-iot/organizations/tutorial-clearpass.md",
41984204
"redirect_url": "/azure/defender-for-iot/organizations/integrate-overview",
41994205
"redirect_document_id": false
42004206
}
4201-
4207+
42024208
]
42034209
}
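
The entry added in this hunk uses the same three keys as its neighbors. As a rough illustration of a check a reviewer might run before publishing, here is a minimal Python sketch. It assumes the file's top-level array is named `redirections`, which this hunk does not show. The helper name `find_bad_redirects` is hypothetical and not part of any publishing tool.

```python
import json

# Keys that every entry visible in this hunk carries.
REQUIRED_KEYS = {"source_path_from_root", "redirect_url", "redirect_document_id"}


def find_bad_redirects(doc, list_key="redirections"):
    """Return (index, problem) pairs for malformed or duplicate entries.

    `list_key` is an assumption: the hunk shows only the tail of the array,
    not the name of the property that holds it.
    """
    problems = []
    seen_sources = set()
    for index, entry in enumerate(doc.get(list_key, [])):
        missing = REQUIRED_KEYS - entry.keys()
        if missing:
            problems.append((index, "missing keys: " + ", ".join(sorted(missing))))
            continue
        source = entry["source_path_from_root"]
        if source in seen_sources:
            problems.append((index, "duplicate source: " + source))
        seen_sources.add(source)
    return problems


# The two entries below are copied from the hunk above.
sample = json.loads("""
{
  "redirections": [
    {
      "source_path_from_root": "/articles/site-recovery/move-vaults-across-regions.md",
      "redirect_url": "/azure/operational-excellence/relocation-site-recovery",
      "redirect_document_id": false
    },
    {
      "source_path_from_root": "/articles/defender-for-iot/organizations/tutorial-clearpass.md",
      "redirect_url": "/azure/defender-for-iot/organizations/integrate-overview",
      "redirect_document_id": false
    }
  ]
}
""")

if __name__ == "__main__":
    for index, problem in find_bad_redirects(sample):
        print(f"entry {index}: {problem}")
```

A duplicate `source_path_from_root` is flagged because two redirects for the same source path would conflict. Whether the real build treats that as an error is not stated in this commit.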

articles/ai-services/language-service/question-answering/overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ ms.custom: language-service-question-answering
1414
# What is custom question answering?
1515

1616
> [!NOTE]
17-
> [Azure Open AI On Your Data](../../openai/concepts/use-your-data.md) utilizes large language models (LLMs) to produce similar results to Custom Question Answering. If you wish to connect an existing Custom Question Answering project to Azure Open AI On Your Data, please check out our [guide]( how-to/azure-openai-integration.md).
17+
> [Azure OpenAI On Your Data](../../openai/concepts/use-your-data.md) utilizes large language models (LLMs) to produce similar results to Custom Question Answering. If you wish to connect an existing Custom Question Answering project to Azure OpenAI On Your Data, please check out our [guide]( how-to/azure-openai-integration.md).
1818
1919
Custom question answering provides cloud-based Natural Language Processing (NLP) that allows you to create a natural conversational layer over your data. It is used to find appropriate answers from customer input or from a project.
2020

articles/ai-services/openai/api-version-deprecation.md

Lines changed: 2 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ services: cognitive-services
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: conceptual
8-
ms.date: 05/20/2024
8+
ms.date: 07/01/2024
99
author: mrbullwinkle
1010
ms.author: mbullwin
1111
recommendations: false
@@ -14,7 +14,7 @@ ms.custom:
1414

1515
# Azure OpenAI API preview lifecycle
1616

17-
This article is to help you understand the support lifecycle for the Azure OpenAI API previews. New preview APIs target a monthly release cadence. After July 1, 2024, the latest three preview APIs will remain supported while older APIs will no longer be supported unless support is explicitly indicated.
17+
This article is to help you understand the support lifecycle for the Azure OpenAI API previews. New preview APIs target a monthly release cadence. After February 3rd, 2025, the latest three preview APIs will remain supported while older APIs will no longer be supported unless support is explicitly indicated.
1818

1919
> [!NOTE]
2020
> The `2023-06-01-preview` API will remain supported at this time, as `DALL-E 2` is only available in this API version. `DALL-E 3` is supported in the latest API releases. The `2023-10-01-preview` API will also remain supported at this time.
@@ -59,18 +59,6 @@ is currently the latest GA API release. This API version is the replacement for
5959

6060
This version contains support for the latest GA features like Whisper, DALL-E 3, fine-tuning, on your data, etc. Any preview features that were released after the `2023-12-01-preview` release like Assistants, TTS, certain on your data datasources, are only supported in the latest preview API releases.
6161

62-
## Retiring soon
63-
64-
On July 1, 2024 the following API preview releases will be retired and will stop accepting API requests:
65-
66-
- 2023-03-15-preview
67-
- 2023-07-01-preview
68-
- 2023-08-01-preview
69-
- 2023-09-01-preview
70-
- 2023-12-01-preview
71-
72-
To avoid service disruptions, you must update to use the latest preview version before the retirement date.
73-
7462
## Updating API versions
7563

7664
We recommend first testing the upgrade to new API versions to confirm there's no impact to your application from the API update before making the change globally across your environment.
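
The lifecycle rule in this file (the latest three preview APIs stay supported, plus any explicitly indicated exceptions) can be sketched as a small Python function. This is only an illustration of the rule as worded in the article. The function name `split_preview_versions` is hypothetical, and the input list reuses version strings that appear in this diff. It is not the service's full release list, so the result says nothing about actual support status.

```python
from datetime import date


def split_preview_versions(versions, keep_latest=3, exceptions=()):
    """Split preview API versions into (supported, unsupported), newest first.

    Assumes version strings shaped like "2023-12-01-preview", where the
    first ten characters are an ISO date.
    """
    def release_date(version):
        return date.fromisoformat(version[:10])

    ordered = sorted(set(versions), key=release_date, reverse=True)
    # The newest `keep_latest` versions, plus explicitly excepted ones.
    supported = set(ordered[:keep_latest]) | (set(exceptions) & set(ordered))
    return (
        [v for v in ordered if v in supported],
        [v for v in ordered if v not in supported],
    )


# Illustrative input only: strings taken from this diff, not a complete list.
example_versions = [
    "2023-03-15-preview",
    "2023-06-01-preview",
    "2023-07-01-preview",
    "2023-08-01-preview",
    "2023-09-01-preview",
    "2023-10-01-preview",
    "2023-12-01-preview",
]

# The note in the article names these two as explicitly kept.
example_exceptions = ("2023-06-01-preview", "2023-10-01-preview")

if __name__ == "__main__":
    kept, dropped = split_preview_versions(example_versions, exceptions=example_exceptions)
    print("supported:", kept)
    print("unsupported:", dropped)
```

Passing the exceptions separately keeps the "latest three" rule and the explicitly indicated carve-outs visible as two distinct inputs, which mirrors how the article states them.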

articles/ai-services/openai/concepts/models.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 06/25/2024
7+
ms.date: 07/01/2024
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -34,7 +34,7 @@ GPT-4o is the latest model from OpenAI. GPT-4o integrates text and images in a s
3434

3535
GPT-4o is available for **standard** and **global-standard** model deployment.
3636

37-
You need to [create](../how-to/create-resource.md) or use an existing resource in a [supported standard](#gpt-4-and-gpt-4-turbo-model-availability) or [global standard](#global-standard-model-availability-preview) region where the model is available.
37+
You need to [create](../how-to/create-resource.md) or use an existing resource in a [supported standard](#gpt-4-and-gpt-4-turbo-model-availability) or [global standard](#global-standard-model-availability) region where the model is available.
3838

3939
When your resource is created, you can [deploy](../how-to/create-resource.md#deploy-a-model) the GPT-4o model. If you are performing a programmatic deployment, the **model** name is `gpt-4o`, and the **version** is `2024-05-13`.
4040

@@ -164,7 +164,7 @@ You need to speak with your Microsoft sales/account team to acquire provisioned
164164

165165
For more information on Provisioned deployments, see our [Provisioned guidance](./provisioned-throughput.md).
166166

167-
### Global standard model availability (preview)
167+
### Global standard model availability
168168

169169
**Supported models:**
170170

articles/ai-services/openai/how-to/deployment-types.md

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ author: mrbullwinkle
77
manager: nitinme
88
ms.service: azure-ai-openai
99
ms.topic: how-to
10-
ms.date: 05/19/2024
10+
ms.date: 07/01/2024
1111
ms.author: mbullwin
1212
---
1313

@@ -28,7 +28,7 @@ Our global deployments will be the first location for all new models and feature
2828

2929
Azure OpenAI offers three types of deployments. These provide a varied level of capabilities that provide trade-offs on: throughput, SLAs, and price. Below is a summary of the options followed by a deeper description of each.
3030

31-
| **Offering** | **Global-Standard** <sup>**1**</sup> | **Standard** | **Provisioned** |
31+
| **Offering** | **Global-Standard** | **Standard** | **Provisioned** |
3232
|---|:---|:---|:---|
3333
| **Best suited for** | Applications that don’t require data residency. Recommended starting place for customers. | For customers with data residency requirements. Optimized for low to medium volume. | Real-time scoring for large consistent volume. Includes the highest commitments and limits.|
3434
| **How it works** | Traffic may be routed anywhere in the world | | |
@@ -40,8 +40,6 @@ Azure OpenAI offers three types of deployments. These provide a varied level of
4040
| **Sku Name in code** | `GlobalStandard` | `Standard` | `ProvisionedManaged` |
4141
| **Billing model** | Pay-per-token | Pay-per-token | Monthly Commitments |
4242

43-
<sup>**1**</sup> Global-Standard deployment type is currently in preview.
44-
4543
## Provisioned
4644

4745
Provisioned deployments allow you to specify the amount of throughput you require in a deployment. The service then allocates the necessary model processing capacity and ensures it's ready for you. Throughput is defined in terms of provisioned throughput units (PTU) which is a normalized way of representing the throughput for your deployment. Each model-version pair requires different amounts of PTU to deploy and provide different amounts of throughput per PTU. Learn more from our [Provisioned throughput concepts article](../concepts/provisioned-throughput.md).
@@ -52,7 +50,7 @@ Standard deployments provide a pay-per-call billing model on the chosen model. P
5250

5351
Standard deployments are optimized for low to medium volume workloads with high burstiness. Customers with high consistent volume may experience greater latency variability.
5452

55-
## Global standard (preview)
53+
## Global standard
5654

5755
Global deployments are available in the same Azure OpenAI resources as non-global offers but allow you to leverage Azure's global infrastructure to dynamically route traffic to the data center with best availability for each request. Global standard will provide the highest default quota for new models and eliminates the need to load balance across multiple resources.
5856

articles/ai-services/openai/how-to/use-web-app.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ Sample source code for the web app is available on [GitHub](https://github.com/m
5050
> [!NOTE]
5151
> After February 1, 2024, the web app requires the app startup command to be set to `python3 -m gunicorn app:app`. When updating an app that was published prior to February 1, 2024, you need to manually add the startup command from the **App Service Configuration** page.
5252
53-
We recommend pulling changes from the `main` branch for the web app's source code frequently to ensure you have the latest bug fixes, API version, and improvements. Additionally, the web app must be synchronized every time the API version being used is [retired](../api-version-deprecation.md#retiring-soon).
53+
We recommend pulling changes from the `main` branch for the web app's source code frequently to ensure you have the latest bug fixes, API version, and improvements. Additionally, the web app must be synchronized every time the API version being used is [retired](../api-version-deprecation.md)
5454

5555
Consider either clicking the **watch** or **star** buttons on the web app's [GitHub](https://github.com/microsoft/sample-app-aoai-chatGPT) repo to be notified about changes and updates to the source code.
5656

articles/ai-services/openai/quotas-limits.md

Lines changed: 1 addition & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 06/21/2024
13+
ms.date: 07/01/2024
1414
ms.author: mbullwin
1515
---
1616

@@ -60,9 +60,6 @@ The following sections provide you with a quick guide to the default quotas and
6060

6161
### gpt-4o global standard
6262

63-
> [!NOTE]
64-
> The [global standard model deployment type](./how-to/deployment-types.md#deployment-types) is currently in public preview.
65-
6663
|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
6764
|---|:---:|:---:|
6865
|Enterprise agreement | 10 M | 60 K |
