Skip to content

Commit 2aae86d

Browse files
committed
update
1 parent ec86304 commit 2aae86d

File tree

2 files changed

+26
-2
lines changed

2 files changed

+26
-2
lines changed

articles/ai-services/openai/how-to/deployment-types.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@ Azure OpenAI offers three types of deployments. These provide a varied level of
3030

3131
| **Offering** | **Global-Batch** | **Global-Standard** | **Standard** | **Provisioned** |
3232
|---|:---|:---|:---|:---|
33-
| **Best suited for** | Offline scoring <br><br> Workloads that are not latency sensitive and can be completed in hours.| Applications that don’t require data residency. Recommended starting place for customers. | For customers with data residency requirements. Optimized for low to medium volume. | Real-time scoring for large consistent volume. Includes the highest commitments and limits.|
33+
| **Best suited for** | Offline scoring <br><br> Workloads that are not latency sensitive and can be completed in hours.<br><br> For use cases that do not have data processing residency requirements.| Recommended starting place for customers. <br><br>Global-Standard will have the higher default quota and larger number of models available than Standard. <br><br> For production applications that do not have data processing residency requirements. | For customers with data processing residency requirements. | For customers with data residency requirements. Optimized for low to medium volume. | Real-time scoring for large consistent volume. Includes the highest commitments and limits.|
3434
| **How it works** | Offline processing via files |Traffic may be routed anywhere in the world | | |
3535
| **Getting started** | [Global-Batch](./batch.md) | [Model deployment](./create-resource.md) | [Model deployment](./create-resource.md) | [Provisioned onboarding](./provisioned-throughput-onboarding.md) |
3636
| **Cost** | [Least expensive option](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) <br> 50% less cost compared to Global Standard prices. Access to all new models with larger quota allocations. | [Global deployment pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) | [Regional pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) | May experience cost savings for consistent usage |

articles/ai-services/openai/whats-new.md

Lines changed: 25 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,14 +10,38 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: whats-new
13-
ms.date: 08/02/2024
13+
ms.date: 08/05/2024
1414
recommendations: false
1515
---
1616

1717
# What's new in Azure OpenAI Service
1818

1919
This article provides a summary of the latest releases and major documentation updates for Azure OpenAI.
2020

21+
## August 2024
22+
23+
### Global batch deployments are now available
24+
25+
The Azure OpenAI Batch API is designed to handle large-scale and high-volume processing tasks efficiently. Process asynchronous groups of requests with separate quota, a 24-hour turnaround time, at [50% less cost than global standard](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/). With batch processing, rather than send one request at a time you send a large number of requests in a single file. Global batch requests have a separate enqueued token quota avoiding any disruption of your online workloads.
26+
27+
Key use cases include:
28+
29+
* **Large-Scale Data Processing:** Quickly analyze extensive datasets in parallel.
30+
31+
* **Content Generation:** Create large volumes of text, such as product descriptions or articles.
32+
33+
* **Document Review and Summarization:** Automate the review and summarization of lengthy documents.
34+
35+
* **Customer Support Automation:** Handle numerous queries simultaneously for faster responses.
36+
37+
* **Data Extraction and Analysis:** Extract and analyze information from vast amounts of unstructured data.
38+
39+
* **Natural Language Processing (NLP) Tasks:** Perform tasks like sentiment analysis or translation on large datasets.
40+
41+
* **Marketing and Personalization:** Generate personalized content and recommendations at scale.
42+
43+
For more information on [getting started with global batch deployments](./how-to/batch.md).
44+
2145
## July 2024
2246

2347
### GPT-4o mini is now available for fine-tuning

0 commit comments

Comments
 (0)