Skip to content

Commit 5f236cf

Browse files
authored
Merge pull request #5274 from MicrosoftDocs/release-2025-openai-may-28-2
Release AOAI May 29 -- 11:30 AM PT
2 parents cc7fa2e + 0f8fab4 commit 5f236cf

File tree

10 files changed

+800
-0
lines changed

10 files changed

+800
-0
lines changed

articles/ai-services/openai/api-version-lifecycle.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -268,6 +268,12 @@ Azure OpenAI API latest releases:
268268
## Changes between v1 preview release and 2025-04-01-preview
269269

270270
- [v1 preview API](#api-evolution)
271+
- [Video generation support](./concepts/video-generation.md)
272+
- **NEW** Responses API features:
273+
* Remote Model Context Protocol (MCP) servers tool integration
274+
* Support for asynchronous background tasks
275+
* Encrypted reasoning items
276+
* Image generation
271277

272278
## Changes between 2025-04-01-preview and 2025-03-01-preview
273279

articles/ai-services/openai/concepts/models.md

Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -244,6 +244,19 @@ Once access has been granted, you will need to create a deployment for the model
244244
|`dall-e-3` | East US<br>Australia East<br>Sweden Central|
245245
|`gpt-image-1` | West US 3 (Global Standard) <br> UAE North (Global Standard) |
246246

247+
248+
## Video generation models
249+
250+
Sora is an AI model from OpenAI that can create realistic and imaginative video scenes from text instructions. Sora is in public preview.
251+
252+
253+
254+
### Region availability
255+
256+
| Model | Region |
257+
|---|---|
258+
|`sora` | East US 2|
259+
247260
## Audio models
248261

249262
Audio models in Azure OpenAI are available via the `realtime`, `completions`, and `audio` APIs.
@@ -439,6 +452,20 @@ These models can only be used with Embedding API requests.
439452
| gpt-image-1 | 4,000 |
440453
| dall-e-3 | 4,000 |
441454

455+
# [Video Generation](#tab/standard-video-generations)
456+
457+
### Video generation models
458+
459+
| **Region** | **sora** |
460+
|:-----------------|:---------------------:|
461+
| eastus2 ||
462+
463+
| Model ID | Max Request (characters) |
464+
| --- | :---: |
465+
| sora | 4,000 |
466+
467+
468+
442469
# [Audio](#tab/standard-audio)
443470

444471
### Audio models
Lines changed: 54 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,54 @@
1+
---
2+
title: Sora video generation overview (preview)
3+
description: Learn about Sora, an AI model for generating realistic and imaginative video scenes from text instructions, including safety, limitations, and supported features.
4+
author: PatrickFarley
5+
ms.author: pafarley
6+
manager: nitinme
7+
ms.service: azure-ai-openai
8+
ms.topic: conceptual
9+
ms.date: 5/29/2025
10+
---
11+
12+
# Sora video generation (preview)
13+
14+
Sora is an AI model from OpenAI that can create realistic and imaginative video scenes from text instructions. The model is capable of generating a wide range of video content, including realistic scenes, animations, and special effects. Several video resolutions and durations are supported.
15+
16+
## Supported features
17+
18+
Sora can generate complex scenes with multiple characters, diverse motions, and detailed backgrounds. The model interprets prompts with contextual and physical world understanding, enabling accurate scene composition and character persistence across multiple shots. Sora demonstrates strong language comprehension for prompt interpretation and emotional character generation.
19+
20+
## How it works
21+
22+
Video generation is an asynchronous process. You create a job request with your text prompt and video format specifications, and the model processes the request in the background. You can check the status of the video generation job and, once it finishes, retrieve the generated video via a download URL.
23+
24+
## Best practices for prompts
25+
26+
Users should write text prompts in English or Latin script languages for the best video generation performance.
27+
28+
29+
## Limitations
30+
31+
### Content quality limitations
32+
33+
Sora might have difficulty with complex physics, causal relationships (for example, bite marks on a cookie), spatial reasoning (for example, knowing left from right), and precise time-based event sequencing such as camera movement.
34+
35+
### Technical limitations
36+
37+
Sora has some technical limitations to be aware of:
38+
39+
- Sora supports the following output resolution dimensions:
40+
480x480, 480x854, 854x480, 720x720, 720x1280, 1280x720, 1080x1080, 1080x1920, 1920x1080.
41+
- Sora supports video durations between 1 and 20 seconds.
42+
- You can request multiple video variants in a single job: for 1080p resolutions, this feature is disabled; for 720p, the maximum is two variants; for other resolutions, the maximum is four variants.
43+
- You can have two video creation jobs running at the same time. You must wait for one of the jobs to finish before you can create another.
44+
- Jobs are available for up to 24 hours after they're created. After that, you must create a new job to generate the video again.
45+
46+
## Responsible AI
47+
48+
Sora has a robust safety stack including content filtering, abuse monitoring, sensitive content blocking, and safety classifiers.
49+
50+
Sora doesn't generate scenes with acts of violence but can generate adjacent content, such as realistic war-like footage.
51+
52+
## Related content
53+
- [Video generation quickstart](../video-generation-quickstart.md)
54+
- [Image generation quickstart](../dall-e-quickstart.md)

0 commit comments

Comments
 (0)