Commit cce38c7

Sora 2 guide
1 parent 1bc9c22 commit cce38c7

File tree

1 file changed

+52
-11
lines changed

articles/ai-foundry/openai/concepts/video-generation.md

Lines changed: 52 additions & 11 deletions
@@ -13,16 +13,53 @@ ms.date: 09/16/2025
1313
# Video generation with Sora (preview)
1414

1515
Sora is an AI model from OpenAI that creates realistic and imaginative video scenes from text instructions and/or input images or video. The model can generate a wide range of video content, including realistic scenes, animations, and special effects. It supports several video resolutions and durations.
16+
Azure OpenAI supports two versions of Sora:
17+
- Sora (or Sora 1): Azure OpenAI–specific implementation released as an API in early preview.
18+
- Sora 2: The latest OpenAI-based API, now being adapted for Azure OpenAI, with Sora 2 Pro planned for future release.
19+
20+
## Overview
21+
- Modalities: text → video, image → video, video (generated) → video
22+
- Audio: Sora 2 supports audio generation in output videos (similar to the Sora app).
23+
- Remix: Sora 2 introduces the ability to remix existing videos by making targeted adjustments instead of regenerating from scratch.
24+
25+
## Sora 1 vs. Sora 2
26+
27+
| Aspect | **Sora 1 (Azure OpenAI)** | **Sora 2 (OpenAI-based API)** |
28+
|--------|-----------------------------|-------------------------------|
29+
| **Model type** | Azure-specific API implementation | Adapts OpenAI’s latest Sora API |
30+
| **Availability** | Available exclusively on Azure OpenAI (Preview) | Rolling out on Azure; **Sora 2 Pro** coming later |
31+
| **Modalities supported** | text → video, image → video, video → video | text → video, image → video, **video (generated) → video** |
32+
| **Audio generation** | ❌ Not supported | ✅ Supported in outputs |
33+
| **Remix capability** | ❌ Not supported | ✅ Supported — make targeted edits to existing videos |
34+
| **API behavior** | Uses Azure-specific API schema | Aligns with OpenAI’s native Sora 2 schema |
35+
| **Performance & fidelity** | Early preview; limited realism and motion range | Enhanced realism, physics, and temporal consistency |
36+
| **Intended use** | Enterprise preview deployments | Broader developer availability with improved API parity |
37+
38+
> **Note:** *Sora 2 Pro* will add higher-quality generation options, advanced camera controls, and improved motion rendering once available on Azure OpenAI.
39+
40+
## Sora 2 API
41+
The Sora 2 API provides five endpoints, each with a distinct capability:
42+
- Create Video: Start a new render job from a prompt, with optional reference inputs or a remix ID.
43+
- Get Video Status: Retrieve the current state of a render job and monitor its progress.
44+
- Download Video: Fetch the finished MP4 once the job is completed.
45+
- List Videos: Enumerate your videos with pagination for history, dashboards, or housekeeping.
46+
- Delete Video: Delete an individual video, by ID, from Azure OpenAI storage.
47+
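
The create, status, and download endpoints above suggest a submit-then-poll workflow. The following is a minimal sketch of the polling step only, not the official client. The helper name `wait_for_video`, the injected `get_status` callable, and the status strings `completed` and `failed` are assumptions made for illustration; substitute whatever your client and the service actually return.

```python
import time
from typing import Callable

# Assumed terminal status names; the guide only says a job has a "current state".
TERMINAL_STATES = {"completed", "failed"}


def wait_for_video(
    get_status: Callable[[str], str],
    video_id: str,
    poll_seconds: float = 5.0,
    max_polls: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call get_status(video_id) until it returns a terminal state.

    get_status is a hypothetical wrapper around the Get Video Status endpoint.
    Raises TimeoutError if the job is still running after max_polls checks.
    """
    for _ in range(max_polls):
        status = get_status(video_id)
        if status in TERMINAL_STATES:
            return status
        sleep(poll_seconds)
    raise TimeoutError(f"{video_id} did not finish after {max_polls} polls")
```

Once the returned status indicates completion, the Download Video endpoint can be called to fetch the MP4. Passing `get_status` and `sleep` as arguments keeps the loop independent of any particular HTTP library.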
48+
### API parameters
49+
50+
| Parameter | Type | **Sora 2** | **Sora 2 Pro (Coming Soon)** |
51+
|------------|------|------------|-------------------------------|
52+
| **Prompt** | String (required) | Natural-language description of the shot. Include shot type, subject, action, setting, lighting, and any desired camera motion to reduce ambiguity. Keep it *single-purpose* for best adherence. | Same as Sora 2 |
53+
| **Model** | String (optional) | `sora-2` (default) | `sora-2-pro` |
54+
| **Size (output resolution, width × height)** | String (optional) | Portrait: `720x1280` <br> Landscape: `1280x720` <br> **Default:** `720x1280` | Portrait: `720x1280`, `1024x1808` <br> Landscape: `1280x720`, `1808x1024` <br> **Default:** `720x1280` |
55+
| **Seconds** | String (optional) | `4`, `8`, or `12` <br> **Default:** `4` | Same as Sora 2 |
56+
| **Input reference** | File (optional) | Single reference image used as a visual anchor for the first frame. <br> Accepted MIME types: `image/jpeg`, `image/png`, `image/webp`. The image dimensions must match the requested size exactly. | Same as Sora 2 |
57+
| **Remix_video_id** | String (optional) | ID of a previously completed video (e.g., `video_...`) to reuse structure, motion, and framing. | Same as Sora 2 |
58+
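
The constraints in the parameter table can be checked on the client before a job is submitted. The sketch below is illustrative only: the function name `validate_request`, the lowercase model identifiers, and the use of a plain `x` in size strings are assumptions, and the allowed values are copied from the table rather than from the service.

```python
from typing import List, Optional

# Allowed values transcribed from the parameter table (assumed spelling: plain "x").
ALLOWED_SIZES = {
    "sora-2": {"720x1280", "1280x720"},
    "sora-2-pro": {"720x1280", "1280x720", "1024x1808", "1808x1024"},
}
ALLOWED_SECONDS = {"4", "8", "12"}
ALLOWED_MIME_TYPES = {"image/jpeg", "image/png", "image/webp"}


def validate_request(
    prompt: str,
    model: str = "sora-2",
    size: str = "720x1280",
    seconds: str = "4",
    reference_mime: Optional[str] = None,
) -> List[str]:
    """Return a list of human-readable problems; an empty list means no problems found."""
    errors = []
    if not prompt or not prompt.strip():
        errors.append("prompt is required")
    sizes = ALLOWED_SIZES.get(model.lower())  # tolerate "Sora-2" style casing
    if sizes is None:
        errors.append(f"unknown model: {model}")
    elif size.replace("×", "x") not in sizes:  # accept the multiplication sign too
        errors.append(f"size {size} is not listed for {model}")
    if str(seconds) not in ALLOWED_SECONDS:
        errors.append(f"seconds must be one of {sorted(ALLOWED_SECONDS, key=int)}")
    if reference_mime is not None and reference_mime not in ALLOWED_MIME_TYPES:
        errors.append(f"unsupported reference MIME type: {reference_mime}")
    return errors
```

This check does not verify that a reference image's pixel dimensions match the requested size; that would require opening the file with an imaging library.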
59+
The API is the same as the [OpenAI API](https://platform.openai.com/docs/guides/video-generation), except for the following:
60+
- In the Azure OpenAI API, replace the model name with the name of your deployment, for example `sora2-test`.
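
As a rough illustration of the deployment-name substitution, the sketch below copies an OpenAI-style request body and swaps the `model` value. The helper name `to_azure_body` and the lowercase field names (`prompt`, `size`, `seconds`) are assumptions for this example; only the idea of replacing the model name with the deployment name comes from the text.

```python
def to_azure_body(openai_body: dict, deployment_name: str) -> dict:
    """Return a copy of the request body with `model` set to the deployment name.

    The input dictionary is left unchanged so it can be reused elsewhere.
    """
    body = dict(openai_body)
    body["model"] = deployment_name
    return body


# Example: a body written for the OpenAI API, adapted for an Azure deployment.
openai_body = {"model": "sora-2", "prompt": "A paper boat drifting down a rainy street", "seconds": "4"}
azure_body = to_azure_body(openai_body, "sora2-test")
```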
1662

17-
## Supported features
18-
19-
Sora can generate complex scenes with multiple characters, diverse motions, and detailed backgrounds.
20-
21-
**Text to video**: The model interprets prompts with contextual and physical world understanding, enabling accurate scene composition and character persistence across multiple shots. Sora demonstrates strong language comprehension for prompt interpretation and emotional character generation.
22-
23-
**Image to video**: Sora generates video content from a still image. You can specify where in the generated video the image appears (it doesn't need to be the first frame) and which region of the image to use.
24-
25-
**Video to video**: Sora generates new video content from an existing video clip. You can specify where in the generated video the input video appears (it doesn't need to be the beginning).
2663

2764
## How it works
2865

@@ -39,9 +76,13 @@ Write text prompts in English or other Latin script languages for the best video
3976

4077
Sora might have difficulty with complex physics, causal relationships (for example, bite marks on a cookie), spatial reasoning (for example, knowing left from right), and precise time-based event sequencing such as camera movement.
4178

42-
### Technical limitations
79+
### Sora 2 technical limitations
80+
81+
- See the Sora 2 API details above.
82+
- Jobs are available for up to 24 hours after they're created. After that, you must create a new job to generate the video again.
83+
- You can run up to two video creation jobs at the same time. You must wait for one of them to finish before you can create another.
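
The 24-hour availability window and the two-job limit listed above can be tracked on the client so that requests likely to be rejected are not sent. This is a minimal sketch under stated assumptions: the helper names are hypothetical, the caller is assumed to record each job's creation time in UTC, and the service remains the authority on both limits.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

JOB_LIFETIME = timedelta(hours=24)  # jobs are available for up to 24 hours
MAX_CONCURRENT_JOBS = 2             # two creation jobs may run at the same time


def is_expired(created_at: datetime, now: Optional[datetime] = None) -> bool:
    """Return True if a job created at `created_at` is older than 24 hours."""
    now = now or datetime.now(timezone.utc)
    return now - created_at > JOB_LIFETIME


def can_submit(running_jobs: int) -> bool:
    """Return True if another creation job fits under the concurrency limit."""
    return running_jobs < MAX_CONCURRENT_JOBS
```

A caller might check `can_submit` before calling the Create Video endpoint and `is_expired` before attempting a download, creating a new job when the old one has aged out.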
4384

44-
Sora has some technical limitations to be aware of:
85+
### Sora 1 technical limitations
4586

4687
- Sora supports the following output resolution dimensions:
4788
480x480, 480x854, 854x480, 720x720, 720x1280, 1280x720, 1080x1080, 1080x1920, 1920x1080.
