|
| 1 | +--- |
| 2 | +title: Sora video generation overview |
| 3 | +description: Learn about Sora, an AI model for generating realistic and imaginative video scenes from text instructions, including safety, limitations, and supported features. |
| 4 | +author: PatrickFarley |
| 5 | +ms.author: pafarley |
| 6 | +manager: nitinme |
| 7 | +ms.service: azure-ai-openai |
| 8 | +ms.topic: conceptual |
| 9 | +ms.date: 05/22/2025 |
| 10 | +--- |
1 | 11 |
|
| 12 | +# Sora video generation |
2 | 13 |
|
3 |
| -Sora does not respond to all information provided in the prompt, but is still capable of creating realistic clips |
| 14 | +Sora is an AI model from OpenAI that can create realistic and imaginative video scenes from text instructions. The model is capable of generating a wide range of video content, including realistic scenes, animations, and special effects. A variety of video resolutions and durations are supported. |
4 | 15 |
|
5 |
| -Users must use Sora in English or Latin script languages for best user experience |
| 16 | +## Supported features |
6 | 17 |
|
7 |
| -Sora does not generate scenes with acts of violence but can demonstrate realistic war-like footage |
| 18 | +Sora can generate complex scenes with multiple characters, diverse motions, and detailed backgrounds. The model interprets prompts with contextual and physical world understanding, enabling accurate scene composition and character persistence across multiple shots. Sora demonstrates strong language comprehension for prompt interpretation and emotional character generation. |
8 | 19 |
|
9 |
| -Commonly filtered names such as Emma Watson, Obama, Biden, etc. are not filtered. |
10 | 20 |
|
11 |
| -Sora does respond to prompts that can produce videos of lingerie – for example, using Japanese allowed a previously filtered prompt (in English) to be run |
12 | 21 |
|
13 | 22 |
|
14 |
| -RAI: Sora has a robust safety stack including a moderation stack with prompt rewrites, content filtering, abuse monitoring, sensitive content blocking, and safety classifiers. Provenance will be added during private preview. |
15 | 23 |
|
16 |
| -Sora API will support the following resolution dimensions: 480x480, 480x854, 720x720, 720x1280, 1080x1080, 1920x1080 |
| 24 | +Sora does not respond to all information provided in the prompt, but it is still capable of creating realistic clips. |
| 25 | +## Best practices for prompts |
17 | 26 |
|
18 |
| -5,10,15,20 seconds of video duration. For 1080p, max duration is 10s |
| 27 | +For the best video generation performance, write text prompts in English or another Latin script language. |
19 | 28 |
|
20 |
| -context length same as gpt-image-1 |
| 29 | + |
| 30 | + |
| 31 | + |
| 32 | + |
| 33 | +## Responsible AI |
| 34 | + |
| 35 | +Sora has a robust safety stack including a moderation stack with prompt rewrites, content filtering, abuse monitoring, sensitive content blocking, and safety classifiers. |
| 36 | + |
| 37 | +Sora does not generate scenes that depict acts of violence, but it can produce realistic war-like footage. |
| 38 | + |
| 39 | +Names that are commonly filtered elsewhere, such as Emma Watson, Obama, and Biden, are not filtered. |
21 | 40 |
|
22 | 41 | ## Limitations
|
23 | 42 |
|
| 43 | +### Content quality limitations |
| 44 | + |
| 45 | +Sora may have difficulty with complex physics, causal relationships (for example, bite marks on a cookie), spatial reasoning (for example, distinguishing left from right), and precise time-based event sequencing such as camera movement. |
| 46 | + |
| 47 | +### Technical limitations |
| 48 | + |
| 49 | +Sora supports the following output resolution dimensions: 480x480, 480x854, 720x720, 720x1280, 1080x1080, 1920x1080. |
| 50 | + |
| 51 | +Sora supports the following video durations: 5, 10, 15, and 20 seconds. The maximum duration for 1080p videos is 10 seconds. |
| 52 | + |
| 53 | + |
| 54 | +The maximum number of variants is 1 at 1080p, 2 at 720p, and 4 at all other resolutions. |
24 | 55 |
|
25 |
| -Max variants on 1080p =1; 720 =2; otherwise 4 |
26 |
| -Limiting 2 pending tasks per customer |
| 56 | +Each customer is limited to 2 pending tasks. |
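
The technical limits listed above can be checked on the client before a request is submitted. The following is a minimal sketch, not part of the Sora API: the names `validate_request` and `max_variants` are hypothetical, and it assumes that "1080p" and "720" refer to the smaller of the two dimensions (so 1920x1080 and 1080x1080 both count as 1080p). Adjust that rule if the service defines the tiers differently.

```python
from typing import List

# Values copied from the limits listed in this article.
SUPPORTED_RESOLUTIONS = {
    (480, 480), (480, 854), (720, 720),
    (720, 1280), (1080, 1080), (1920, 1080),
}
SUPPORTED_DURATIONS = {5, 10, 15, 20}  # seconds


def max_variants(width: int, height: int) -> int:
    """Return the variant cap for a resolution.

    Assumption: the tier is the smaller dimension (1080 -> 1, 720 -> 2, else 4).
    """
    tier = min(width, height)
    if tier == 1080:
        return 1
    if tier == 720:
        return 2
    return 4


def validate_request(width: int, height: int, seconds: int,
                     variants: int = 1) -> List[str]:
    """Return a list of human-readable problems; an empty list means valid."""
    if (width, height) not in SUPPORTED_RESOLUTIONS:
        # Nothing else can be checked reliably for an unknown resolution.
        return [f"unsupported resolution {width}x{height}"]

    errors = []
    if seconds not in SUPPORTED_DURATIONS:
        errors.append(f"unsupported duration {seconds}s")
    elif min(width, height) == 1080 and seconds > 10:
        errors.append("1080p videos are limited to 10 seconds")

    cap = max_variants(width, height)
    if variants < 1 or variants > cap:
        errors.append(f"variants must be between 1 and {cap}")
    return errors
```

For example, `validate_request(1920, 1080, 15)` reports the 10-second cap for 1080p, while `validate_request(480, 854, 20, variants=4)` returns an empty list. The limit of 2 pending tasks per customer depends on server-side state, so this sketch does not cover it.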