
Commit 6dd70db

Merge pull request #7003 from PatrickFarley/imagen: add video gen
2 parents: 28b084f + 67b9718

File tree

4 files changed: +97 −6 lines

articles/ai-foundry/openai/concepts/video-generation.md

Lines changed: 5 additions & 3 deletions

@@ -11,7 +11,7 @@ ms.date: 5/29/2025
 
 # Sora video generation (preview)
 
-Sora is an AI model from OpenAI that can create realistic and imaginative video scenes from text instructions. The model is capable of generating a wide range of video content, including realistic scenes, animations, and special effects. Several video resolutions and durations are supported.
+Sora is an AI model from OpenAI that can create realistic and imaginative video scenes from text instructions, input images, or input video. The model can generate a wide range of video content, including realistic scenes, animations, and special effects. Several video resolutions and durations are supported.
 
 ## Supported features

@@ -21,7 +21,7 @@ Sora can generate complex scenes with multiple characters, diverse motions, and
 
 **Image to video**: Sora can generate video content from a still image. You can specify where in the generated video the image appears (it doesn't need to be the first frame) and which region of the image to use.
 
-
+**Video to video**: Sora can generate new video content from an existing video clip. You can specify where in the generated video the input video appears (it doesn't need to be at the beginning).
 
 ## How it works
@@ -44,10 +44,12 @@ Sora has some technical limitations to be aware of:
 
 - Sora supports the following output resolution dimensions: 480x480, 480x854, 854x480, 720x720, 720x1280, 1280x720, 1080x1080, 1080x1920, 1920x1080.
-- Sora supports video durations between 1 and 20 seconds.
+- Sora can produce videos between 1 and 20 seconds long.
 - You can request multiple video variants in a single job: for 1080p resolutions, this feature is disabled; for 720p, the maximum is two variants; for other resolutions, the maximum is four variants.
 - You can have two video creation jobs running at the same time. You must wait for one of the jobs to finish before you can create another.
 - Jobs are available for up to 24 hours after they're created. After that, you must create a new job to generate the video again.
+- Up to two images can be used as input (the generated video interpolates content between them).
+- One video of up to five seconds can be used as input.
 
 ## Responsible AI
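The limits in the list above are easy to trip over when assembling a request by hand. As a minimal sketch (a hypothetical pre-flight helper; `validate_sora_job` and its encoding of the limits are illustrative, not part of any Azure OpenAI SDK), they could be checked client-side before submitting a job:

```python
# Hypothetical pre-flight check for Sora job parameters, based on the
# documented limits above. Not part of any SDK; names are illustrative.

SUPPORTED_RESOLUTIONS = {
    (480, 480), (480, 854), (854, 480), (720, 720), (720, 1280),
    (1280, 720), (1080, 1080), (1080, 1920), (1920, 1080),
}

def validate_sora_job(width: int, height: int, n_seconds: int, n_variants: int) -> list[str]:
    """Return a list of problems; an empty list means the request looks valid."""
    problems = []
    if (width, height) not in SUPPORTED_RESOLUTIONS:
        problems.append(f"unsupported resolution {width}x{height}")
    if not 1 <= n_seconds <= 20:
        problems.append("duration must be between 1 and 20 seconds")
    # Variant limits depend on resolution: disabled at 1080p, two at 720p,
    # four at the other supported resolutions.
    if 1080 in (width, height) or 1920 in (width, height):
        max_variants = 1
    elif 720 in (width, height) or 1280 in (width, height):
        max_variants = 2
    else:
        max_variants = 4
    if n_variants > max_variants:
        problems.append(f"at most {max_variants} variant(s) allowed at {width}x{height}")
    return problems
```

Running a check like this before the POST saves a round trip; the service remains the authority on what it actually accepts.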

articles/ai-foundry/openai/includes/video-generation-intro.md

Lines changed: 1 addition & 1 deletion
@@ -7,6 +7,6 @@ ms.topic: include
 ms.date: 5/29/2025
 ---
 
-In this quickstart, you generate video clips using the Azure OpenAI service. The example uses the Sora model, which is a video generation model that creates realistic and imaginative video scenes from text instructions and/or image inputs. This guide shows you how to create a video generation job, poll for its status, and retrieve the generated video.
+In this quickstart, you generate video clips using the Azure OpenAI service. The example uses the Sora model, a video generation model that creates realistic and imaginative video scenes from text instructions, image inputs, or video inputs. This guide shows you how to create a video generation job, poll for its status, and retrieve the generated video.
 
 For more information on video generation, see [Video generation concepts](../concepts/video-generation.md).

articles/ai-foundry/openai/includes/video-generation-rest.md

Lines changed: 86 additions & 0 deletions
@@ -244,6 +244,92 @@ You can generate a video with the Sora model by creating a video generation job,
    else:
        raise Exception(f"Job didn't succeed. Status: {status}")
```

## [Video prompt](#tab/video-prompt)

Replace the `"file_name"` field in `"inpaint_items"` with the name of your input video file. Also replace the construction of the `files` array, which associates the path to the actual file with the file name that the API uses.

Use the `"crop_bounds"` values (crop bounds for each edge of the frame, expressed as fractions of the total frame dimensions) to specify which part of the video frame to use in video generation.

You can optionally set `"frame_index"` to the frame in the generated video where your input video should start (the default is 0, the beginning).
```python
# 1. Create a video generation job with video inpainting (multipart upload)
create_url = f"{endpoint}/openai/v1/video/generations/jobs?api-version=preview"

# Flatten the body for multipart/form-data
data = {
    "prompt": "A serene forest scene transitioning into autumn",
    "height": str(1080),
    "width": str(1920),
    "n_seconds": str(10),
    "n_variants": str(1),
    "model": "sora",
    # inpaint_items must be a JSON string
    "inpaint_items": json.dumps([
        {
            "frame_index": 0,
            "type": "video",
            "file_name": "dog_swimming.mp4",
            "crop_bounds": {
                "left_fraction": 0.1,
                "top_fraction": 0.1,
                "right_fraction": 0.9,
                "bottom_fraction": 0.9
            }
        }
    ])
}

# Replace with your own video file path
with open("dog_swimming.mp4", "rb") as video_file:
    files = [
        ("files", ("dog_swimming.mp4", video_file, "video/mp4"))
    ]
    # Drop any preset Content-Type so requests can set the multipart boundary
    multipart_headers = {k: v for k, v in headers.items() if k.lower() != "content-type"}
    response = requests.post(
        create_url,
        headers=multipart_headers,
        data=data,
        files=files
    )

if not response.ok:
    print("Error response:", response.status_code, response.text)
response.raise_for_status()
print("Full response JSON:", response.json())
job_id = response.json()["id"]
print(f"Job created: {job_id}")

# 2. Poll for job status
status_url = f"{endpoint}/openai/v1/video/generations/jobs/{job_id}?api-version=preview"
status = None
while status not in ("succeeded", "failed", "cancelled"):
    time.sleep(5)
    status_response = requests.get(status_url, headers=headers).json()
    status = status_response.get("status")
    print(f"Job status: {status}")

# 3. Retrieve the generated video
if status == "succeeded":
    generations = status_response.get("generations", [])
    if generations:
        generation_id = generations[0].get("id")
        video_url = f"{endpoint}/openai/v1/video/generations/{generation_id}/content/video?api-version=preview"
        video_response = requests.get(video_url, headers=headers)
        if video_response.ok:
            output_filename = "output.mp4"
            with open(output_filename, "wb") as file:
                file.write(video_response.content)
            print(f'✅ Generated video saved as "{output_filename}"')
    else:
        raise Exception("No generations found in job result.")
else:
    raise Exception(f"Job didn't succeed. Status: {status}")
```
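The `crop_bounds` values in the sample above use 0.1 and 0.9 on each axis, which keeps the central region of the frame. If you start from a crop box in pixels, converting it to fractions is one division per edge. This is a hypothetical convenience helper (assuming, as the sample values suggest, that each fraction is a position measured from the left or top edge of the frame):

```python
# Hypothetical helper: convert a pixel crop box into the fractional
# crop_bounds shape used in the inpaint_items payload. Assumes fractions
# are positions from the left/top edge, matching the sample values
# (0.1 .. 0.9 keeps the central region of the frame).

def crop_bounds_from_pixels(frame_w: int, frame_h: int,
                            left: int, top: int, right: int, bottom: int) -> dict:
    if not (0 <= left < right <= frame_w and 0 <= top < bottom <= frame_h):
        raise ValueError("crop box must lie inside the frame")
    return {
        "left_fraction": left / frame_w,
        "top_fraction": top / frame_h,
        "right_fraction": right / frame_w,
        "bottom_fraction": bottom / frame_h,
    }
```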
---

articles/ai-foundry/openai/whats-new.md

Lines changed: 5 additions & 2 deletions

@@ -18,12 +18,15 @@ ms.custom:
 
 This article provides a summary of the latest releases and major documentation updates for Azure OpenAI.
 
+## Sora video-to-video support
+
+The Sora model from OpenAI now supports video-to-video generation. You can provide a short video as input to generate a new, longer video that incorporates the input video. See the [quickstart](./video-generation-quickstart.md) to get started.
+
 ## August 2025
 
 ### Sora image-to-video support
 
-The Sora model from OpenAI now supports image-to-video generation. You can provide an image as input to the model to generate a video that incorporates the content of the image. You can also specify the frame of the video in which the image should appear: it doesn't need to be the beginning.
-
+The Sora model from OpenAI now supports image-to-video generation. You can provide an image as input to the model to generate a video that incorporates the content of the image. You can also specify the frame of the video in which the image should appear: it doesn't need to be the beginning. See the [quickstart](./video-generation-quickstart.md) to get started.
 
 Sora is now available in the Sweden Central region as well as East US 2.