The Content Understanding service is continuously updated. Bookmark this page to stay informed about the latest features and samples.
+## April 2025
+The Content Understanding **2024-12-01-preview** REST API is now available. This update to the preview API introduces the following enhancements:
+* **General improvements**. For all modalities, to request an increase from current limits, contact us at `[email protected]`.
+* **Prebuilt invoice template**. The invoice template is now customizable. After you select the invoice template, you get a predefined list of fields that you can tailor to your needs by adding or removing fields.
+* **Generative and classification fields**
+  * Both generative and classification fields are now supported for the document modality.
+  * You can now use the REST endpoint or Studio to define generative and classification fields with zero-shot outputs for documents. This feature lets you generate summaries, infer results, and classify individual documents across multiple files.
+  * Multiple analyzers can be invoked to process individual files.
+* **Video modality**
+  * Improved video processing, resulting in 50% lower latency.
+  * Expanded output types to add support for `Object` and `Arrays`.
+  * Added support for ingesting video files provided via S3 presigned URLs.
+  * Improved video segmentation so that videos are segmented semantically, especially when the video contains no shot edits.
+* **Audio modality**
+  * The API now supports the field type `group`.
+* **Text modality**
+  * The API now supports the field type `group`.
+* **User experience improvements**
+  * Added functionality to download and upload schema configurations during schema definition.
+  * Enhanced file labeling and analyzer building processes.
+  * Added downloadable code samples for quick setup.
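
To make the generative and classification fields item above more concrete, here is a minimal sketch that builds a request body for a document analyzer with one field of each kind. The payload shape (`scenario`, `fieldSchema`, `method`, `enum`), the helper `build_analyzer_body`, and the field names `Summary` and `DocumentType` are assumptions for illustration and do not come from this changelog. Check the REST reference for the actual schema before using it.

```python
import json

# API version named in the release notes above.
API_VERSION = "2024-12-01-preview"


def build_analyzer_body(categories):
    """Build a hypothetical analyzer definition (assumed payload shape).

    The body declares one generative field (a free-text summary) and one
    classification field restricted to the given category labels.
    """
    if not categories:
        raise ValueError("at least one category is required")
    return {
        "description": "Sample document analyzer (illustrative only)",
        "scenario": "document",  # assumed key for the documents modality
        "fieldSchema": {
            "fields": {
                # Generative field: the service would produce the text.
                "Summary": {
                    "type": "string",
                    "method": "generate",  # assumed value
                    "description": "One-paragraph summary of the document",
                },
                # Classification field: output limited to the listed labels.
                "DocumentType": {
                    "type": "string",
                    "method": "classify",  # assumed value
                    "enum": list(categories),
                },
            }
        },
    }


body = build_analyzer_body(["invoice", "receipt", "contract"])
print(json.dumps(body, indent=2))
```

The sketch only constructs and prints the JSON body. Sending it to a REST endpoint is left out on purpose, because the endpoint path and authentication are not described in this changelog.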
## November 2024
Welcome! The Azure AI Content Understanding API version `2024-12-01-preview` is now in public preview. This version allows you to generate a structured representation of content tailored to specific tasks from various modalities or formats. Content Understanding uses a defined schema to extract content suitable for processing by large language models and subsequent applications.
articles/ai-services/openai/concepts/models.md: 2 additions & 2 deletions
@@ -228,7 +228,7 @@ The image generation models generate images from text prompts that the user prov
**For access to `gpt-image-1` registration is required, and access will be granted based on Microsoft's eligibility criteria**. Customers who have access to other limited access models will still need to request access for this model.
-Request access: [`computer-use-preview` limited access model application](https://aka.ms/oai/gptimage1access)
+Request access: [`gpt-image-1` limited access model application](https://aka.ms/oai/gptimage1access)
Once access has been granted, you will need to create a deployment for the model.
@@ -237,7 +237,7 @@ Once access has been granted, you will need to create a deployment for the model
| Model | Region |
|---|---|
|`dall-e-3`| East US<br>Australia East<br>Sweden Central|
-|`gpt-image-1`| West US 2 (Global Standard) <br> UAE North (Global Standard) |
+|`gpt-image-1`| West US 3 (Global Standard) <br> UAE North (Global Standard) |
-| image | file | The input image to edit. Must be a valid image URL or base64-encoded image. tbd | Yes ||
+| image | file | The input image to edit. Must be a valid image URL or base64-encoded image. | Yes ||
| n | integer | The number of images to generate. | No | 1 |
| prompt | string | A text description of how the input image should be edited. The maximum length is 4000 characters. | Yes ||
| mask | file | A mask image to define the area of the input image that the model should edit, using fully transparent pixels (alpha of zero) in those areas. Must be a valid image URL or base64-encoded image. | No ||
@@ -6195,7 +6195,7 @@ The style of the generated images.
| Name | Type | Description | Required | Default |
-| image | file | The input image to edit. Must be a valid image URL or base64-encoded image. tbd | Yes ||
+| image | file | The input image to edit. Must be a valid image URL or base64-encoded image. | Yes ||
| n | integer | The number of images to generate. | No | 1 |
| prompt | string | A text description of how the input image should be edited. The maximum length is 4000 characters. | Yes ||
| mask | file | A mask image to define the area of the input image that the model should edit, using fully transparent pixels (alpha of zero) in those areas. Must be a valid image URL or base64-encoded image. | No ||
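
The parameters in the table above can be checked client-side before a request is sent. The following is a minimal sketch, not the service's own validation: the `ImageEditRequest` class, its `validate` method, and the lower bound on `n` are assumptions made for illustration. Only the required flags, the default of 1 for `n`, and the 4000-character prompt limit come from the table.

```python
from dataclasses import dataclass
from typing import Optional

# Prompt limit stated in the parameter table.
MAX_PROMPT_CHARS = 4000


@dataclass
class ImageEditRequest:
    """Hypothetical container for the image edit parameters in the table."""

    image: str                  # required: image URL or base64-encoded image
    prompt: str                 # required: description of the desired edit
    n: int = 1                  # optional: number of images, default 1
    mask: Optional[str] = None  # optional: mask URL or base64-encoded image

    def validate(self) -> None:
        """Raise ValueError if a documented constraint is violated."""
        if not self.image:
            raise ValueError("image is required")
        if not self.prompt:
            raise ValueError("prompt is required")
        if len(self.prompt) > MAX_PROMPT_CHARS:
            raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
        if self.n < 1:
            # Assumption: the table gives no explicit lower bound for n.
            raise ValueError("n must be at least 1")


request = ImageEditRequest(
    image="https://example.com/input.png",  # placeholder URL
    prompt="Replace the sky with a sunset",
)
request.validate()  # passes silently when all checks succeed
```

Keeping the checks in one method makes it straightforward to tighten them later, for example if the service documents an upper bound for `n`.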