articles/ai-services/content-understanding/whats-new.md
Lines changed: 32 additions & 20 deletions
@@ -17,35 +17,47 @@ ms.custom:
 Azure AI Content Understanding service is updated on an ongoing basis. Bookmark this page to stay up to date with release notes, feature enhancements, and our newest documentation.
 
 ## May 2025
+
 The Azure AI Content Understanding **`2025-05-01-preview`** REST API is now available. This update introduces the following updates and enhanced capabilities:
 
-* **Modes for documents**: With the **`2025-05-01-preview`** release, we introduce two modes: `standard` and `pro`. Content Understanding pro mode adds reasoning, support for multiple input documents, the ability to configure an external knowledge base for linking and validation. These features enable agentic processes for automating complex tasks requiring human effort. The `pro` mode is currently limited to the document analyzer, enables reasoning capabilities that can infer complicated outputs given multiple reference documents (for example, insurance policy documents) and multiple input documents (for example, forms and supporting documents that are case-specific)
+* **Processing modes**: With the **`2025-05-01-preview`** release, we introduce two modes: `standard` and `pro`. The default mode for all analyzers is `standard`. Content Understanding pro mode adds reasoning, support for multiple input documents, and the ability to configure an external knowledge base for linking, enrichment, and validation. These features extend field extraction to complex tasks that previously required custom code or human effort. The `pro` mode is currently limited to documents as inputs. Common challenges that pro mode addresses include aggregating a schema across content from different input files, validating results across documents, and using external knowledge, such as guidelines, standard operating procedures, and other context, to generate an output schema. Learn more about the [pro mode](concepts/standard-pro-modes.md).
+
+* **Improvements to document processing**:
+
+* **Document classification and splitting** with a [Classification API](concepts/classifier.md). This API supports classifying and logically splitting a single file containing multiple documents with optional routing to field extraction analyzers. The API enables you to define a workflow to classify and split a file into multiple logical documents and route the individual documents to a downstream field extraction model in a single API call.
+* Improvements in **content extraction**:
+* Added support for extracting a table that spans multiple pages as a single logical table. Learn more about [structure extraction updates in documents](document/elements.md).
+* Selection mark support for checkmarks and radio buttons as Unicode characters. Learn more about [structure extraction updates in documents](document/elements.md).
+* Barcode extraction as part of the default content extraction along with `OCR`. Learn more about [structure extraction updates in documents](document/elements.md).
+* Support for embedding figures in Markdown output as base64 strings. Learn more about [structure extraction updates in documents](document/elements.md).
+* Improvements in **field extraction**:
+* Confidence score improvements with better grounding results for extractive fields.
+* File format support extended to the following document types: `.rtf`, `.txt`, `.xml`, `.json`, `.msg`, `.eml`, `.csv`, and `.xlsx`.
+
+
+* **Improvements to video processing**:
+
+* Added support for whole video fields. Learn more about [video processing improvements](video/overview.md#segmentation-mode).
+* Added support for video chapters via segmentation. Learn more about [video processing improvements](video/overview.md#segmentation-mode).
+* Added support for face identification on extracted face thumbnails. The identity enhances the description and downstream tasks like search and retrieval. Learn more about [face detection in videos](video/overview.md#content-extraction---grouping-and-identification).
+* Added support for disabling face blurring in analyzer configuration. Learn more about [video processing improvements](video/overview.md#field-extraction--face-description).
+
+* **Improvements in audio processing**:
 
-* **Document modality improvements** :
+* Support for multi-speaker call center role detection to allow detection of multiple speakers.
 
-* Improvements in content extraction:
-* Support for table spanning multiple pages getting extracted as a single logical table.
-* Selection mark support for checkmark and radio buttons as unicode characters.
-* Bar code extraction as part of the default content extraction along with `OCR`.
-* Support for embedded figures in Markdown as base64 string format output.
-* Confidence score improvements with better grounding results for extractive fields.
-* New file format support extended for following document types `.rtf`,`.txt`,`.xml`,`.json`, `.msg`,`.eml`,`.csv`, and `.xlsx`.
-* Classification API for documents only. This API supports classifying and splitting a single file containing multiple documents with optional routing to field extraction analyzers.
+* **Face API preview**:
 
-* **Video Modality improvements**:
+This release adds new face detection and recognition capabilities to Content Understanding. You can create a database of faces and recognize the faces in the processed content.
+* Detection, recognition, and enrollment of faces. Learn more about [detecting and recognizing faces](face/overview.md).
 
-* Support for whole video fields.
-* Support for video chapters via segmentation.
-* Support for face identification on extracted face thumbnails. The identity enhances the description and downstream tasks like search and retrieval.
-* Support for disabling face blurring in analyzer configuration.
+* **Billing changes**:
 
-* **Audio Modality improvements**: Support for multi-speaker call center role detection to allow detection of multiple speakers.
+* New, simpler pricing model that lowers processing costs when compared to the **2024-12-01-preview** REST API across many of the features. Learn more about the [updated pricing model](https://azure.microsoft.com/pricing/details/content-understanding/).
 
-* **Face API preview**: Detection, recognition, and enrollment of faces.
+* **User experience improvements**:
 
-* **Billing Changes**: New simpler pricing model that lowers costs from the **2024-12-01-preview** REST API across all modalities.
-
-* **User experience improvements**: To improve latency issues, the UX experience is refreshed with ease-of-use and quick try-out experience as part of [Azure AI Foundry](https://aka.ms/cu-landing)
+The [Azure AI Foundry](https://ai.azure.com/) experience continues to improve with a streamlined project creation flow, improved performance, and a try-out experience. Get started with Content Understanding in [Azure AI Foundry](https://aka.ms/cu-landing) today.