Commit 37f7609

Merge pull request #4999 from laujan/vinod-4994-whats-new
Vinod 4994 whats new
2 parents e57c03c + a9a17dd commit 37f7609

1 file changed

+32
-20
lines changed

articles/ai-services/content-understanding/whats-new.md

Lines changed: 32 additions & 20 deletions
@@ -17,35 +17,47 @@ ms.custom:
 Azure AI Content Understanding service is updated on an ongoing basis. Bookmark this page to stay up to date with release notes, feature enhancements, and our newest documentation.

 ## May 2025
+
 The Azure AI Content Understanding **`2025-05-01-preview`** REST API is now available. This update introduces the following updates and enhanced capabilities:

-* **Modes for documents**: With the **`2025-05-01-preview`** release, we introduce two modes: `standard` and `pro`. Content Understanding pro mode adds reasoning, support for multiple input documents, the ability to configure an external knowledge base for linking and validation. These features enable agentic processes for automating complex tasks requiring human effort. The `pro` mode is currently limited to the document analyzer, enables reasoning capabilities that can infer complicated outputs given multiple reference documents (for example, insurance policy documents) and multiple input documents (for example, forms and supporting documents that are case-specific)
+* **Processing modes**: With the **`2025-05-01-preview`** release, we introduce two modes: `standard` and `pro`. The default mode for all analyzers is `standard`. Content Understanding pro mode adds reasoning, support for multiple input documents, the ability to configure an external knowledge base for linking, enrichment, and validation. These features enable automating complex tasks to extend field extraction capabilities to include tasks that required custom code or human effort. The `pro` mode is currently limited to documents as inputs. Common challenges that the pro mode addresses include aggregating a schema across content from different input files. It also involves validating results across documents. Additionally, it uses external knowledge, such as guidelines, standard operating procedures, and other context, to generate an output schema. Learn more about the [pro mode](concepts/standard-pro-modes.md).
+
+* **Improvements to document processing** :
+
+* **Document classification and splitting** with a [Classification API](concepts/classifier.md). This API supports classifying and logical splitting a single file containing multiple documents with optional routing to field extraction analyzers. The API enables you to define a workflow to classify and split a file into multiple logical documents and route the individual documents to a downstream field extraction model in a single API call.
+* Improvements in **content extraction**:
+* Added support for extracting table spanning multiple pages as a single logical table. Learn more about [structure extraction updates in documents](document/elements.md).
+* Selection mark support for checkmark and radio buttons as unicode characters. Learn more about [structure extraction updates in documents](document/elements.md).
+* Bar code extraction as part of the default content extraction along with `OCR`. Learn more about [structure extraction updates in documents](document/elements.md).
+* Support for embedded figures in Markdown as base64 string format output. Learn more about [structure extraction updates in documents](document/elements.md).
+* Improvements in **field extraction**
+* Confidence score improvements with better grounding results for extractive fields.
+* New file format support extended for following document types `.rtf`,`.txt`,`.xml`,`.json`, `.msg`,`.eml`,`.csv`, and `.xlsx`.
+
+
+* **Improvements to video processing**:
+
+* Added Support for whole video fields. Learn more about [video processing improvements](video/overview.md#segmentation-mode).
+* Added Support for video chapters via segmentation. Learn more about [video processing improvements](video/overview.md#segmentation-mode).
+* Added Support for face identification on extracted face thumbnails. The identity enhances the description and downstream tasks like search and retrieval. Learn more about [face detection in videos](video/overview.md#content-extraction---grouping-and-identification)
+* Added Support for disabling face blurring in analyzer configuration. Learn more about [video processing improvements](video/overview.md#field-extraction--face-description).
+
+* **Improvements in audio processing**:

-* **Document modality improvements** :
+* Support for multi-speaker call center role detection to allow detection of multiple speakers.

-* Improvements in content extraction:
-* Support for table spanning multiple pages getting extracted as a single logical table.
-* Selection mark support for checkmark and radio buttons as unicode characters.
-* Bar code extraction as part of the default content extraction along with `OCR`.
-* Support for embedded figures in Markdown as base64 string format output.
-* Confidence score improvements with better grounding results for extractive fields.
-* New file format support extended for following document types `.rtf`,`.txt`,`.xml`,`.json`, `.msg`,`.eml`,`.csv`, and `.xlsx`.
-* Classification API for documents only. This API supports classifying and splitting a single file containing multiple documents  with optional routing to field extraction analyzers.
+* **Face API preview**:

-* **Video Modality improvements**:
+This release adds new face detection and recognition capabilities to Content Understanding. You can create a database of faces and recognize the faces in the processed content.
+* Detection, recognition, and enrollment of faces. Learn more about [detecting and recognizing faces](face/overview.md).

-* Support for whole video fields.
-* Support for video chapters via segmentation.
-* Support for face identification on extracted face thumbnails. The identity enhances the description and downstream tasks like search and retrieval.
-* Support for disabling face blurring in analyzer configuration.
+* **Billing Changes**:

-* **Audio Modality improvements**: Support for multi-speaker call center role detection to allow detection of multiple speakers.
+* New simpler pricing model that lowers processing costs when compared to the **2024-12-01-preview** REST API across many of the features. Learn more about the [updated pricing model](https://azure.microsoft.com/pricing/details/content-understanding/)

-* **Face API preview**: Detection, recognition, and enrollment of faces.
+* **User experience improvements**:

-* **Billing Changes**: New simpler pricing model that lowers costs from the **2024-12-01-preview** REST API across all modalities.
-
-* **User experience improvements**: To improve latency issues, the UX experience is refreshed with ease-of-use and quick try-out experience as part of [Azure AI Foundry](https://aka.ms/cu-landing)
+The [Azure AI Foundry](https://ai.azure.com/) experience continues to improve with a streamlined project creation flow, improved performance experience, and a try-out experience. Get started with Content Understanding in the [Azure AI Foundry](https://aka.ms/cu-landing) today.

 ## April 2025
