Skip to content

Commit e84373d

Browse files
authored
Update whats-new.md
Updated formatting and added links
1 parent 36d15c2 commit e84373d

File tree

1 file changed

+31
-20
lines changed

1 file changed

+31
-20
lines changed

articles/ai-services/content-understanding/whats-new.md

Lines changed: 31 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -19,33 +19,44 @@ Azure AI Content Understanding service is updated on an ongoing basis. Bookmark
1919
## May 2025
2020
The Azure AI Content Understanding **`2025-05-01-preview`** REST API is now available. This update introduces the following updates and enhanced capabilities:
2121

22-
* **Modes for documents**: With the **`2025-05-01-preview`** release, we introduce two modes: `standard` and `pro`. Content Understanding pro mode adds reasoning, support for multiple input documents, the ability to configure an external knowledge base for linking and validation. These features enable agentic processes for automating complex tasks requiring human effort. The `pro` mode is currently limited to the document analyzer, enables reasoning capabilities that can infer complicated outputs given multiple reference documents (for example, insurance policy documents) and multiple input documents (for example, forms and supporting documents that are case-specific)
22+
* **Processing modes**: With the **`2025-05-01-preview`** release, we introduce two modes: `standard` and `pro`. The default mode for all analyzers is `standard`. Content Understanding pro mode adds reasoning, support for multiple input documents, the ability to configure an external knowledge base for linking, enrichment and validation. These features enable automating complex tasks requiring that extend the field extraction capability to cover tasks that would have required custom code or human effort. The `pro` mode is currently limited to documents as inputs, this will be extended to support other types of content soon! Common challenges that the pro mode addresses is aggregating a schema across content from different input files, validating results across documents and using extrenal knowledge (guidelines, standard operating procedures and other context) to generate a output schema. Learn more about the [pro mode](concepts/standard-pro-modes.md).
2323

24-
* **Document modality improvements** :
25-
26-
* Improvements in content extraction:
27-
* Support for table spanning multiple pages getting extracted as a single logical table.
28-
* Selection mark support for checkmark and radio buttons as unicode characters.
29-
* Bar code extraction as part of the default content extraction along with `OCR`.
30-
* Support for embedded figures in Markdown as base64 string format output.
31-
* Confidence score improvements with better grounding results for extractive fields.
32-
* New file format support extended for following document types `.rtf`,`.txt`,`.xml`,`.json`, `.msg`,`.eml`,`.csv`, and `.xlsx`.
33-
* Classification API for documents only. This API supports classifying and splitting a single file containing multiple documents  with optional routing to field extraction analyzers.
24+
* **Improvements to document processing** :
25+
26+
* **Document classification and splitting** with a [Classification API](concepts/classifier.md). This API supports classifying and logicall splitting a single file containing multiple documents with optional routing to field extraction analyzers. This enables you to define a workflow to classify and split a file into multiple logical documents and route the individual documents to a downstream field extraxtion model in a single API call.
27+
* Improvements in **content extraction**:
28+
* Added support for extracting table spanning multiple pages as a single logical table. Learn more about [structure extraction updates in documents](document/elements.md).
29+
* Selection mark support for checkmark and radio buttons as unicode characters. Learn more about [structure extraction updates in documents](document/elements.md).
30+
* Bar code extraction as part of the default content extraction along with `OCR`. Learn more about [structure extraction updates in documents](document/elements.md).
31+
* Support for embedded figures in Markdown as base64 string format output. Learn more about [structure extraction updates in documents](document/elements.md).
32+
* Improvements in **field extraction**
33+
* Confidence score improvements with better grounding results for extractive fields.
34+
* New file format support extended for following document types `.rtf`,`.txt`,`.xml`,`.json`, `.msg`,`.eml`,`.csv`, and `.xlsx`.
35+
36+
37+
* **Improvements to video processing**:
38+
39+
* Added Support for whole video fields. Learn more about [video processing improvements](video/overview.md#segmentation-mode).
40+
* Added Support for video chapters via segmentation. Learn more about [video processing improvements](video/overview.md#segmentation-mode).
41+
* Added Support for face identification on extracted face thumbnails. The identity enhances the description and downstream tasks like search and retrieval. Learn more about [face detection in videos](video/overview.md#content-extraction---grouping-and-identification)
42+
* Added Support for disabling face blurring in analyzer configuration. Learn more about [video processing improvements](video/overview.md#field-extraction--face-description).
3443

35-
* **Video Modality improvements**:
44+
* **Improvements in audio processing**:
45+
46+
* Support for multi-speaker call center role detection to allow detection of multiple speakers.
3647

37-
* Support for whole video fields.
38-
* Support for video chapters via segmentation.
39-
* Support for face identification on extracted face thumbnails. The identity enhances the description and downstream tasks like search and retrieval.
40-
* Support for disabling face blurring in analyzer configuration.
48+
* **Face API preview**:
4149

42-
* **Audio Modality improvements**: Support for multi-speaker call center role detection to allow detection of multiple speakers.
50+
This release adds new face detection and recognition capabilities to Content Understanding. You can create a database of faces and recognize the faces in the processed content.
51+
* Detection, recognition, and enrollment of faces. Learn more about [detecting and recognizing faces](face/overview.md).
4352

44-
* **Face API preview**: Detection, recognition, and enrollment of faces.
53+
* **Billing Changes**:
4554

46-
* **Billing Changes**: New simpler pricing model that lowers costs from the **2024-12-01-preview** REST API across all modalities.
55+
* New simpler pricing model that lowers processing costs when compated to the **2024-12-01-preview** REST API across many of the features. Learn more about the [updated pricing model](https://azure.microsoft.com/pricing/details/content-understanding/)
4756

48-
* **User experience improvements**: To improve latency issues, the UX experience is refreshed with ease-of-use and quick try-out experience as part of [Azure AI Foundry](https://aka.ms/cu-landing)
57+
* **User experience improvements**:
58+
59+
The [Azure AI Foundry](https://ai.azure.com/) experience continues to improve with a streamlined project creation flow, improved performance experience improvements and a try-out experience. Get started with Content Understanding in the [Azure AI Foundry](https://aka.ms/cu-landing) today.
4960

5061
## April 2025
5162

0 commit comments

Comments
 (0)