Skip to content

Commit 9c5ac42

Browse files
authored
Merge pull request #4890 from laujan/kate-4466-pro-standard-modes-only
standard and pro modes
2 parents b7e20f7 + 005221e commit 9c5ac42

File tree

2 files changed

+98
-0
lines changed

2 files changed

+98
-0
lines changed
Lines changed: 95 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,95 @@
1+
---
2+
title: Azure AI Content Understanding standard and pro modes
3+
titleSuffix: Azure AI services
4+
description: Learn about Azure AI Content Understanding standard and pro modes.
5+
author: laujan
6+
ms.author: kabrow
7+
manager: nitinme
8+
ms.service: azure-ai-content-understanding
9+
ms.topic: overview
10+
ms.date: 05/19/2025
11+
---
12+
13+
# Azure AI Content Understanding pro and standard modes
14+
15+
Azure AI Content Understanding is an advanced generative AI service designed to derive structured insights from multi-modal content such as documents, images, videos, and audio. With the introduction of the `2025-05-01-preview` version, the service now offers two distinct modes: `standard` and `pro`.
16+
17+
* **Standard**: This mode serves as the default solution for processing diverse content types. It's optimized to provide efficient schema extraction tailored to specific tasks across all data formats. This mode emphasizes cost-effectiveness and reduced latency, ensuring structured insights are accessible for your general processing needs.
18+
19+
* **Pro**: This mode is designed for advanced use cases, particularly those requiring multi-step reasoning, and complex decision-making. It supports complex tasks such as identifying inconsistencies, drawing inferences, and making sophisticated decisions. The pro mode allows input from multiple content files and includes the option to reference external data for enriched and validated results. Currently, pro mode is only offered for your document-based data.
20+
21+
## Standard mode overview
22+
23+
The Content Understanding standard mode delivers structured insights across various data types, including documents, videos, images, audio, and text. While it doesn't support data inferencing, it minimizes cost and latency, making it ideal for broad, data-centric scenarios. This mode enables the creation or customization of schemas to extract precise insights tailored to your needs. Additionally, it incorporates data labeling for document data, allowing human input to enhance the quality of your outputs.
24+
25+
### Standard mode: use case
26+
27+
Standard is the ideal service for extracting the exact insights you need on any type of data. If you're just looking to unlock the content of your data, your scenario may not require complex reasoning or decision making. Scenarios that standard mode works great for include:
28+
29+
* Structuring data to power your RAG search workflows and integrating with [AI Search](../../../search/search-what-is-azure-search.md).
30+
* Extracting data to integrate with [Microsoft Fabric](https://blog.fabric.microsoft.com/en-US/blog/).
31+
* Analyzing advertising videos to screen for content guidelines.
32+
* Segmenting video footage to create chapters and identify ideal advertising breaks.
33+
* Extracting critical data points from sports games and providing post-game recaps.
34+
35+
## Pro mode overview
36+
37+
Content Understanding pro mode is tailored for customers with highly complex use cases, offering advanced multi-step reasoning capabilities. It enables reasoning over both input content and reference data provided by customers, making it ideal for scenarios requiring deep comprehension and complex decision-making. By incorporating reference data, pro mode adds context to each request, aiding tasks such as validation and enrichment, which reduces the need for human intervention and boosts productivity. For multiple input documents, it supports integration of reference datasets enriching workflows with external data for linking, validation, and enrichment. Currently, pro mode is exclusively available for document data.
38+
39+
40+
### Pro mode reference data
41+
42+
When your goal is to extract specific data points from your documents, reference data defines these data points and provides the service context it needs to validate whether the data passes the required criteria. For example, if you're looking to analyze invoices to ensure they're consistent with a contractual agreement, the signed contract serves as the reference data, allowing the service to compare the data in the contract to the individual invoices. In doing this comparison, the service applies reasoning to validate that either the invoices are in accordance with the contract or identify discrepancies and flag for further review.
43+
44+
### Multi-step reasoning
45+
46+
Multi-step reasoning offers the ability to decompose complex problems into a simples tasks. Multi-step reasoning takes data analysis a step further than extracting and aggregating structured data and allows you to draw conclusions on that data, minimizing the need for human review. Examples of the types of questions pro mode can answer with multi-step reasoning include:
47+
48+
* Does x match y?
49+
* Does x pass the outlined criteria?
50+
* Does x scenario follow the required guidelines?
51+
* Does the total equal the sum of the items?
52+
* Find all inconsistencies between the invoice and the contract.
53+
54+
## Standard and pro mode features
55+
56+
Not sure which mode is right for your scenario? The following charts compare standard and pro mode features.
57+
58+
| Feature | Standard mode | Pro mode |
59+
|----|----|----|
60+
| **Large documents** | Yes | Yes |
61+
| **Field mode** | Yes | Yes |
62+
| **Ability to extract, classify, and generate fields** | Yes | Yes |
63+
| **Grounding and confidence scores** | Yes | No |
64+
| **Input document type** | Documents, images, video, audio | Documents |
65+
| **Max fields** | 100 | 100 |
66+
| **Multiple input document processing** | No | Yes |
67+
| **Reference dataset integration** | No | Yes |
68+
| **Multistep reasoning** | No | Yes |
69+
70+
71+
## Apply standard or pro mode to your scenarios
72+
73+
You can apply both Content Understanding standard and pro modes to just about any scenario, but how you build your solution depends on the questions you're aiming to answer. The following scenarios present examples of how you might apply standard and pro modes to your data.
74+
75+
| Scenario | Standard mode | Pro mode|
76+
|----|----|----|
77+
| **Invoice analysis** | Extract insights on invoice data at scale and enable RAG search and further data analysis and visualization. Answer questions like: <br> &bullet; Extract purchase order number, total, due date, and line items for entry into database. | Analyze invoices and contractual agreements with clients and apply multi-step reasoning to draw conclusions on that data. Answer questions like: <br> &bullet; Does this invoice fulfill the contractual agreement we have in place with this client? <br> &bullet; Does this invoice need further review |
78+
| **Call center transcript analytics** | Extract insights on large volumes of call center data to gain valuable insights on sentiment, understand customer issues, and create targeted training to address major pain points. Answer questions like: <br> &bullet; What are the main issues customers are calling about? <br> &bullet; What is the average length of calls made about x issue? | Analyze call center transcript data and apply multi-step reasoning to understand how call center employees are addressing customer needs, and if they're following guidelines. Answer questions like: <br> &bullet; Did the call center employee introduce themselves? <br> &bullet; Did this answer "pass" certain criteria? |
79+
| **Mortgage application processing** | Extract the key values from mortgage application data and make it searchable and more easily accessible. Answer questions like: <br> &bullet; What year was the mortgage application submitted? <br> &bullet; What are the names on the application? | Analyze supplementary supporting documentation and mortgage applications to determine whether a prospective home buyer provides all the necessary documentation to secure a mortgage. Answer questions like: <br> &bullet; Do the names and social security numbers on the mortgage application match the supporting documentation? |
80+
81+
## Try pro mode
82+
83+
You can try out the features of both Content Understanding standard and pro modes using the [Azure AI Foundry](https://ai.azure.com/explore/aiservices/vision/contentunderstanding). The service enables you to bring your own data and experiment with all the functionalities of both modes in a lightweight, no-code approach to help you find the best fit for your unique scenario.
84+
85+
### Pro mode known limitations
86+
87+
* Content Understanding pro mode currently doesn't offer confidence scores or grounding. It currently supports generative and classification of your fields but doesn't support extraction only.
88+
* Content Understanding pro mode is currently only available for documents.
89+
90+
## Next steps
91+
92+
For more information on document processing, see [Document processing overview](../document/overview.md).
93+
94+
95+

articles/ai-services/content-understanding/toc.yml

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -76,6 +76,9 @@ items:
7676
- name: Accuracy and confidence
7777
displayName: accuracy, confidence, analyzers, optimization, fields, scores
7878
href: concepts/accuracy-confidence.md
79+
- name: "🆕 Modes: standard and pro"
80+
displayName: standard, pro, modes, analyzers, optimization, fields
81+
href: concepts/standard-pro-modes.md
7982
- name: 🆕 Classifier
8083
displayName: classifier, text, images, video, audio, multimodal, visual, structured, content, field, extraction
8184
href: concepts/classifier.md

0 commit comments

Comments
 (0)