Skip to content

Commit 23ecde0

Browse files
Apply suggestions from code review
Co-authored-by: Patrick Farley <[email protected]>
1 parent de188a1 commit 23ecde0

File tree

1 file changed

+18
-18
lines changed

1 file changed

+18
-18
lines changed

articles/ai-services/content-understanding/choosing_guide_doc_processing.md

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -21,28 +21,28 @@ Azure AI Document Intelligence is the trusted choice for many document-centric s
2121

2222
* Document digitization or [Optical Character Recognition (OCR)](/azure/ai-services/document-intelligence/prebuilt/read?view=doc-intel-4.0.0&branch=main&tabs=sample-code) to extract printed or handwritten text from documents.
2323

24-
* Document structure extraction with [Layout](https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/prebuilt/layout?view=doc-intel-4.0.0&branch=main&tabs=rest%2Csample-code) to extract table, selection marks, sections and document structure along with OCR.
24+
* Document structure extraction with [Layout](/azure/ai-services/document-intelligence/prebuilt/layout?view=doc-intel-4.0.0&branch=main&tabs=rest%2Csample-code) to extract table, selection marks, sections, and document structure along with OCR.
2525

26-
* Document [classification](https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/train/custom-classifier?view=doc-intel-4.0.0) to accurately identify, split and classify multiple documents.
26+
* Document [classification](/azure/ai-services/document-intelligence/train/custom-classifier?view=doc-intel-4.0.0) to accurately identify, split and classify multiple documents.
2727

28-
* Document field extraction with [prebuilt models](https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/model-overview?view=doc-intel-4.0.0) for predefined schema extraction from standard document type like tax, mortgage, bank checks and forms with higher variations like invoices, receipts, ID and [custom models](https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/train/custom-model?view=doc-intel-4.0.0) to label and train your own model.
28+
* Document field extraction with [prebuilt models](/azure/ai-services/document-intelligence/model-overview?view=doc-intel-4.0.0) for predefined schema extraction from standard document types like tax, mortgage, bank checks, forms with higher variations like invoices, receipts, and ID, and [custom models](/azure/ai-services/document-intelligence/train/custom-model?view=doc-intel-4.0.0) to label and train your own model.
2929

3030

3131
## Azure AI Content Understanding
3232

33-
Built on the same foundational capabilities of Document Intelligence, it extends document scenarios to images and embedded content, expanding to truly multimodal scenarios with audio and video. Content Understanding is built for a content processing with Generative AI, maximizing your ability to generate the specific output you need with inferred fields, enrichments, validations and reasoning. Content Understanding simplifies the process of building an effective document processing solution, packaging these capabilities into a simple and easy to use analyzer building process with zero shot output and no labeling, all while providing a rich schema that includes confidence scores and grounding, whereever applicable. Content Understanding provides a rich set of tools among others that can be configured to solve most document processing challenges.
33+
Built on the same foundational capabilities of Document Intelligence, it extends document scenarios to images and embedded content, expanding to truly multimodal scenarios with audio and video. Content Understanding is built for content processing with Generative AI, improving your ability to generate the specific output you need with inferred fields, enrichments, validations, and reasoning. Content Understanding simplifies the process of building an effective document processing solution, packaging these capabilities into a simple and easy to use analyzer building process with zero-shot output and no labeling, all while providing a rich schema that includes confidence scores and grounding, wherever applicable. Content Understanding provides a rich set of tools among others that can be configured to solve most document processing challenges.
3434

3535
* Inferred fields & enrichments: Output required that are not always directly present in the document, like the total tax on an invoice or the jurisdiction on a contract that can be inferred from the parties’ addresses or clause wording.
3636
* Multi-file input: Process multiple input files in the same request and extract a unified schema across all the input files.
3737
* Classification & Splitting: For parsing large files into individual documents for routing and schema extraction.
38-
* Reasoning: Intelligent Document Processing typically is a multi-step process with extraction, validation, aggregation and reviews. Content Understanding is built for IDP, simplifying everything into a single step process.
39-
* Post processing & validations: Use the description to define any post processing rules like converting date formats, currency codes and consistency checks.
38+
* Reasoning: Intelligent document processing typically is a multi-step process with extraction, validation, aggregation, and reviews. Content Understanding is built for IDP, simplifying everything into a single step process.
39+
* Post processing & validations: Use the description to define any post processing rules like converting date formats, currency codes, and consistency checks.
4040

41-
## Azure hosted LLM Models (Azure Open AI)
41+
## Azure-hosted LLMs (Azure Open AI)
4242

43-
For organizations requiring niche AI workflows, custom solutions built with Azure OpenAI Service/ or any other Azure based LLM services offer maximum flexibility. Developers can combine models like GPT-4o, Vision, Whisper, and Embeddings to build highly customized AI solutions, typically integrating Azure Document Intelligence/ Azure AI Content Understanding for pre-processing documents into custom workflows. This approach provides the maximum flexibility, but requires users to evaluate models, update models as needed, manage the prompts and optimize for costs. A common challenge with these solutions is the trade-off between cost management and accuracy as this approach lacks adequate tools to trigger reviews only for challenging cases.
43+
For organizations requiring niche AI workflows, custom solutions built with Azure OpenAI Service or any other Azure-based LLM services offer maximum flexibility. Developers can combine models like GPT-4o, Vision, Whisper, and Embeddings to build highly customized AI solutions, typically integrating Azure Document Intelligence or Azure AI Content Understanding for pre-processing documents into custom workflows. This approach provides the maximum flexibility but requires users to evaluate models, update models as needed, manage the prompts, and optimize for costs. A common challenge with these solutions is the trade-off between cost management and accuracy as this approach lacks adequate tools to trigger reviews only for challenging cases.
4444

45-
## Service Overview
45+
## Overview of services
4646

4747
Here’s a summary of the three available services:
4848
| Service | What it Does | Ideal For | Strengths | Core Features |
@@ -52,8 +52,8 @@ Here’s a summary of the three available services:
5252
| Build your own solution with Azure OpenAI Service | Build a solution with any Azure-hosted LLM models, Fully control on model, prompt and tools | Developers aiming to build, own and manage a solution that require fine grained control on models, costs and prompts | Maximum flexibility and control | Multiple options to plug and play with model choice, prompt tuning, workflow defination with complete flexibility in building each component |
5353

5454

55-
## Capabilites Overview
56-
Here's an capabilites overview for all three services.
55+
## Service capabilities
56+
Here's a capabilities overview for all three services.
5757

5858
| Capabilities | Document Intelligence | Content Understanding | Build Your Own with AOAI |
5959
|--------------------------|----------------------------------------------------|----------------------------------------------------|---------------------------------------------------|
@@ -90,9 +90,9 @@ Here's an capabilites overview for all three services.
9090

9191
---
9292

93-
## Guided Scenario Walkthrough
93+
## Guided scenario walkthrough
9494

95-
Let's take a look at various categories of document processing scenarios that you may encounter and how to navigate each of such scenarios with the best fitted service. Here are a few examples of different document processing scenarios, the associated challenges and the considerations for building an effective solution. If the document type you are processing is supported by a prebuilt, you should start there and only choose to build a custom solution if the prebuilt schema does not cover your scenario.
95+
Let's take a look at various categories of document processing scenarios that you may encounter and how to navigate each one with the best fitted service. Here are a few examples of different document processing scenarios, the associated challenges, and the considerations for building an effective solution. If the document type you are processing is supported by a prebuilt, you should start there and only choose to build a custom solution if the prebuilt schema does not cover your scenario.
9696

9797
Considerations:
9898

@@ -104,7 +104,7 @@ Considerations:
104104
* Build Effort: Effort to build the model including handling complex logic, business requriements, labeling data and putting complex workflows together.
105105
* Total cost of ownership: Comparative view of infrastructure, management and maintenance costs for your use case with handling scale.
106106

107-
### Scenario 1: Processing a Standardized, Single-Format Form
107+
### Scenario 1: Processing a standardized, single-format form
108108

109109
**Business Process**:
110110
Extract fixed fields like Name, Date of Birth, Address, Account Number, and other details from forms with identical templates every time. **Examples**:
@@ -120,7 +120,7 @@ Extract fixed fields like Name, Date of Birth, Address, Account Number, and othe
120120

121121
---
122122

123-
### Scenario 2: Managing Document with Few Known Variants
123+
### Scenario 2: Managing document with few known variants
124124

125125
**Business Process**:
126126
Extract consistent fields (name, amount, policy number, claim date) across a small, known set of templates. **Examples**:
@@ -140,7 +140,7 @@ Extract consistent fields (name, amount, policy number, claim date) across a sma
140140

141141
---
142142

143-
### Scenario 3: High-Variation Semi-Structured Documents
143+
### Scenario 3: High-variation semi-structured documents
144144

145145
**Business Process**:
146146
Extract key fields like Invoice Number, Vendor Name, Total Amount, Line Items, and Dates from highly varied documents with inconsistent templates. **Examples**:
@@ -159,7 +159,7 @@ Extract key fields like Invoice Number, Vendor Name, Total Amount, Line Items, a
159159
* Build a custom solution: Build and configure the components needed for parsing the documents (Layout), extracting the fields and any build any post-processing needed. The solution will need to be tested and verified with different variations and you will need to scale and manage the deployed solution. With no confidence scores, you are either accepting all results or reviewing all results based on the expected error rate.Shape
160160
---
161161

162-
### Scenario 4: Extracting Insights from Unstructured Documents
162+
### Scenario 4: Extracting insights from unstructured documents
163163

164164
**Business Process**:
165165
Extract, generate abstract details like obligations, summaries, inferencing details like contract parties, risk indicators, sentiment, or decisions from free-text, multi-page, narrative documents. **Examples**:
@@ -177,7 +177,7 @@ Extract, generate abstract details like obligations, summaries, inferencing deta
177177

178178
---
179179

180-
### Scenario 5: Multi-Document, Mixed Media Processing
180+
### Scenario 5: Multi-document, mixed media processing
181181

182182
**Business Process**:
183183
Aggregate content from diverse formats, cross-reference details, validate consistency (e.g., name matches across documents), and surface inconsistencies. **Examples**:

0 commit comments

Comments
 (0)