Skip to content

Commit 2df87e5

Browse files
authored
Merge pull request #4944 from laujan/jp-4942-overview
Jp 4942 overview
2 parents 52771eb + 44d7a1b commit 2df87e5

File tree

1 file changed

+10
-2
lines changed
  • articles/ai-services/content-understanding/document

1 file changed

+10
-2
lines changed

articles/ai-services/content-understanding/document/overview.md

Lines changed: 10 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -28,12 +28,13 @@ Document analyzers can process complex documents in various formats and template
2828
* **Loan and mortgage applications**: Automate processing to enable quicker handling by banks, lenders, and government entities.
2929
* **Financial services**: Analyze complex documents like financial reports and asset management reports.
3030
* **Expense management**: Parse receipts and invoices from various retailers to validate expenses across different formats and templates.
31+
* **Document sets and knowledge base scenarios**: Extract key fields from document sets as a whole, and add reference data that handle tasks like validation and enrichment by applying multi-step reasoning.
3132

3233
## Document analyzer capabilities
3334

3435
:::image type="content" source="../media/document/document-capabilities.png" alt-text="Screenshot of document extraction flow.":::
3536

36-
### Content Extraction
37+
### Content extraction
3738

3839
Content extraction forms the foundation of Azure AI Content Understanding's document analysis capabilities, transforming unstructured documents into structured, machine-readable data.
3940
It precisely captures both printed and handwritten text while preserving the document's structure through advanced layout analysis.
@@ -48,12 +49,17 @@ It precisely captures both printed and handwritten text while preserving the doc
4849
- **Paragraphs**: Detects and categorizes text segments based on their document context and role.
4950
- **Tabular data**: Recognizes and extracts table structures, including complex formats with spanning cells and multi-page layouts.
5051
- **Hierarchical sections**: Maps content organization through section headers and nested content relationships.
52+
- RAG
53+
- **RAG solutions**: Content extraction forms the foundation of effective RAG systems by transforming raw multimodal data into structured, searchable formats optimized for retrieval. Learn more about building RAG solutions on our [retrieval-augmented generation](../concepts/retrieval-augmented-generation.md) page.
5154

5255
### Field extraction
5356

5457
Field extraction empowers you to extract, classify, and generate structured data from various documents and forms, customized to meet your unique requirements. The process of transforming unstructured content into organized, actionable information simplifies data management, improves searchability, and supports automated workflows. For instance, you can seamlessly extract customer details, billing addresses, and itemized charges from invoices, or identify contractual parties, renewal dates, and payment terms in legal agreements. To maximize efficiency, you can utilize prebuilt analyzer templates—such as ones tailored for invoices—or design bespoke analyzers from scratch, enhancing precision through the labeling of more sample documents.
5558

56-
### Field extraction methods
59+
The confidence and grounding API is an opt-in feature. Set `estimateFieldSourceAndConfidence` as `true` to opt in for confidence and grounding for field extraction.
60+
61+
62+
#### Field extraction methods
5763

5864
Azure AI Content Understanding provides versatile methods for field extraction, enabling precise and tailored processing of document content:
5965

@@ -77,6 +83,8 @@ Content Understanding delivers powerful document analysis capabilities designed
7783

7884
- **Enhanced accuracy and reliability:** Achieve precise extraction and classification of critical business data, significantly reducing errors and improving operational efficiency across automated workflows.
7985

86+
- **Agents-ready:** Process your diverse input and deliver output in a standard format that's ready for an agent's workflow. Outputs can give your application an understanding of user intent, with data supported by a `strongly-typed` schema that makes it easier to quickly get data in a format ready for your code.
87+
8088
## Input requirements
8189
For detailed information on supported input document formats, refer to our [Service quotas and limits](../service-limits.md) page.
8290

0 commit comments

Comments
 (0)