You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/content-understanding/document/overview.md
+10-2Lines changed: 10 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -28,12 +28,13 @@ Document analyzers can process complex documents in various formats and template
28
28
***Loan and mortgage applications**: Automate processing to enable quicker handling by banks, lenders, and government entities.
29
29
***Financial services**: Analyze complex documents like financial reports and asset management reports.
30
30
***Expense management**: Parse receipts and invoices from various retailers to validate expenses across different formats and templates.
31
+
***Document sets and knowledge base scenarios**: Extract key fields from document sets as a whole, and add reference data that handle tasks like validation and enrichment by applying multi-step reasoning.
31
32
32
33
## Document analyzer capabilities
33
34
34
35
:::image type="content" source="../media/document/document-capabilities.png" alt-text="Screenshot of document extraction flow.":::
35
36
36
-
### Content Extraction
37
+
### Content extraction
37
38
38
39
Content extraction forms the foundation of Azure AI Content Understanding's document analysis capabilities, transforming unstructured documents into structured, machine-readable data.
39
40
It precisely captures both printed and handwritten text while preserving the document's structure through advanced layout analysis.
@@ -48,12 +49,17 @@ It precisely captures both printed and handwritten text while preserving the doc
48
49
-**Paragraphs**: Detects and categorizes text segments based on their document context and role.
49
50
-**Tabular data**: Recognizes and extracts table structures, including complex formats with spanning cells and multi-page layouts.
50
51
-**Hierarchical sections**: Maps content organization through section headers and nested content relationships.
52
+
- RAG
53
+
-**RAG solutions**: Content extraction forms the foundation of effective RAG systems by transforming raw multimodal data into structured, searchable formats optimized for retrieval. Learn more about building RAG solutions on our [retrieval-augmented generation](../concepts/retrieval-augmented-generation.md) page.
51
54
52
55
### Field extraction
53
56
54
57
Field extraction empowers you to extract, classify, and generate structured data from various documents and forms, customized to meet your unique requirements. The process of transforming unstructured content into organized, actionable information simplifies data management, improves searchability, and supports automated workflows. For instance, you can seamlessly extract customer details, billing addresses, and itemized charges from invoices, or identify contractual parties, renewal dates, and payment terms in legal agreements. To maximize efficiency, you can utilize prebuilt analyzer templates—such as ones tailored for invoices—or design bespoke analyzers from scratch, enhancing precision through the labeling of more sample documents.
55
58
56
-
### Field extraction methods
59
+
The confidence and grounding API is an opt-in feature. Set `estimateFieldSourceAndConfidence` as `true` to opt in for confidence and grounding for field extraction.
60
+
61
+
62
+
#### Field extraction methods
57
63
58
64
Azure AI Content Understanding provides versatile methods for field extraction, enabling precise and tailored processing of document content:
-**Enhanced accuracy and reliability:** Achieve precise extraction and classification of critical business data, significantly reducing errors and improving operational efficiency across automated workflows.
79
85
86
+
-**Agents-ready:** Process your diverse input and deliver output in a standard format that's ready for an agent's workflow. Outputs can give your application an understanding of user intent, with data supported by a `strongly-typed` schema that makes it easier to quickly get data in a format ready for your code.
87
+
80
88
## Input requirements
81
89
For detailed information on supported input document formats, refer to our [Service quotas and limits](../service-limits.md) page.
0 commit comments