Commit 6db0740

Merge pull request #1319 from laujan/add-varsha-generative-updates
Add varsha document overview updates
2 parents f71d5e1 + 6ced681 commit 6db0740

4 files changed

+97
-104
lines changed

articles/ai-services/content-understanding/document/custom/overview.md

Lines changed: 0 additions & 97 deletions
This file was deleted.
Lines changed: 94 additions & 0 deletions
@@ -0,0 +1,94 @@
1+
---
2+
title: Azure AI Content Understanding custom document extraction overview
3+
titleSuffix: Azure AI services
4+
description: Learn about Azure AI Content Understanding document solutions.
5+
author: laujan
6+
ms.author: lajanuar
7+
manager: nitinme
8+
ms.service: azure
9+
ms.topic: overview
10+
ms.date: 11/19/2024
11+
ms.custom: ignite-2024-understanding-release
12+
---
13+
14+
# Content Understanding document solutions overview (preview)
15+
16+
> [!IMPORTANT]
17+
>
18+
> * Azure AI Content Understanding is available in preview. Public preview releases provide early access to features that are in active development.
19+
> * Features, approaches, and processes may change or have constrained capabilities prior to general availability (GA).
20+
> * For more information, *see* [**Supplemental Terms of Use for Microsoft Azure Previews**](https://azure.microsoft.com/support/legal/preview-supplemental-terms).
21+
22+
23+
Content Understanding is a cloud-based [Azure AI service](../../what-are-ai-services.md) that extracts content, structure, and specific fields from documents and forms. It offers a suite of APIs together with a studio experience.
24+
25+
By using Content Understanding, organizations can streamline data collection and processing, which supports operational efficiency, data-driven decision-making, and innovation. Its customizable APIs extract fields or content from documents and forms and can be adapted to specific business needs.
26+
27+
28+
## Common use cases
29+
30+
Document extraction can process complex documents with varied formats, templates, and unstructured data:
31+
32+
* **Contract Lifecycle Management**. Use the document field extraction model to extract key fields, clauses, and obligations from a wide array of contract types.
33+
34+
* **Loan and Mortgage Applications**. Automating the loan and mortgage application process enables banks, lenders, and government entities to process applications more quickly.
35+
36+
* **Financial Services**. Analyze complex documents, such as financial reports and asset management reports, with the new document field extraction model.
37+
38+
* **Expense Management**. Receipts and invoices from various retailers and businesses must be parsed to validate expenses. Document field extraction can extract expenses from documents with different formats and varying templates.
39+
40+
41+
:::image type="content" source="../media/document/extraction-overview.png" alt-text="Screenshot of document extraction flow.":::
42+
43+
44+
## Document extraction capabilities
45+
46+
Content extraction captures both printed and handwritten text from forms and documents, delivering content that is ready to use or to adapt for further development within your organization.
47+
48+
### Add-on capabilities
49+
50+
Use the add-on features to extend the results with more information extracted from your documents. Some add-on features incur extra costs. You can enable or disable these optional features depending on the document extraction scenario. To enable a feature, add the associated feature name to the `features` query string property. To enable more than one add-on feature on a request, provide a comma-separated list of features.
51+
52+
* Layout: The `enableLayout` capability extracts layout information, such as paragraphs, sections, and tables, along with the text.
53+
54+
* Barcode: The `enableBarcode` capability extracts all identified barcodes in the documents.
55+
56+
* Formula: The `enableFormula` capability extracts all identified formulas, such as mathematical equations, from the documents.
57+
58+
* Locales: The `languages` feature predicts the primary language of each text line, along with a confidence score.
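
As a rough illustration of the comma-separated `features` property described above, the following Python sketch builds the query string from a list of add-on names. The feature names are the ones listed in this article; the helper name `build_features_query` is hypothetical, and the request URL it would be appended to isn't shown here.

```python
from urllib.parse import urlencode

# Add-on feature names as listed in this article. The service may accept
# a different or larger set; treat this tuple as an assumption.
KNOWN_FEATURES = ("enableLayout", "enableBarcode", "enableFormula", "languages")


def build_features_query(features):
    """Return a 'features=...' query string for the given add-on features.

    Hypothetical helper for illustration only; it isn't part of any SDK.
    """
    unknown = [name for name in features if name not in KNOWN_FEATURES]
    if unknown:
        raise ValueError(f"Unknown add-on feature(s): {', '.join(unknown)}")
    # Drop duplicates while keeping the caller's order.
    unique = list(dict.fromkeys(features))
    if not unique:
        return ""
    # safe="," keeps the separator as a literal comma instead of %2C.
    return urlencode({"features": ",".join(unique)}, safe=",")


if __name__ == "__main__":
    print(build_features_query(["enableLayout", "enableBarcode"]))
```

Validating names on the client side is a design choice in this sketch, not a documented requirement; it only catches typos before a request is sent.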
59+
60+
### Field extraction
61+
62+
Document field extraction allows you to extract distinct data, specific to your use case, from forms and documents with a wide variety of visual templates. For example, you can extract customer name, billing address, due date, amount due, line items, and other key data from an invoice. You can extract termination date, agreement date, lease terms, and other key data from contracts.
63+
You can train a field extraction model with a single document, or add more labeled sample documents to improve field extraction for your document type.
64+
65+
Note: The layout add-on capability must be enabled for field extraction.
66+
67+
68+
## Benefits of Content Understanding document extraction
69+
70+
* **Accuracy and Reliability:** Document extraction AI models are built to deliver accurate data extraction, reducing errors and improving efficiency.
71+
72+
* **Scalability:** Easily scale your document processing capabilities to meet the growing demands of your business.
73+
74+
* **Customizable:** Tailor document extraction models to your specific requirements, ensuring the perfect fit for your unique workflows.
75+
76+
* **Grounded results:** Locate the extracted data in the source documents, so that each response is traceable to the content and can support human review workflows.
77+
78+
* **High confidence scores:** Use confidence scores to maximize efficiency and minimize costs in automation workflows.
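
One way confidence scores might feed an automation workflow is to accept high-confidence fields automatically and send the rest to human review. The Python sketch below assumes a simplified field shape (`{"value": ..., "confidence": ...}`) and a threshold of 0.8; both are illustrative assumptions, not the service's response schema or a recommended setting.

```python
def split_by_confidence(fields, threshold=0.8):
    """Split extracted fields into auto-accepted and needs-review groups.

    `fields` maps a field name to a dict with "value" and "confidence" keys.
    This shape and the default threshold are assumptions for illustration.
    """
    accepted, needs_review = {}, {}
    for name, field in fields.items():
        # Fields at or above the threshold skip manual review.
        target = accepted if field["confidence"] >= threshold else needs_review
        target[name] = field["value"]
    return accepted, needs_review


if __name__ == "__main__":
    sample = {
        "CustomerName": {"value": "Contoso", "confidence": 0.97},
        "AmountDue": {"value": "110.00", "confidence": 0.62},
    }
    accepted, needs_review = split_by_confidence(sample)
    print("accepted:", accepted)
    print("needs review:", needs_review)
```

The right threshold depends on the cost of an error in your scenario, so it's worth tuning against labeled samples rather than fixing it up front.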
79+
80+
## Input requirements
81+
82+
For a complete list of Content Understanding supported input formats, see our [Service quotas and limits](../service-limits.md) page.
83+
84+
## Supported languages and regions
85+
86+
For a complete list of supported languages and regions, see our [Language and region support](../language-region-support.md) page.
87+
88+
## Next step
89+
90+
Get started using Content Understanding APIs with our [Content Understanding REST API quickstart](../quickstart/use-rest-api.md) and learn how to add more samples with labels to improve field extraction quality (link).
91+
92+
93+
94+

articles/ai-services/content-understanding/toc.yml

Lines changed: 3 additions & 7 deletions
@@ -30,18 +30,14 @@ items:
3030
displayName: speech, audio, voice, recognition, synthesis, speaker, identification, verification, diarization, transcription, translation, language, understanding, sentiment, analysis, emotion, detection, pronunciation, model
3131
href: audio/overview.md
3232
- name: Video
33-
displayName: video, audio, voice, recognition, synthesis, speaker, identification, verification, diarization, transcription, translation, language, understanding, sentiment, analysis, emotion, detection, pronunciation, model
3433
href: video/overview.md
34+
displayName: video, audio, voice, recognition, synthesis, speaker, identification, verification, diarization, transcription, translation, language, understanding, sentiment, analysis, emotion, detection, pronunciation, model
3535
- name: Image
3636
displayName: image, OCR, optical character recognition, text, extraction, analysis, detection, recognition, model
3737
href: image/overview.md
3838
- name: Document
39-
items:
40-
- name: Custom
41-
items:
42-
- name: Custom document extraction overview
43-
displayName: document, text, images, video, audio, multi-modal, visual, structured, content, field, extraction
44-
href: document/custom/overview.md
39+
displayName: document, text, images, video, audio, visual, structured, content, field, extraction
40+
href: document/overview.md
4541
- name: Scenario Templates
4642
items:
4743
- name: Post-call analytics workflow
