|
| 1 | +--- |
| 2 | +title: Azure AI Content Understanding custom document extraction overview |
| 3 | +titleSuffix: Azure AI services |
| 4 | +description: Learn about Azure AI Content Understanding document solutions. |
| 5 | +author: laujan |
| 6 | +ms.author: lajanuar |
| 7 | +manager: nitinme |
| 8 | +ms.service: azure |
| 9 | +ms.topic: overview |
| 10 | +ms.date: 11/19/2024 |
| 11 | +ms.custom: ignite-2024-understanding-release |
| 12 | +--- |
| 13 | + |
| 14 | +# Content Understanding document solutions overview (preview) |
| 15 | + |
| 16 | +> [!IMPORTANT] |
| 17 | +> |
| 18 | +> * Azure AI Content Understanding is available in preview. Public preview releases provide early access to features that are in active development. |
| 19 | +> * Features, approaches, and processes may change or have constrained capabilities, prior to General Availability (GA). |
| 20 | +> * For more information, *see* [**Supplemental Terms of Use for Microsoft Azure Previews**](https://azure.microsoft.com/support/legal/preview-supplemental-terms). |
| 21 | +
|
| 22 | + |
| 23 | +Content Understanding is a cloud-based [Azure AI service](../../what-are-ai-services.md) designed to facilitate the efficient extraction of content, structure, and specific fields from documents and forms. It offers a comprehensive suite of APIs combined with an intuitive studio experience for optimal efficiency. |
| 24 | + |
| 25 | +By using Content Understanding, organizations can streamline data collection and processing, driving operational efficiency, data-driven decision-making, and innovation. With customizable APIs, Content Understanding allows for easy extraction of fields or content from documents and forms, adaptable to specific business needs. |
| 26 | + |
| 27 | + |
| 28 | +## Common use cases |
| 29 | + |
| 30 | +Document Extraction can process complex documents with various formats, templates, and unstructured data: |
| 31 | + |
| 32 | +* **Contract Lifecycle Management**. Use document field extraction model to extract key fields, clauses, and obligations from a wide array of contract types. |
| 33 | + |
| 34 | +* **Loan and Mortgage Applications**. Automation of loan and mortgage application process enables banks, lenders, and government entities to process loan and mortgage applications quicker. |
| 35 | + |
| 36 | +* **Financial Services**. Analyze complex documents like financial reports and asset management reports, with the new document field extraction model. |
| 37 | + |
| 38 | +* **Expense management**. Receipts and invoices from various retailers and businesses need to be parsed to validate the expenses. Document field extraction can extract expenses across different formats and documents with varying templates. |
| 39 | + |
| 40 | + |
| 41 | +:::image type="content" source="../media/document/extraction-overview.png" alt-text="Screenshot of document extraction flow."::: |
| 42 | + |
| 43 | + |
| 44 | +## Document extraction capabilities |
| 45 | + |
| 46 | +Content extraction enables the extraction of both printed and handwritten text from forms and documents, delivering business-ready content that is immediately actionable, usable, or adaptable for further development within your organization. |
| 47 | + |
| 48 | +### Add-on capabilities |
| 49 | + |
| 50 | +Use the add-on features to extend the results to include more features extracted from your documents. Some add-on features incur extra costs. These optional features can be enabled and disabled depending on the scenario of the document extraction. To enable a feature, add the associated feature name to the features query string property. You can enable more than one add-on feature on a request by providing a comma-separated list of features. |
| 51 | + |
| 52 | +* Layout: Extract layout information like Paragraphs, Sections, Tables, and more along with text with 'enableLayout' add-on capability. |
| 53 | + |
| 54 | +* Barcode: The "enableBarcode" capability extracts all identified barcodes in the documents. |
| 55 | + |
| 56 | +* Formula: The 'enableFormula' capability extracts all identified formulas, such as mathematical equations from the documents. |
| 57 | + |
| 58 | +* Locales: Enabling the 'languages' feature predicts the detected primary language for each text line along with the confidence score. |
| 59 | + |
| 60 | +### Field extraction |
| 61 | + |
| 62 | +Document field extraction allows you to extract distinct data from forms and documents, with a wide variety of visual templates, specific to your use case. For example, you can extract customer name, billing address, due date, and amount due, line items, and other key data from an invoice. You can extract termination date, agreement date, lease terms, and other key data from contracts. |
| 63 | +You can train field extraction model with a single document or can add more sample documents with labeling to improve the field extraction for your document type. |
| 64 | + |
| 65 | +Note: The layout add-on capability must be enabled by default for Field extraction |
| 66 | + |
| 67 | + |
| 68 | +## Benefits of Content Understanding document extraction |
| 69 | + |
| 70 | +* **Accuracy and Reliability:** Document extraction AI models are built to deliver accurate data extraction, reducing errors and improving efficiency. |
| 71 | + |
| 72 | +* **Scalability:** Easily scale your document processing capabilities to meet the growing demands of your business. |
| 73 | + |
| 74 | +* **Customizable:** Tailor document extraction models to your specific requirements, ensuring the perfect fit for your unique workflows. |
| 75 | + |
| 76 | +* **Grounded results:** Localize the data extracted in the documents, ensuring the response is generated from the content, to enable human review workflows. |
| 77 | + |
| 78 | +* **High confidence scores:** Maximize efficiency and minimize costs in automation workflows, using confidence scores. |
| 79 | + |
| 80 | +## Input requirements |
| 81 | + |
| 82 | +For a complete list of Content Understanding supported input formats, see our [Service quotas and limits](../service-limits.md) page. |
| 83 | + |
| 84 | +## Supported languages and regions |
| 85 | + |
| 86 | +For a complete list of supported languages and regions, see our [Language and region support](../language-region-support.md) page. |
| 87 | + |
| 88 | +## Next step |
| 89 | + |
| 90 | +Get started using Content Understanding APIs with our [Content Understanding REST API quickstart](../quickstart/use-rest-api.md) and learn how to add more samples with labels to improve field extraction quality (link). |
| 91 | + |
| 92 | + |
| 93 | + |
| 94 | + |
0 commit comments