|
1 | 1 | ---
|
2 |
| -title: Azure AI Document Intelligence (formerly Form Recognizer) custom generative field extraction |
| 2 | +title: Azure AI Document Intelligence (formerly Form Recognizer) custom generative document field extraction |
3 | 3 | titleSuffix: Azure AI services
|
4 | 4 | description: Custom generative AI model extracts user-specified fields from documents across a wide variety of visual templates.
|
5 | 5 | author: laujan
|
6 | 6 | manager: nitinme
|
7 | 7 | ms.service: azure-ai-document-intelligence
|
8 |
| -ms.custom: |
9 |
| - - ignite-2023 |
10 | 8 | ms.topic: overview
|
11 |
| -ms.date: 08/07/2024 |
| 9 | +ms.date: 08/09/2024 |
12 | 10 | ms.author: lajanuar
|
13 | 11 | monikerRange: '>=doc-intel-4.0.0'
|
14 | 12 | ---
|
15 | 13 |
|
16 |
| -# Document Intelligence custom generative model |
| 14 | +# Document Field extraction - custom generative AI model |
17 | 15 |
|
18 |
| -[!INCLUDE [preview-version-notice](includes/preview-notice.md)] |
| 16 | +> [!IMPORTANT] |
| 17 | +> |
| 18 | +> * Document Intelligence public preview releases provide early access to features that are in active development. Features, approaches, and processes may change, prior to General Availability (GA), based on user feedback. |
| 19 | +> * The public preview version of Document Intelligence client libraries default to REST API version [**2024-07-31-preview**](/rest/api/aiservices/operation-groups?view=rest-aiservices-2024-07-31-preview&preserve-view=true) and is currently only available in the following Azure regions. |
| 20 | +> * **East US** |
| 21 | +> * **West US2** |
| 22 | +> * **West Europe** |
| 23 | +> * **North Central US** |
| 24 | +> |
| 25 | +> * **The new custom generative model in AI Studio is only available in the North Central US region**: |
19 | 26 |
|
20 |
| -The custom generative model combines the power of document understanding with Large Language Models (LLMs) and the rigor and schema from custom extraction capabilities. Custom generative extraction enables you to easily automate data extraction workflows for any type of document, with minimal labeling and greater accuracy and speed. |
| 27 | +The document field extraction (custom generative AI) model utilizes generative AI to extract user-specified fields from documents across a wide variety of visual templates. The custom generative AI model combines the power of document understanding with Large Language Models (LLMs) and the rigor and schema from custom extraction capabilities to create a model with high accuracy in minutes. With this generative model type, you can start with a single document and go through the schema addition and model creation process with minimal labeling. The custom generative model allows developers and enterprises to easily automate data extraction workflows with greater accuracy and speed for any type of document. The custom generative AI model excels in extracting simple fields from documents without labeled samples. However, providing a few labeled samples improves the extraction accuracy for complex fields and user-defined fields like tables. You can use the [REST API](/rest/api/aiservices/operation-groups?view=rest-aiservices-2024-07-31-preview&preserve-view=true) or client libraries to submit a document for analysis with a model build and use the custom generative process. |
21 | 28 |
|
22 |
| -## Custom generative model key features |
| 29 | +## Custom generative AI model benefits |
23 | 30 |
|
24 | 31 | * **Automatic labeling**. Utilize large language models (LLM) and extract user-specified fields for various document types and visual templates.
|
| 32 | + |
25 | 33 | * **Improved Generalization**. Extract data from unstructured data and varying document templates with higher accuracy.
|
26 |
| -* **Grounded results**. Localize the data extracted in documents and ensure the response is generated from the content and enables human review workflows. |
27 |
| -* **High confidence scores**. Use confidence scores and quickly filter high quality extracted data for downstream processing and lower manual review time. |
| 34 | + |
| 35 | +* **Grounded results**. Localize the data extracted in the documents. Custom generative models ground the results where applicable, ensuring the response is generated from the content and enable human review workflows. |
| 36 | + |
| 37 | +* **Confidence scores**. Use confidence scores for each extracted field to, filter high quality extracted data, maximize straight through processing of documents and minimize human review costs. |
28 | 38 |
|
29 | 39 | ### Common use cases
|
30 | 40 |
|
31 |
| -* **Contract Lifecycle Management**. Build a generative model and extract the fields, clauses, and obligations from a wide array of contract types. |
32 |
| -* **Loan & Mortgage Applications**. Automation of loan and mortgage application process enables banks, lenders, and government entities to quickly process loan and mortgage application. |
33 |
| -* **Financial Services**. Analyze complex documents like financial reports and asset management reports. |
34 |
| -* **Expense management**. The custom generative model can extract expenses, receipts, and invoices with varying formats and templates. |
| 41 | +* **Contract Lifecycle Management**. Build a generative model and extract the fields, clauses, and obligations from a wide array of contract types. |
| 42 | + |
| 43 | +* **Loan & Mortgage Applications**. Automation of loan and mortgage application process enables banks, lenders, and government entities to quickly process loan and mortgage application. |
| 44 | + |
| 45 | +* **Financial Services**. With the custom generative AI model, analyze complex documents like financial reports and asset management reports. |
| 46 | + |
| 47 | +* **Expense management**. Receipts and invoices from various retailers and businesses need to be parsed to validate the expenses. The custom generative AI model can extract expenses across different formats and documents with varying templates. |
| 48 | + |
| 49 | +### Managing the training dataset |
| 50 | + |
| 51 | +With our other custom models, you need to maintain the dataset, add new samples, and train the model for accuracy improvements. With the custom generative AI model, the labeled documents are transformed, encrypted, and stored as part of the model. This process ensures that the model can continually use the labeled samples to improve the extraction quality. As with other custom models, models are stored in Microsoft storage, and you can delete them anytime. |
| 52 | + |
| 53 | +The Document Intelligence service does manage your datasets, but your documents are stored encrypted and only used to improve the model results for your specific model. A service-manged key can be used to encrypt your data or it can be optionally encrypted with a customer managed key. The change in management and lifecycle of the dataset only applies to custom generative models. |
35 | 54 |
|
36 | 55 | ## Model capabilities
|
37 | 56 |
|
38 |
| -The Custom generative model currently supports dynamic table with the `2024-07-31-preview` and the following fields: |
| 57 | +Field extraction custom generative model currently supports dynamic table with the `2024-07-31-preview` and the following fields: |
39 | 58 |
|
40 | 59 | | Form fields | Selection marks | Tabular fields | Signature | Region labeling | Overlapping fields |
|
41 | 60 | |:--:|:--:|:--:|:--:|:--:|:--:|
|
42 | 61 | |Supported| Supported |Supported| Unsupported |Unsupported |Supported|
|
43 | 62 |
|
44 | 63 | ## Build mode
|
45 | 64 |
|
46 |
| -The `build custom model` operation supports custom _template_, _neural_ and _generative_ models, _see_ [Custom model build mode](concept-custom.md#build-mode): |
| 65 | +The `build custom model` operation supports custom **template**, **neural**, and **generative** models, _see_ [Custom model build mode](concept-custom.md#build-mode). Here are the differences in the model types: |
| 66 | + |
| 67 | +* **Custom generative AI models** can process complex documents with various formats, varied templates, and unstructured data. |
| 68 | + |
| 69 | +* **Custom neural models** support complex document processing and also support more variance in pages for structured and semi-structured documents. |
47 | 70 |
|
48 |
| -* **Custom generative models** can process complex documents in various formats, templates, and unstructured data. |
49 |
| -* **Custom neural models** support complex document processing and also support more variance in page for structured and semi-structured documents. |
50 | 71 | * **Custom template models** rely on consistent visual templates, such as questionnaires or applications, to extract the labeled data.
|
51 | 72 |
|
52 | 73 | ## Languages and locale support
|
53 | 74 |
|
54 |
| -The custom generative model `2024-07-31-preview` version supports the **en-us** locale. For more information on language support, *see* [Language support - custom models](language-support-custom.md). |
| 75 | +Field extraction custom generative model `2024-07-31-preview` version supports the **en-us** locale. For more information on language support, _see_ [Language support - custom models](language-support-custom.md). |
55 | 76 |
|
56 | 77 | ## Region support
|
57 | 78 |
|
58 |
| -The custom generative model `2024-07-31-preview` version is only available in `North Central US`. |
| 79 | +Field extraction custom generative model `2024-07-31-preview` version is only available in `North Central US`. |
59 | 80 |
|
60 |
| -## Input requirements |
| 81 | +## Input requirements |
61 | 82 |
|
62 | 83 | [!INCLUDE [input requirements](./includes/input-requirements.md)]
|
63 | 84 |
|
64 | 85 | ## Best practices
|
65 | 86 |
|
66 | 87 | * **Representative data**. Use representative documents that target actual data distribution, and train a high-quality custom generative model. For example, if the target document includes partially filled tabular fields, add training documents that consist of partially filled tables. Or if field is named date, values for this field should be a date as random strings can affect model performance.
|
| 88 | + |
67 | 89 | * **Field naming**. Choose a precise field name that represents the field values. For example, for a field value containing the Transaction Date, consider naming the field _TransactionDate_ instead of `Date1`.
|
68 |
| -* **Field Description**. Provide more contextual information in description to help clarify the field that needs to be extracted. Examples include location in the document, potential field labels it may be associated with, ways to differentiate with other terms that could be ambiguous. |
69 |
| -* **Variation**. Custom generative models can generalize across different document templates of the same document type. As a best practice, create a single model for all variations of a document type. Ideally, include a visual template for each type, especially for ones that |
| 90 | + |
| 91 | +* **Field Description**. Provide more contextual information in description to help clarify the field that needs to be extracted. Examples include location in the document, potential field labels it can be associated with, and ways to differentiate with other terms that could be ambiguous. |
| 92 | + |
| 93 | +* **Variation**. Custom generative models can generalize across different document templates of the same document type. As a best practice, create a single model for all variations of a document type. Ideally, include a visual template for each type, especially for ones that |
70 | 94 |
|
71 | 95 | ## Service guidance
|
72 | 96 |
|
73 | 97 | * The Custom Generative preview model doesn't currently support fixed table and signature extraction.
|
| 98 | + |
74 | 99 | * Inference on the same document could yield slightly different results across calls and is a known limitation of current `GPT` models.
|
| 100 | + |
75 | 101 | * Confidence scores for each field might vary. We recommend testing with your representative data to establish the confidence thresholds for your scenario.
|
76 |
| -* Grounding, especially for tabular fields, is challenging and might not be perfect in some cases. |
| 102 | + |
| 103 | +* Grounding, especially for tabular fields, is challenging and might not be perfect in some cases. |
| 104 | + |
77 | 105 | * Latency for large documents is high and a known limitation in preview.
|
| 106 | + |
78 | 107 | * Composed models don't support custom generative extraction.
|
79 | 108 |
|
80 | 109 | ## Training a model
|
81 | 110 |
|
82 | 111 | Custom generative models are available with the `2024-07-31-preview` version and later models.
|
83 | 112 |
|
84 |
| -The `build operation` to train model supports the ```buildMode``` property, to train a custom generative model, set the ```buildMode``` to ```generative```. |
| 113 | +The `build operation` to train model supports the `buildMode` property, to train a custom generative model, set the `buildMode` to `generative`. |
85 | 114 |
|
86 | 115 | ```bash
|
87 | 116 |
|
|
0 commit comments