|
| 1 | +--- |
| 2 | +title: "Quickstart: Document intelligence Studio | v3.0" |
| 3 | +titleSuffix: Azure Applied AI Services |
| 4 | +description: Form and document processing, data extraction, and analysis using Document intelligence Studio |
| 5 | +author: laujan |
| 6 | +manager: nitinme |
| 7 | +ms.service: applied-ai-services |
| 8 | +ms.subservice: forms-recognizer |
| 9 | +ms.topic: quickstart |
| 10 | +ms.date: 03/03/2023 |
| 11 | +ms.author: lajanuar |
| 12 | +--- |
| 13 | + |
| 14 | +<!-- markdownlint-disable MD001 --> |
| 15 | + |
| 16 | +# Get started: Document intelligence Studio |
| 17 | + |
| 18 | +[Document intelligence Studio](https://formrecognizer.appliedai.azure.com/) is an online tool for visually exploring, understanding, and integrating features from the Document intelligence service in your applications. You can get started by exploring the pretrained models with sample or your own documents. You can also create projects to build custom template models and reference the models in your applications using the [Python SDK](get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true) and other quickstarts. |
| 19 | + |
| 20 | +> [!VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RE56n49] |
| 21 | +
|
| 22 | +## Prerequisites for new users |
| 23 | + |
| 24 | +* An active [**Azure account**](https://azure.microsoft.com/free/cognitive-services/). If you don't have one, you can [**create a free account**](https://azure.microsoft.com/free/). |
| 25 | +* A [**Document intelligence**](https://portal.azure.com/#create/Microsoft.CognitiveServicesFormRecognizer) or [**Cognitive Services multi-service**](https://portal.azure.com/#create/Microsoft.CognitiveServicesAllInOne) resource. |
| 26 | + |
| 27 | +> [!TIP] |
| 28 | +> Create a Cognitive Services resource if you plan to access multiple cognitive services under a single endpoint/key. For Document intelligence access only, create a Document intelligence resource. Please note that you'll need a single-service resource if you intend to use [Azure Active Directory authentication](../../../active-directory/authentication/overview-authentication.md). |
| 29 | +
|
| 30 | +## Models |
| 31 | + |
| 32 | +Prebuilt models help you add Document intelligence features to your apps without having to build, train, and publish your own models. You can choose from several prebuilt models, each of which has its own set of supported data fields. The choice of model to use for the analyze operation depends on the type of document to be analyzed. Document intelligence currently supports the following prebuilt models: |
| 33 | + |
| 34 | +#### Document analysis |
| 35 | + |
| 36 | +* [**General document**](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=document): extract text, tables, structure, key-value pairs. |
| 37 | +* [**Layout**](https://formrecognizer.appliedai.azure.com/studio/layout): extract text, tables, selection marks, and structure information from documents (PDF, TIFF) and images (JPG, PNG, BMP). |
| 38 | +* [**Read**](https://formrecognizer.appliedai.azure.com/studio/read): extract text lines, words, their locations, detected languages, and handwritten style if detected from documents (PDF, TIFF) and images (JPG, PNG, BMP). |
| 39 | + |
| 40 | +#### Prebuilt |
| 41 | + |
| 42 | +* [**Invoice**](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=invoice): extract text, selection marks, tables, key-value pairs, and key information from invoices. |
| 43 | +* [**Receipt**](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=receipt): extract text and key information from receipts. |
| 44 | +* [**Health insurance card**](https://formrecognizer.appliedai.azure.com/studio): extract insurer, member, prescription, group number and other key information from US health insurance cards. |
| 45 | +* [**W-2**](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=tax.us.w2): extract text and key information from W-2 tax forms. |
| 46 | +* [**ID document**](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=idDocument): extract text and key information from driver licenses and international passports. |
| 47 | +* [**Business card**](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=businessCard): extract text and key information from business cards. |
| 48 | + |
| 49 | +#### Custom |
| 50 | + |
| 51 | +* [**Custom extraction models**](https://formrecognizer.appliedai.azure.com/studio): extract information from forms and documents with custom extraction models. Quickly train a model by labeling as few as five sample documents. |
| 52 | +* [**Custom classification model**](https://formrecognizer.appliedai.azure.com/studio): train a custom classifier to distinguish between the different document types within your applications. Quickly train a model with as few as two classes and five samples per class. |
| 53 | + |
| 54 | +#### Gated preview models |
| 55 | + |
| 56 | +> [!NOTE] |
| 57 | +> To request access for gated preview models in Document intelligence Studio, complete and submit the [**Document intelligence private preview request form**](https://aka.ms/form-recognizer/preview/survey). |
| 58 | +
|
| 59 | +* [**General document with query fields**](https://formrecognizer.appliedai.azure.com/studio): extract labels, values such as names, dates, and amounts from documents. |
| 60 | +* [**Contract**](https://formrecognizer.appliedai.azure.com/studio): extract the title and signatory party information (including names, references, and addresses) from contracts. |
| 61 | +* [**Vaccination card**](https://formrecognizer.appliedai.azure.com/studio): extract card holder name, health provider, and vaccination records from US COVID-19 vaccination cards. |
| 62 | +* [**US 1098 tax form**](https://formrecognizer.appliedai.azure.com/studio): extract mortgage interest information from US 1098 tax forms. |
| 63 | +* [**US 1098-E tax form**](https://formrecognizer.appliedai.azure.com/studio): extract student loan information from US 1098-E tax forms. |
| 64 | +* [**US 1098-T tax form**](https://formrecognizer.appliedai.azure.com/studio): extract tuition information from US 1098-T forms. |
| 65 | + |
| 66 | +> [!NOTE] |
| 67 | +> To request access for gated preview models in Document intelligence Studio, complete and submit the [**Document intelligence private preview request form**](https://aka.ms/form-recognizer/preview/survey). |
| 68 | +
|
| 69 | +After you've completed the prerequisites, navigate to [Document intelligence Studio General Documents](https://formrecognizer.appliedai.azure.com/studio/document). |
| 70 | + |
| 71 | +In the following example, we use the General Documents feature. The steps to use other pretrained features like [W2 tax form](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=tax.us.w2), [Read](https://formrecognizer.appliedai.azure.com/studio/read), [Layout](https://formrecognizer.appliedai.azure.com/studio/layout), [Invoice](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=invoice), [Receipt](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=receipt), [Business card](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=businessCard), and [ID documents](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=idDocument) models are similar. |
| 72 | + |
| 73 | + :::image border="true" type="content" source="../media/quickstarts/form-recognizer-general-document-demo-preview3.gif" alt-text="Selecting the General Document API to analysis a document in the Document intelligence Studio."::: |
| 74 | + |
| 75 | +1. Select a Document intelligence service feature from the Studio home page. |
| 76 | + |
| 77 | +1. This step is a one-time process unless you've already selected the service resource from prior use. Select your Azure subscription, resource group, and resource. (You can change the resources anytime in "Settings" in the top menu.) Review and confirm your selections. |
| 78 | + |
| 79 | +1. Select the Analyze button to run analysis on the sample document or try your document by using the Add command. |
| 80 | + |
| 81 | +1. Use the controls at the bottom of the screen to zoom in and out and rotate the document view. |
| 82 | + |
| 83 | +1. Observe the highlighted extracted content in the document view. Hover your mouse over the keys and values to see details. |
| 84 | + |
| 85 | +1. In the output section's Result tab, browse the JSON output to understand the service response format. |
| 86 | + |
| 87 | +1. In the Code tab, browse the sample code for integration. Copy and download to get started. |
| 88 | + |
| 89 | +## Added prerequisites for custom projects |
| 90 | + |
| 91 | +In addition to the Azure account and a Document intelligence or Cognitive Services resource, you need: |
| 92 | + |
| 93 | +### Azure Blob Storage container |
| 94 | + |
| 95 | +A **standard performance** [**Azure Blob Storage account**](https://portal.azure.com/#create/Microsoft.StorageAccount-ARM). You create containers to store and organize your training documents within your storage account. If you don't know how to create an Azure storage account with a container, following these quickstarts: |
| 96 | + |
| 97 | +* [**Create a storage account**](../../../storage/common/storage-account-create.md). When creating your storage account, make sure to select **Standard** performance in the **Instance details → Performance** field. |
| 98 | +* [**Create a container**](../../../storage/blobs/storage-quickstart-blobs-portal.md#create-a-container). When creating your container, set the **Public access level** field to **Container** (anonymous read access for containers and blobs) in the **New Container** window. |
| 99 | + |
| 100 | +### Configure CORS |
| 101 | + |
| 102 | +[CORS (Cross Origin Resource Sharing)](/rest/api/storageservices/cross-origin-resource-sharing--cors--support-for-the-azure-storage-services) needs to be configured on your Azure storage account for it to be accessible from the Document intelligence Studio. To configure CORS in the Azure portal, you need access to the CORS tab of your storage account. |
| 103 | + |
| 104 | +1. Select the CORS tab for the storage account. |
| 105 | + |
| 106 | + :::image type="content" source="../media/quickstarts/cors-setting-menu.png" alt-text="Screenshot of the CORS setting menu in the Azure portal."::: |
| 107 | + |
| 108 | +1. Start by creating a new CORS entry in the Blob service. |
| 109 | + |
| 110 | +1. Set the **Allowed origins** to `https://formrecognizer.appliedai.azure.com`. |
| 111 | + |
| 112 | + :::image type="content" source="../media/quickstarts/cors-updated-image.png" alt-text="Screenshot that shows CORS configuration for a storage account."::: |
| 113 | + |
| 114 | + > [!TIP] |
| 115 | + > You can use the wildcard character '*' rather than a specified domain to allow all origin domains to make requests via CORS. |
| 116 | +
|
| 117 | +1. Select all the available 8 options for **Allowed methods**. |
| 118 | + |
| 119 | +1. Approve all **Allowed headers** and **Exposed headers** by entering an * in each field. |
| 120 | + |
| 121 | +1. Set the **Max Age** to 120 seconds or any acceptable value. |
| 122 | + |
| 123 | +1. Select the save button at the top of the page to save the changes. |
| 124 | + |
| 125 | +CORS should now be configured to use the storage account from Document intelligence Studio. |
| 126 | + |
| 127 | +### Sample documents set |
| 128 | + |
| 129 | +1. Go to the [Azure portal](https://portal.azure.com/#home) and navigate as follows: **Your storage account** → **Data storage** → **Containers** |
| 130 | + |
| 131 | + :::image border="true" type="content" source="../media/sas-tokens/data-storage-menu.png" alt-text="Screenshot: Data storage menu in the Azure portal."::: |
| 132 | + |
| 133 | +1. Select a **container** from the list. |
| 134 | + |
| 135 | +1. Select **Upload** from the menu at the top of the page. |
| 136 | + |
| 137 | + :::image border="true" type="content" source="../media/sas-tokens/container-upload-button.png" alt-text="Screenshot: container upload button in the Azure portal."::: |
| 138 | + |
| 139 | +1. The **Upload blob** window appears. |
| 140 | + |
| 141 | +1. Select your file(s) to upload. |
| 142 | + |
| 143 | + :::image border="true" type="content" source="../media/sas-tokens/upload-blob-window.png" alt-text="Screenshot: upload blob window in the Azure portal."::: |
| 144 | + |
| 145 | +> [!NOTE] |
| 146 | +> By default, the Studio will use form documents that are located at the root of your container. However, you can use data organized in folders by specifying the folder path in the Custom form project creation steps. *See* [**Organize your data in subfolders**](../build-training-data-set.md#organize-your-data-in-subfolders-optional) |
| 147 | +
|
| 148 | +## Custom models |
| 149 | + |
| 150 | +To create custom models, you start with configuring your project: |
| 151 | + |
| 152 | +1. From the Studio home, select the Custom model card to open the Custom models page. |
| 153 | + |
| 154 | +1. Use the "Create a project" command to start the new project configuration wizard. |
| 155 | + |
| 156 | +1. Enter project details, select the Azure subscription and resource, and the Azure Blob storage container that contains your data. |
| 157 | + |
| 158 | +1. Review and submit your settings to create the project. |
| 159 | + |
| 160 | +1. From the labeling view, define the labels and their types that you're interested in extracting. |
| 161 | + |
| 162 | +1. Select the text in the document and select the label from the drop-down list or the labels pane. |
| 163 | + |
| 164 | +1. Label four more documents to get at least five documents labeled. |
| 165 | + |
| 166 | +1. Select the Train command and enter model name, select whether you want the custom template (form) or custom neural (document) model to start training your custom model. |
| 167 | + |
| 168 | +1. Once the model is ready, use the Test command to validate it with your test documents and observe the results. |
| 169 | + |
| 170 | +:::image border="true" type="content" source="../media/quickstarts/form-recognizer-custom-model-demo-v3p2.gif" alt-text="Document intelligence Custom model demo"::: |
| 171 | + |
| 172 | +### Labeling as tables |
| 173 | + |
| 174 | +> [!NOTE] |
| 175 | +> * With the release of API versions 2022-06-30-preview and later, custom template models will add support for [cross page tabular fields (tables)](../concept-custom-template.md#tabular-fields). |
| 176 | +> * With the release of API versions 2022-06-30-preview and later, custom neural models will support [tabular fields (tables)](../concept-custom-template.md#tabular-fields) and models trained with API version 2022-08-31, or later will accept tabular field labels. |
| 177 | +
|
| 178 | +1. Use the Delete command to delete models that aren't required. |
| 179 | + |
| 180 | +1. Download model details for offline viewing. |
| 181 | + |
| 182 | +1. Select multiple models and compose them into a new model to be used in your applications. |
| 183 | + |
| 184 | +Using tables as the visual pattern: |
| 185 | + |
| 186 | +For custom form models, while creating your custom models, you may need to extract data collections from your documents. Data collections may appear in a couple of formats. Using tables as the visual pattern: |
| 187 | + |
| 188 | +* Dynamic or variable count of values (rows) for a given set of fields (columns) |
| 189 | + |
| 190 | +* Specific collection of values for a given set of fields (columns and/or rows) |
| 191 | + |
| 192 | +**Label as dynamic table** |
| 193 | + |
| 194 | +Use dynamic tables to extract variable count of values (rows) for a given set of fields (columns): |
| 195 | + |
| 196 | +1. Add a new "Table" type label, select "Dynamic table" type, and name your label. |
| 197 | + |
| 198 | +1. Add the number of columns (fields) and rows (for data) that you need. |
| 199 | + |
| 200 | +1. Select the text in your page and then choose the cell to assign to the text. Repeat for all rows and columns in all pages in all documents. |
| 201 | + |
| 202 | +:::image border="true" type="content" source="../media/quickstarts/custom-tables-dynamic.gif" alt-text="Document intelligence labeling as dynamic table example"::: |
| 203 | + |
| 204 | +**Label as fixed table** |
| 205 | + |
| 206 | +Use fixed tables to extract specific collection of values for a given set of fields (columns and/or rows): |
| 207 | + |
| 208 | +1. Create a new "Table" type label, select "Fixed table" type, and name it. |
| 209 | + |
| 210 | +1. Add the number of columns and rows that you need corresponding to the two sets of fields. |
| 211 | + |
| 212 | +1. Select the text in your page and then choose the cell to assign it to the text. Repeat for other documents. |
| 213 | + |
| 214 | +:::image border="true" type="content" source="../media/quickstarts/custom-tables-fixed.gif" alt-text="Document intelligence Labeling as fixed table example"::: |
| 215 | + |
| 216 | +### Signature detection |
| 217 | + |
| 218 | +>[!NOTE] |
| 219 | +> Signature fields are currently only supported for custom template models. When training a custom neural model, labeled signature fields are ignored. |
| 220 | +
|
| 221 | +To label for signature detection: (Custom form only) |
| 222 | + |
| 223 | +1. In the labeling view, create a new "Signature" type label and name it. |
| 224 | + |
| 225 | +1. Use the Region command to create a rectangular region at the expected location of the signature. |
| 226 | + |
| 227 | +1. Select the drawn region and choose the Signature type label to assign it to your drawn region. Repeat for other documents. |
| 228 | + |
| 229 | +:::image border="true" type="content" source="../media/quickstarts/custom-signature.gif" alt-text="Document intelligence labeling for signature detection example"::: |
| 230 | + |
| 231 | +## Next steps |
| 232 | + |
| 233 | +* Follow our [**Document intelligence v3.0 migration guide**](../v3-migration-guide.md) to learn the differences from the previous version of the REST API. |
| 234 | +* Explore our [**v3.0 SDK quickstarts**](get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true) to try the v3.0 features in your applications using the new SDKs. |
| 235 | +* Refer to our [**v3.0 REST API quickstarts**](get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true) to try the v3.0 features using the new REST API. |
| 236 | + |
| 237 | +[Get started with the Document intelligence Studio](https://formrecognizer.appliedai.azure.com). |
0 commit comments