You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/applied-ai-services/form-recognizer/concept-read.md
+14-9Lines changed: 14 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,15 +15,15 @@ ms.custom: ignite-fall-2021
15
15
16
16
# Form Recognizer read model
17
17
18
-
The prebuilt-read model extracts printed and handwritten textual elements including lines, words, locations, and detected languages from documents (PDF and TIFF) and images (JPG, PNG, and BMP). The read model is the foundation for all Form Recognizer models. Layout, general document, custom, and prebuilt models use the prebuilt-read model as a basis for extracting text from documents.
18
+
The Form Recognizer v3.0 preview includes the new Read API. Read extracts printed and handwritten from documents. The read model can detect lines, words, locations, and languages and is the core of all the other Form Recognizer models. Layout, general document, custom, and prebuilt models all use the read model as a foundation for extracting texts from documents.
19
19
20
20
## Development options
21
21
22
22
The following resources are supported by Form Recognizer v3.0:
@@ -77,7 +77,7 @@ Form Recognizer preview version supports several languages for the read model. *
77
77
78
78
### Text lines and words
79
79
80
-
Read API extracts text from documents and images with multiple text angles and colors. It accepts photos of documents, faxes, printed and/or handwritten (English only) text, and mixed modes. Text is extracted with information provided on lines, words, bounding boxes, confidence scores, and style (handwritten or other).
80
+
Read API extracts text from documents and images with multiple text angles and colors. It accepts photos of documents, faxes, printed and/or handwritten (English only) text, and mixed modes. Text is extracted from data provided in lines, words, bounding boxes, confidence scores, and style.
81
81
82
82
### Language detection (v3.0 preview)
83
83
@@ -93,12 +93,17 @@ For large multi-page documents, use the `pages` query parameter to indicate spec
Copy file name to clipboardExpand all lines: articles/applied-ai-services/form-recognizer/how-to-guides/build-custom-model-v3.md
+11-11Lines changed: 11 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ manager: nitinme
7
7
ms.service: applied-ai-services
8
8
ms.subservice: forms-recognizer
9
9
ms.topic: how-to
10
-
ms.date: 02/16/2022
10
+
ms.date: 04/13/2022
11
11
ms.author: lajanuar
12
12
---
13
13
@@ -32,15 +32,15 @@ Follow these tips to further optimize your data set for training:
32
32
33
33
## Upload your training data
34
34
35
-
When you've put together the set of forms or documents that you'll use for training, you'll need to upload it to an Azure blob storage container. If you don't know how to create an Azure storage account with a container, following the [Azure Storage quickstart for Azure portal](../../../storage/blobs/storage-quickstart-blobs-portal.md). You can use the free pricing tier (F0) to try the service, and upgrade later to a paid tier for production.
35
+
Once you've put together the set of forms or documents for training, you'll need to upload it to an Azure blob storage container. If you don't know how to create an Azure storage account with a container, following the [Azure Storage quickstart for Azure portal](../../../storage/blobs/storage-quickstart-blobs-portal.md). You can use the free pricing tier (F0) to try the service, and upgrade later to a paid tier for production.
36
36
37
37
## Create a project in the Form Recognizer Studio
38
38
39
-
The Form Recognizer Studio provides and orchestrates all the API calls required to create the files required to complete your dataset and train your model.
39
+
The Form Recognizer Studio provides and orchestrates all the API calls required to complete your dataset and train your model.
40
40
41
-
1. Start by navigating to the [Form Recognizer Studio](https://formrecognizer.appliedai.azure.com/studio). If this is your first time using the Studio, you'll need to [initialize it for use](../quickstarts/try-v3-form-recognizer-studio.md). Follow the [additional prerequisite for custom projects](../quickstarts/try-v3-form-recognizer-studio.md#additional-prerequisites-for-custom-projects) to configure the Studio to access your training dataset.
41
+
1. Start by navigating to the [Form Recognizer Studio](https://formrecognizer.appliedai.azure.com/studio). The first time you use the Studio, you'll need to [initialize your subscription, resource group, and resource](../quickstarts/try-v3-form-recognizer-studio.md). Follow the [additional prerequisite for custom projects](../quickstarts/try-v3-form-recognizer-studio.md#additional-prerequisites-for-custom-projects) to configure the Studio to access your training dataset.
42
42
43
-
1. In the Studio select the **Custom models** tile, on the custom models page and select the **Create a project** button.
43
+
1. In the Studio, select the **Custom models** tile, on the custom models page and select the **Create a project** button.
44
44
45
45
:::image type="content" source="../media/how-to/studio-create-project.png" alt-text="Screenshot: Create a project in the Form Recognizer Studio.":::
46
46
@@ -53,7 +53,7 @@ The Form Recognizer Studio provides and orchestrates all the API calls required
53
53
54
54
:::image type="content" source="../media/how-to/studio-select-resource.png" alt-text="Screenshot: Select the Form Recognizer resource.":::
55
55
56
-
1. Next select the storage account where you uploaded the dataset you wish to use to train your custom model. The **Folder path** should be empty if your training documents are in the root of the container. If your documents are in a sub-folder, enter the relative path from the container root in the **Folder path** field. Once your storage account is configured, select continue.
56
+
1. Next select the storage account where you uploaded your custom model training dataset. The **Folder path** should be empty if your training documents are in the root of the container. If your documents are in a subfolder, enter the relative path from the container root in the **Folder path** field. Once your storage account is configured, select continue.
57
57
58
58
:::image type="content" source="../media/how-to/studio-select-storage.png" alt-text="Screenshot: Select the storage account.":::
59
59
@@ -71,13 +71,13 @@ You'll see the files you uploaded to storage on the left of your screen, with th
71
71
72
72
1. Enter a name for the field.
73
73
74
-
1. To assign a value to the field, simply choose a word or words in the document and select the field in either the dropdown or the field list on the right navigation bar. You'll see the labeled value below the field name in the list of fields.
74
+
1. To assign a value to the field, choose a word or words in the document and select the field in either the dropdown or the field list on the right navigation bar. You'll see the labeled value below the field name in the list of fields.
75
75
76
-
1. Repeat this process for all the fields you wish to label for your dataset
76
+
1. Repeat the process for all the fields you wish to label for your dataset.
77
77
78
-
1. Label the remaining documents in your dataset by selecting each document in the document list and selecting the text to be labeled
78
+
1. Label the remaining documents in your dataset by selecting each document and selecting the text to be labeled.
79
79
80
-
You now have all the documents in your dataset labeled. If you look at the storage account, you'll find a *.labels.json* and *.ocr.json* files that correspond to each document in your training dataset and an additional fields.json file. This is the training dataset that will be submitted to train the model.
80
+
You now have all the documents in your dataset labeled. If you look at the storage account, you'll find a *.labels.json* and *.ocr.json* files that correspond to each document in your training dataset and a new fields.json file. This training dataset will be submitted to train the model.
81
81
82
82
## Train your model
83
83
@@ -119,4 +119,4 @@ Congratulations you've trained a custom model in the Form Recognizer Studio! You
119
119
> [Learn about custom model types](../concept-custom.md)
120
120
121
121
> [!div class="nextstepaction"]
122
-
> [Learn about accuracy and confidence with custom models](../concept-accuracy-confidence.md)
122
+
> [Learn about accuracy and confidence with custom models](../concept-accuracy-confidence.md)
0 commit comments