Skip to content

Commit 8521437

Browse files
committed
edit for pub
1 parent a6a1038 commit 8521437

File tree

5 files changed

+15
-22
lines changed

5 files changed

+15
-22
lines changed

articles/applied-ai-services/form-recognizer/concept-custom-label-tips.md

Lines changed: 4 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -17,17 +17,13 @@ recommendations: false
1717

1818
This article highlights the best methods for labeling custom model datasets in the Form Recognizer Studio. Labeling documents can be time consuming when you have a large number of labels, long documents, or documents with varying structure. These tips should help you label documents more efficiently.
1919

20-
## Video: Custom labels best practices (part two)
20+
## Video: Custom labels best practices
2121

22-
The following is the second of two videos intended to help you build custom
23-
models with higher accuracy.
22+
* The following video is the second of two presentations intended to help you build custom models with higher accuracy (the first presentation explores [How to create a balanced data set](concept-custom-label.md#video-custom-label-tips-and-pointers)).
2423

25-
* This video, explores the best practices for labeling your selected documents. With consistent labeling and following these best practices, you
26-
should see an improvement in model performance.
24+
* Here, we'll examine best practices for labeling your selected documents. With consistent labeling and following these suggestions, you should see an improvement in model performance.
2725

28-
> [!VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RE5fZKB ]
29-
30-
* The first video (part one) explores [How to create a balanced data set](concept-custom-label.md#video-custom-label-tips-and-pointers-part-one).
26+
> [!VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RE5fZKB ]
3127
3228
## Search
3329

articles/applied-ai-services/form-recognizer/concept-custom-label.md

Lines changed: 5 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -32,24 +32,21 @@ A labeled dataset consists of several files:
3232

3333
* A `{file}.labels.json` file is created or updated when a field is labeled in a document. The label file contains the spans of text and associated polygons from the layout output for each span of text the user adds as a value for a specific field.
3434

35-
## Video: Custom label tips and pointers (part one)
35+
## Video: Custom label tips and pointers
3636

37-
The following is the first of two videos intended to help you build custom
38-
models with higher accuracy.
37+
* The following video is the first of two presentations intended to help you build custom models with higher accuracy (The second presentation examines [Best practices for labeling documents](concept-custom-label-tips.md#video-custom-labels-best-practices)).
3938

40-
* This video (part one), explores how to create a balanced data set and select the right documents to label. The process will set you on the path to a higher quality model.
39+
* Here, we'll explore how to create a balanced data set and select the right documents to label. This process will set you on the path to higher quality models.
4140

42-
> [!VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RWWHru]
43-
44-
* The second video (part two), presents the [Best practices for labeling documents](concept-custom-label-tips.md#video-custom-labels-best-practices).
41+
> [!VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RWWHru]
4542
4643
## Create a balanced dataset
4744

4845
Before you start labeling, it's a good idea to look at a few different samples of the document to identify which samples you want to use in your labeled dataset. A balanced dataset represents all the typical variations you would expect to see for the document. Creating a balanced dataset will result in a model with the highest possible accuracy. A few examples to consider are:
4946

5047
* **Document formats**: If you expect to analyze both digital and scanned documents, add a few examples of each type to the training dataset
5148

52-
* **Variations (template model)**: Consider splitting the dataset into folders and train a model for each of variation. Variations that include either structure or layout should be split into different models. You can then compose the individual models into a single [composed model](concept-composed-models.md).
49+
* **Variations (template model)**: Consider splitting the dataset into folders and train a model for each of variation. Any variations that include either structure or layout should be split into different models. You can then compose the individual models into a single [composed model](concept-composed-models.md).
5350

5451
* **Variations (Neural models)**: When your dataset has a manageable set of variations, about 15 or fewer, create a single dataset with a few samples of each of the different variations to train a single model. If the number of template variations is larger than 15, you'll train multiple models and [compose](concept-composed-models.md) them together.
5552

articles/applied-ai-services/form-recognizer/how-to-guides/build-a-custom-model.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -48,9 +48,9 @@ Once you've put together the set of forms or documents for training, you'll need
4848

4949
## Video: Train your custom model
5050

51-
Once you've gathered and uploaded your training dataset, you're ready to train your custom model. In the following video, we'll create a project and explore come of the fundamentals for labeling and training a model:
51+
Once you've gathered and uploaded your training dataset, you're ready to train your custom model. In the following video, we'll create a project and explore some of the fundamentals for labeling and training a model:
5252

53-
> [VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RE5fX1c]
53+
> [VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RE5fX1c]
5454
5555
## Create a project in the Form Recognizer Studio
5656

articles/applied-ai-services/form-recognizer/overview.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,9 +34,9 @@ Azure Form Recognizer is a cloud-based [Azure Applied AI Service](../../applied-
3434

3535
## Video: Form Recognizer models
3636

37-
The following video introduces the Form Recognizer models and their associated output to help you choose which one is best to address your document scenario needs.
37+
The following video introduces Form Recognizer models and their associated output to help you choose which one is best to address your document scenario needs.</br></br>
3838

39-
> [!VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RE5fX1b]
39+
> [!VIDEO https://www.microsoft.com/en-us/videoplayer/embed/RE5fX1b]
4040
4141
## Which Form Recognizer model should I use?
4242

articles/applied-ai-services/form-recognizer/toc.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -171,10 +171,10 @@ items:
171171
- name: Custom neural model
172172
displayName: selection, structure, tables, tabular, train, template, neural, build mode, signatures, custom
173173
href: concept-custom-neural.md
174-
- name: Custom labels and datasets
174+
- name: Create labeled datasets
175175
displayName: structure, selection, labels, tables, tabular, train
176176
href: concept-custom-label.md
177-
- name: Custom label tips and pointers
177+
- name: Labeling tips and pointers
178178
displayName: structure, selection, labels, tables, tabular, train, tips
179179
href: concept-custom-label-tips.md
180180
- name: Composed models

0 commit comments

Comments
 (0)