You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: notebooks/analyzer_training.ipynb
+37-7Lines changed: 37 additions & 7 deletions
Original file line number
Diff line number
Diff line change
@@ -15,7 +15,7 @@
15
15
"\n",
16
16
"Labeled data is a group of samples that have been tagged with one or more labels to add context or meaning, which is used to improve analyzer's performance.\n",
17
17
"\n",
18
-
"Please go to [Azure AI Foundry]() to use the labling tool to annotate your data.\n",
18
+
"In your own project, you will use [Azure AI Foundry](https://learn.microsoft.com/en-us/azure/ai-services/content-understanding/quickstart/use-ai-foundry) to use the labeling tool to annotate your data.\n",
19
19
"\n",
20
20
"In this notebook we will demonstrate after you have the labeled data, how to create analyzer with them and analyze your files.\n",
21
21
"\n",
@@ -42,8 +42,13 @@
42
42
"cell_type": "markdown",
43
43
"metadata": {},
44
44
"source": [
45
-
"## Analyzer template\n",
46
-
"In this sample we define a template for purchase order. We labeled the fields in the training data."
45
+
"## Analyzer template and local training folder set up\n",
46
+
"In this sample we define a template for receipts.\n",
47
+
"\n",
48
+
"The training folder should contain a flat (one-level) directory of labeled receipt documents. Each document includes:\n",
49
+
"- The original file (e.g., PDF or image).\n",
50
+
"- A corresponding labels.json file with labeled fields.\n",
51
+
"- A corresponding result.json file with OCR results."
"Before creating the analyzer, you should fill in the constant ANALYZER_ID with a relevant name to your task. Here, we generate a unique suffix so this cell can be run multiple times to create different analyzers.\n",
115
145
"\n",
116
-
"We use **TRAINING_DATA_SAS_URL** and **TRAINING_DATA_PATH** that's set in the Prerequisites step."
146
+
"We use **TRAINING_DATA_SAS_URL** and **TRAINING_DATA_PATH** that's set up in the `.env` file and used in the previous step."
0 commit comments