Commit fb90997

add joe content
1 parent d656ebd commit fb90997

2 files changed

+17
-16
lines changed

articles/ai-services/content-understanding/concepts/best-practices.md

Lines changed: 17 additions & 13 deletions
@@ -19,24 +19,28 @@ Azure AI Content Understanding in an innovative Generative AI service designed t
1919

2020
When defining a schema, it's essential to provide detailed field descriptions. Clear and concise descriptions guide the model to focus on the correct information, improving the accuracy of the output.
2121

22-
###    **Example**
22+
#####   ***Example***
2323

2424
* If you want to extract the date from an invoice, in addition to naming the field `"Date"`, provide a description such as:
25-
> **"The date when the invoice was issued, typically found at the top right corner of the document."**
2625

27-
###    **Example**
26+
**"The date when the invoice was issued, typically found at the top right corner of the document."**
27+
28+
#####   ***Example***
2829

2930
* Suppose you want to extract the `"Customer Name"` from an invoice. Your description might read:
30-
> **"The name of the customer or client to whom this invoice is addressed, usually located near the billing address. It should be the name of the business or person, but not the entire mailing address."**
31+
32+
**"The name of the customer or client to whom this invoice is addressed, usually located near the billing address. It should be the name of the business or person, but not the entire mailing address."**
33+
---
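The two descriptions above could be carried in a schema roughly like the following. This is a minimal sketch only: the dictionary layout and the key names `fields`, `type`, and `description` are illustrative assumptions, not a confirmed Content Understanding schema format, and `fields_missing_descriptions` is a hypothetical helper for catching fields that were left undescribed.

```python
from __future__ import annotations

# Hypothetical schema layout: the keys "fields", "type", and "description"
# are assumptions for illustration, not the service's confirmed format.
invoice_schema = {
    "fields": {
        "Date": {
            "type": "date",
            "description": (
                "The date when the invoice was issued, typically found "
                "at the top right corner of the document."
            ),
        },
        "Customer Name": {
            "type": "string",
            "description": (
                "The name of the customer or client to whom this invoice is "
                "addressed, usually located near the billing address. It "
                "should be the name of the business or person, but not the "
                "entire mailing address."
            ),
        },
    }
}


def fields_missing_descriptions(schema: dict) -> list[str]:
    """Return the names of fields whose description is absent or blank."""
    return [
        name
        for name, spec in schema["fields"].items()
        if not spec.get("description", "").strip()
    ]


print(fields_missing_descriptions(invoice_schema))
```

Running a check like this before submitting a schema is one way to make sure no field relies on its name alone.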
3134

3235
## Correct mistakes by editing field descriptions
3336

3437
If the system's output isn't meeting expectations, the first thing to try is refining and updating the field descriptions. By clarifying the context and being more explicit about what you need, you reduce ambiguity and improve accuracy.
3538

36-
###    **Example**
39+
#####   ***Example***
40+
41+
* If the `"shipping date"` field generated inconsistent or incorrect extraction, often after a "Dispatch Date" label, update it to something more precise like:
3742

38-
* If the `"Shipping date"` field generated inconsistent or incorrect extraction, often after a "Dispatch Date" label, update it to something more precise like:
39-
> **"The date when the products were shipped, typically found below the item list. It may also be labeled something similar like Delivery Date or Dispatch Date. Dates should typically have a format like 1/23/2024 or 01-04-2025."**
43+
**"The date when the products were shipped, typically found below the item list. It may also be labeled something similar like Delivery Date or Dispatch Date. Dates should typically have a format like 1/23/2024 or 01-04-2025."**
4044

4145
* This extra context guides the model to the right location in the document.
4246

@@ -45,25 +49,25 @@ If the system's output isn't meeting expectations, the first thing to try is ref
4549

4650
When you need the system to choose from a set of predefined options (for example, document type, product category, or status), use classification fields. When there's ambiguity with the options, provide clear descriptions for each option, enabling the model to categorize the data accurately.
4751

48-
###    **Example**
52+
#####   ***Example***
4953

5054
* If you need to classify documents as either `"Invoice"`, `"Claim"`, or `"Report"`, create a classification field with these words as category names.
5155

52-
###    **Example**
56+
#####   ***Example***
5357

5458
* When processing product images, you might need to assign them to categories like `"AlcoholicDrinks"`, `"SoftDrinks"`, `"Snacks"`, and `"DairyProducts"`. Since some items can appear similar, providing precise definitions for close-call cases can help. For example:
5559

56-
* **`"Alcoholic Drinks"`**: Beverages containing alcohol, such as beer, wine, and spirits. This category excludes soft drinks or other nonalcoholic beverages.
60+
* **`"Alcoholic Drinks"`**: Beverages containing alcohol, such as beer, wine, and spirits. This category excludes soft drinks or other nonalcoholic beverages.
5761

58-
* **`"Soft Drinks"`**: Carbonated nonalcoholic beverages, such as soda and sparkling water. This category doesn't include juices or alcoholic drinks.
62+
* **`"Soft Drinks"`**: Carbonated nonalcoholic beverages, such as soda and sparkling water. This category doesn't include juices or alcoholic drinks.
5963

6064
* By clearly defining each category, you ensure that the system correctly classifies products while minimizing misclassification.
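As a rough sketch of the product-category example, the categories and their definitions might be kept together so that undescribed categories are easy to spot. The mapping layout and the helper name `categories_needing_descriptions` are assumptions made for illustration. The `Snacks` and `DairyProducts` descriptions are deliberately left empty because the text above doesn't define them.

```python
from __future__ import annotations

# Hypothetical mapping of category name to description. Only the two
# definitions given in the text are filled in; the others are left blank.
product_categories = {
    "AlcoholicDrinks": (
        "Beverages containing alcohol, such as beer, wine, and spirits. "
        "This category excludes soft drinks or other nonalcoholic beverages."
    ),
    "SoftDrinks": (
        "Carbonated nonalcoholic beverages, such as soda and sparkling "
        "water. This category doesn't include juices or alcoholic drinks."
    ),
    "Snacks": "",
    "DairyProducts": "",
}


def categories_needing_descriptions(categories: dict[str, str]) -> list[str]:
    """List categories that still lack a definition, in declaration order."""
    return [name for name, text in categories.items() if not text.strip()]


print(categories_needing_descriptions(product_categories))
# prints ['Snacks', 'DairyProducts']
```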
6165

6266
## Use confidence scores to determine when human review is needed
6367

6468
Confidence scores help you decide when to involve human reviewers. Customers can interpret confidence scores using thresholds to decide which results need more reviews, minimizing the risk of errors.
6569

66-
###    **Example**
70+
#####   ***Example***
6771

6872
* For an invoice review use case, if a key extracted field like `"TotalInvoiceAmount"` has a confidence score under **0.80**, route that document to manual review. This helps ensure that a human verifies critical fields like invoice totals or legal statements when necessary.
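The routing rule in this example can be sketched as follows. The result shape (a dictionary of field names mapping to a `value` and a `confidence`) is an assumption for illustration and might not match the service's actual response format. `needs_human_review` is a hypothetical helper, and the 0.80 threshold is taken from the example above.

```python
# Threshold from the example above; tune it for your own use case.
REVIEW_THRESHOLD = 0.80


def needs_human_review(fields, critical_fields, threshold=REVIEW_THRESHOLD):
    """Return True if any critical field is missing or below the threshold.

    `fields` is assumed to map field names to dictionaries that carry a
    "confidence" key. This shape is illustrative only.
    """
    for name in critical_fields:
        field = fields.get(name)
        # A missing field or a missing score is treated as low confidence.
        if field is None or field.get("confidence", 0.0) < threshold:
            return True
    return False


extracted = {
    "TotalInvoiceAmount": {"value": "1,250.00", "confidence": 0.72},
    "Date": {"value": "2024-01-23", "confidence": 0.97},
}

if needs_human_review(extracted, ["TotalInvoiceAmount"]):
    print("Route to manual review")
else:
    print("Accept automatically")
```

Treating a missing field as low confidence is a conservative design choice: a document that lacks a critical field is sent to a reviewer rather than accepted silently.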
6973

@@ -73,7 +77,7 @@ Confidence scores help you decide when to involve human reviewers. Customers can
7377

7478
When you're working with audio and video content, selecting a narrow set of languages for transcription can potentially reduce errors. The more languages you include, the more the system has to guess which language is being spoken, which can increase misrecognition.
7579

76-
###    **Example**
80+
#####   ***Example***
7781

7882
* If you're certain that the content only contains English and Spanish, configuring your transcription to these two languages only can improve quality. But if the content accidentally includes other languages, such configuration can actually degrade overall quality.
7983

articles/ai-services/content-understanding/toc.yml

Lines changed: 0 additions & 3 deletions
@@ -23,9 +23,6 @@ items:
2323
- name: FAQ
2424
displayName: FAQ, definition, updates, previews
2525
href: faq.yml
26-
- name: Best Practices
27-
displayName: Best Practices, analyzers, optimization, fields
28-
href: best-practices.md
2926
- name: Glossary
3027
displayName: glossary, definition, updates, previews
3128
href: glossary.md
