Skip to content

Commit e4a40d3

Browse files
committed
add joe content
1 parent c7c9a37 commit e4a40d3

File tree

1 file changed

+3
-7
lines changed

1 file changed

+3
-7
lines changed

articles/ai-services/content-understanding/concepts/best-practices.md

Lines changed: 3 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
---
1+
22
title: Best practices for using Content Understanding
33
titleSuffix: Azure AI services
44
description: Learn how to best use Azure AI Content Understanding for document, image, video, and audio file content and field extractions.
@@ -8,16 +8,14 @@ manager: nitinme
88
ms.service: azure-ai-content-understanding
99
ms.topic: overview
1010
ms.date: 02/24/2025
11-
---
11+
1212

1313
# Best Practices for Content Understanding
1414

1515
Azure AI Content Understanding is an innovative Generative AI service designed to facilitate the precise and accurate analysis of extensive data sets. The service processes various content modalities, including documents, images, videos, and audio, transforming them into user-specified output formats.
1616

1717
This document provides guidance and best practices to effectively utilize Content Understanding for your data processing and analysis requirements.
1818

19-
---
20-
2119
## Use field descriptions to guide output
2220

2321
When defining a schema, it's essential to provide detailed field descriptions. Clear and concise descriptions guide the model to focus on the correct information, improving the accuracy of the output.
@@ -34,8 +32,7 @@ When defining a schema, it's essential to provide detailed field descriptions. C
3432

3533
* Suppose you want to extract the `Customer Name` from an invoice. Your description might read:
3634

37-
> `The name of the customer or client to whom this invoice is addressed, usually located near the billing address. It should be the name of the business or person, but not the entire mailing address.`
38-
---
35+
> `The name of the customer or client to whom this invoice is addressed, usually located near the billing address. It should be the name of the business or person, but not the entire mailing address.`
3936
4037
## Correct mistakes by editing field descriptions
4138

@@ -86,7 +83,6 @@ When you're working with audio and video content, selecting a narrow set of lang
8683

8784
* If you're certain that the content only contains English and Spanish, configuring your transcription to only these two languages can improve quality. But if the content accidentally includes other languages, such configuration can actually degrade overall quality.
8885

89-
9086
## Transcript, document text, and speaker data don't require fields
9187

9288
By default, Content Extraction information such as speech transcripts, document text extracted by `OCR`, and video key frames can be accessed directly from the analyzer output for immediate review or custom processing. There's no need to define a field in the schema for these items. Fields can be used when more processing is needed, for example, summarizing transcripts, identifying entities, or extracting specific items from `OCR`. Each field can instruct the system to extract or generate the content you need.

0 commit comments

Comments
 (0)