Merge pull request #4989 from laujan/paul-4986-prebuilt-analyzers

Stacyrch140 · web-flow · commit 5719dd7a5d38 · 2025-05-16T20:20:10.000-04:00
Paul 4986 prebuilt analyzers
diff --git a/articles/ai-services/content-understanding/concepts/prebuilt-analyzers.md b/articles/ai-services/content-understanding/concepts/prebuilt-analyzers.md
@@ -12,92 +12,51 @@ ms.date: 05/19/2025
 
 # Prebuilt analyzers in Azure AI Content Understanding
 
-Azure AI Content Understanding prebuilt analyzers are ready-to-use solutions designed to streamline standard content processing tasks such as document ingestion, search indexing, and retrieval-augmented generation (`RAG`). Analyzers extract structured insights from unstructured content, including documents, images, audio, and video files. They also allow users to define custom settings for content extraction and specify field extraction schemas. Once configured, an analyzer applies these settings consistently to process all incoming data in a systematically.
+Azure AI Content Understanding prebuilt analyzers are ready-to-use tools designed to streamline common content processing tasks. They support scenarios such as content ingestion for search and retrieval-augmented generation (RAG) workflows, and intelligent document processing (IDP) for extracting data from invoices or analyzing call center recordings. You can also [customize these analyzers](../tutorial/create-custom-analyzer.md) to extract more fields or refine outputs to better fit your specific workflow requirements.
 
-Analyzers enhance trial processes, offering streamlined experiences and the flexibility to be tailored by extending their functionalities to suit unique workflow needs. Key features include:
+## Prebuilt analyzers for content ingestion
 
-* **[Content parsers](#content-parsers-for-search-and-ingestion)** for general search and ingestion scenarios.
-* **[Scenario-specific predefined analyzers](#scenario-specific-predefined-analyzers)** for targeted use cases like invoices or call center transcripts.
-* **[Inheritance from prebuilt analyzers](#inheritance-and-customizing-prebuilt-analyzers)** to customize configuration and fields.
+Azure AI Content Understanding offers prebuilt analyzers that extract raw content with layout as Markdown and perform essential semantic analysis, simplifying common content ingestion tasks. These capabilities improve retrieval quality for downstream applications such as retrieval-augmented generation (RAG).
 
-## Content parsers for search and ingestion
+##### `prebuilt-documentAnalyzer`
 
-To streamline common content ingestion scenarios, Azure AI Content Understanding offers general purpose **prebuilt content analyzers**. These analyzers extract text, layout, and metadata from various content types.
+* Extracts text and layout details from documents and images.
+* Produces a concise summary of the document content.
 
+##### `prebuilt-imageAnalyzer`
 
-| Analyzer                  | Description                                                                 | Supported File Types |
-|:-------------------------|:-----------------------------------------------------------------------------|:--------------------|
-| `prebuilt-documentAnalyzer` | Extracts text, layout, and metadata using `OCR` for images and rendered files. Users can customize prebuilt content analyzers to modify configuration and add/remove fields. | `.pdf`, `.tiff`, `image`, `.docx`, `.rtf`, `.html`, `.md`, `.json`, `.xml`, `.csv`, `.tsv`, and `.txt` |
-| `prebuilt-imageAnalyzer`    | Generates a descriptive caption of an image and `OCR` is conceptually disabled. Users refine the description and/or add new fields by creating analyzer with baseAnalyzerId=prebuilt-imageAnalyzer.  | image                |
-| `prebuilt-audioAnalyzer`    | Produces a transcript, speaker diarization, and a summary for audio files. Users can add new fields by creating analyzer with baseAnalyzerId=prebuilt-audioAnalyzer.  | audio                |
-| `prebuilt-videoAnalyzer`    | Extracts keyframes, transcript, and video segmentation. Segmentation is enabled by default. Users can disable/customize segmentation by creating an analyzer with baseAnalyzerId=prebuilt-videoAnalyzer and changing segmentationMode property.                | video                |
+* Generates a descriptive caption for the image.
 
-Analyzers are optimized for `RAG` ingestion and search workflows, offering default behaviors suitable for indexing and summarizing large volumes of content.
+##### `prebuilt-audioAnalyzer`
 
-> [!NOTE]
->
-> * Currently, `OCR` is supported for `.pdf` and `.tiff` image files. Content elements from such files include span properties and bounding boxes via their source properties.
-> * For unsupported files, contents are extracted digitally. Content elements from these files include span properties to indicate their position in the returned markdown.
-> * There are no prebuilt models for `agentic` mode. Instead, users can create an analyzer with mode=pro starting from any document base analyzer to test out `agentic` behavior.
+* Extracts transcripts from audio files.
+* Performs speaker diarization to distinguish among different speakers.
+* Provides a summary of the audio content.
 
-## Scenario-specific predefined analyzers
+##### `prebuilt-videoAnalyzer`
 
-In addition to general content analyzers, Azure AI Content Understanding provides **prebuilt analyzers for specific business scenarios**  to target common scenarios. They can be further customized by setting them as the `baseAnalyzerId`:
+* Extracts transcripts from video files.
+* Identifies keyframes and camera shots.
+* Segments the video into meaningful sections.
+* Generates a summary for each video segment.
 
-| Analyzer             | Description                                                     | Supported File Types |
-|:--------------------|:----------------------------------------------------------------|:--------------------|
-| `prebuilt-callCenter` | Extracts summary, sentiment, topics, and insights from call center transcripts. | audio |
-| `prebuilt-invoice`    | Extracts structured fields such as InvoiceId, Date, and Vendor from invoices. | `.pdf`, `.tiff`, and `image` files.|
 
-These analyzers bundle best practices and hidden configurations to deliver accurate extractions for their intended use cases while simplifying deployment by abstracting internal implementation details.
+## Prebuilt analyzers for intelligent document processing
 
+Content Understanding also includes prebuilt analyzers designed for specialized industry scenarios, enabling extraction of structured data from invoices and analysis of call center transcripts.
 
-## Inheritance and customizing prebuilt analyzers
+##### `prebuilt-invoice`
 
-With the **`2025-05-01-preview`**, any prebuilt analyzer can be inherited using `baseAnalyzerId` to create a custom analyzer. Inheritance allows for modification of existing fields, descriptions, types, and methods. Additionally, configuration settings such as `enableFormula`, `segmentationMode`, and others can be customized.
+* Extracts text and document layout as Markdown from documents and images.
+* Extracts structured data from invoices, including invoice number, date, vendor, total amount, and line items. Supports various invoice formats and languages, enabling automated data capture for accounts payable processes and related scenarios.
 
-***Example***
+##### `prebuilt-callCenter`
 
-
-### Inherit from prebuilt document analyzer
-
-```json
-{
-  "baseAnalyzerId": "prebuilt-documentAnalyzer",
-  "fields": [
-    { "name": "InvoiceId", "type": "string", "method": "regex" },
-    { "name": "TotalAmount", "type": "currency", "method": "extractive" }
-  ],
-  "configuration": {
-    "enableFormula": true,
-    "tableFormat": "markdown"
-  }
-}
-```
-
-> [!IMPORTANT]
-> With the `2025-05-01-preview`, modifying a field description overwrites the internal refined description, potentially reducing extraction quality.
-> The `baseAnalyzerId` must be a prebuilt analyzer. Custom analyzers can't currently inherit from other custom analyzers.
-
-## Analyzer details and configurations
-
-* **Document Analyzer**: Uses `OCR` for `.pdf`,`.tiff`, and `image` files.
-* **Image Analyzer**: Doesn't use `OCR` but generates image descriptions.
-* **Audio Analyzer**: Returns transcript and summary extraction.
-* **Video Analyzer**: Returns keyframes, transcript, and segmentation.
-* **Call Center Analyzer**: Summarizes and extracts insights from audio. Supports audio text.
-* **Invoice Analyzer**: Returns structured field extraction from invoices. Supports `.pdf`, `.tiff`, and `image` files.
-
-
-## Billing and limits
-
-* **Documents**: Billing is calculated per page, slide, or sheet. For`.docx`, `.rtf`, `.html`, `.md`, `.msg`, `.eml`, `.json`, `.xml`, `.csv`, `.tsv`, and `.txt`, we count every 3k `UTF16 `characters as a page. Field extraction has a `fixed-per-1k` page rate
-* **Images**: There's no cost for image content extraction, however, generating a description invokes image field extraction charges.
-* **Audio/Video**: Billing is calculated on a per hour basis with 1-minute granularity. Charges are calculated for both audio/video content extraction and field extraction.
-* Maximum field limit: Currently there are 90 user-defined fields with 100 total to include reserved fields.
+* Extracts transcripts from audio files.
+* Distinguishes between speakers and assigns them to customer or agent roles.
+* Analyzes call center transcripts to generate summaries, determine customer sentiment, identify discussion topics, and more.
 
 ## Next steps
 
-* [Analyzer templates](analyzer-templates.md)
-
-
+* [Try out prebuilt analyzers using the REST API](../quickstart/use-rest-api.md).
+* [Customize prebuilt analyzers](../tutorial/create-custom-analyzer.md).