Skip to content

Commit 6126a66

Browse files
Merge pull request #4712 from laujan/jp-4661-4662-4667-4666-pr-updates
Jp 4661 4662 4667 4666 pr updates
2 parents 33974f1 + 7f9edc8 commit 6126a66

File tree

10 files changed

+126
-8
lines changed

10 files changed

+126
-8
lines changed

articles/ai-services/content-understanding/concepts/analyzer-templates.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,6 @@ manager: nitinme
88
ms.service: azure-ai-content-understanding
99
ms.topic: overview
1010
ms.date: 05/19/2025
11-
ms.custom: ignite-2024-understanding-release
1211
---
1312

1413
# Analyzer templates offered with Content Understanding

articles/ai-services/content-understanding/concepts/best-practices.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -87,4 +87,12 @@ When you're working with audio and video content, selecting a narrow set of lang
8787

8888
By default, Content Extraction information such as speech transcripts, document text extracted by `OCR`, and video key frames can be accessed directly from the analyzer output for immediate review or custom processing. There's no need to define a field in the schema for these items. Fields can be used when more processing is needed, for example, summarizing transcripts, identifying entities, or extracting specific items from `OCR`. Each field can instruct the system to extract or generate the content you need.
8989

90+
## Classifier category names and descriptions
91+
92+
To improve the classifier and splitting accuracy, it's important to give a good category name and description with context. For example:
93+
94+
* Common titles for category names (ex. Annual Financial Report, SEC Form 10-K)
95+
* Semantic definition of the category for descriptions (ex. receipts for expense reporting)
96+
* Common layout of the initial page in the description (two-column form)
97+
* Key content that uniquely identifies a category as a description ("2025" on the upper right)
9098

Lines changed: 94 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,94 @@
1+
---
2+
title: Azure AI Content Understanding classifier overview
3+
titleSuffix: Azure AI services
4+
description: Learn about Azure AI Content Understanding classifier solutions.
5+
author: laujan
6+
ms.author: lajanuar
7+
manager: nitinme
8+
ms.service: azure-ai-content-understanding
9+
ms.topic: overview
10+
ms.date: 05/19/2025
11+
---
12+
13+
# Content Understanding classifier
14+
15+
> [!IMPORTANT]
16+
>
17+
> * Classifier is only available for documents with the `2025-05-01-preview` release.
18+
> * Azure AI Content Understanding classifier is available in `2025-05-01-preview` release. Public preview releases provide early access to features that are in active development.
19+
> * Features, approaches, and processes can change or have limited capabilities, before General Availability (GA).
20+
> * For more information, *see* [**Supplemental Terms of Use for Microsoft Azure Previews**](https://azure.microsoft.com/support/legal/preview-supplemental-terms).
21+
22+
Azure AI Content Understanding classifier enables you to detect and identify documents you process within your application. Content Understanding classifier performs classification of an input file one page at a time to identify the documents within and can also identify multiple documents or multiple instances of a single document within an input file.
23+
24+
## Business use cases
25+
26+
Classifier can process complex documents in various formats and templates:
27+
28+
* **Invoices**: Categorize invoices from multiple vendors to process each category with a different Content Understanding analyzer if needed.
29+
* **Tax documents**: Categorize multiple tax documents into different types of tax forms such as 1040, 1099, etc.
30+
* **Contracts**: Long, unstructured contracts can now be categorized to streamline operations to understand different types of agreements and their specific legal implications.
31+
32+
33+
## Content Understanding classifier capabilities
34+
35+
Content Understanding classifier can analyze a single- or multi-file documents to identify if an input file can be classified into a category as defined. Here are the currently supported scenarios:
36+
37+
* A single file containing one document type, such as a loan application form.
38+
* A single file containing multiple document types. For instance, a loan application package that contains a loan application form, payslip, and bank statement.
39+
* A single file containing multiple instances of the same document. For instance, a collection of scanned invoices.
40+
41+
### How to use Content Understanding classifier
42+
43+
Content Understanding classifier doesn't require any training dataset. Define up to 50 category name and description and create a classifier. By default, the entire file is treated as a single content object, meaning the file/object is associated to a single category.
44+
45+
However, when you have more than one document in a file, the classifier can identify the different document types contained within the input file with splitting capability. The classifier response contains the page ranges for each of the identified document types contained within a file. This response can include multiple instances of the same document type.
46+
47+
When you call the classifier, the `analyze` operation includes a `splitMode` property that gives you granular control over the splitting behavior.
48+
49+
* To treat the entire input file as a single document for classification set the `splitMode` to `none`. When you do so, the service returns just one category for the entire input file.
50+
* To classify each page of the input file, set the `splitMode` to `perPage`. The service attempts to classify each page as an individual document.
51+
* Set the `splitMode` to `auto` and the service identifies the documents and associated page ranges.
52+
53+
### Optional analysis
54+
55+
For a complete end to end flow, you may link classifier categories with existing analyzers. For each content object classified to categories with linked analyzers, the service automatically invokes analysis on the content object using the corresponding analyzer. As an example, this linking can be used to create classifiers that identify and analyze only invoices from a PDF that may contain multiple types of forms in a document.
56+
57+
* Set the `analyzerId` to an existing analyzer to route and perform field extraction from the classified documents or pages.
58+
59+
### Classifier limits
60+
61+
* Classifier requires at least one distinct category to be defined. Response contains the page ranges for each of the categories of documents identified.
62+
63+
* The maximum allowed number of categories is 50.
64+
65+
* The maximum length of input file is 300 pages.
66+
67+
* For each category name and description, there's a limit of 120 characters combined.
68+
69+
* By default, there's an `$other` class as well, which we utilize to categorize the pages into for cases where any of the defined categories doesn't seem suitable.
70+
71+
Classifier categorizes each page of the input document, unless specified, to one of the defined categories. You can specify the page numbers to analyze in the input document as well.
72+
73+
For detailed information on supported input document formats, refer to our [Service quotas and limits](../service-limits.md) page.
74+
75+
76+
### Best practices
77+
78+
To improve classification and splitting quality, it's important to give a good category name and description so the model can understand the categories with some context. For more information on category names and descriptions, *see* [Best practices](../concepts/best-practices.md#classifier-category-names-and-descriptions).
79+
80+
## Key benefits
81+
82+
* **Accuracy and reliability:** Ensure precise document classification, reducing errors and boosting efficiency.
83+
* **Scalability:** Seamlessly scale out document processing to meet business demands.
84+
* **Customizable:** Adapt document classifier to fit specific workflows.
85+
86+
## Supported languages and regions
87+
For a detailed list of supported languages and regions, visit our [Language and region support](../language-region-support.md) page.
88+
89+
## Data privacy and security
90+
Developers using Content Understanding should review Microsoft's policies on customer data. For more information, visit our [Data, protection, and privacy](https://www.microsoft.com/trust-center/privacy) page.
91+
92+
## Next step
93+
* Try processing your document content using Content Understanding in [Azure AI Foundry](https://aka.ms/cu-landing).
94+
* Learn to analyze document content [**analyzer templates**](../quickstart/use-ai-foundry.md).

articles/ai-services/content-understanding/document/overview.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,6 @@ manager: nitinme
88
ms.service: azure-ai-content-understanding
99
ms.topic: overview
1010
ms.date: 05/19/2025
11-
ms.custom: ignite-2024-understanding-release
1211
---
1312

1413
# Content Understanding document solutions (preview)

articles/ai-services/content-understanding/glossary.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -26,3 +26,4 @@ ms.author: lajanuar
2626
| **Span** | A reference indicating the location of an element (for example, field, word) within the extracted Markdown content. A character offset and length represent a span. Different programming languages use various character encodings, which can affect the exact offset and length values for Unicode text. To avoid confusion, spans are only returned if the desired encoding is explicitly specified in the request. Some elements can map to multiple spans if they aren't contiguous in the markdown (for example, page). |
2727
| **Grounding source** | The specific regions in content where a value was generated. It has different representations depending on the file type: <br>&bullet; **Image** - A polygon in the image, often an axis-aligned rectangle (bounding box). <br>&bullet; **PDF/TIFF** - A polygon on a specific page, often a quadrilateral. <br>&bullet; **Audio** - A start and end time range. <br>&bullet; **Video** - A start and end time range with an optional polygon in each frame, often a bounding box.|
2828
| **Confidence score** | The level of certainty that the extracted data is accurate. |
29+
| **Category** | A distinct class within a classifier used to group similar input files based on shared characteristics or features. |

articles/ai-services/content-understanding/image/overview.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,6 @@ manager: nitinme
88
ms.service: azure-ai-content-understanding
99
ms.topic: how-to
1010
ms.date: 05/19/2025
11-
ms.custom: ignite-2024-understanding-release
1211
---
1312

1413
# Content Understanding image solutions (preview)

articles/ai-services/content-understanding/overview.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,6 @@ manager: nitinme
88
ms.service: azure-ai-content-understanding
99
ms.topic: overview
1010
ms.date: 05/19/2025
11-
ms.custom: ignite-2024-understanding-release
1211

1312
#customer intent: As a user, I want to learn more about Content Understanding solutions.
1413
---

articles/ai-services/content-understanding/quickstart/use-ai-foundry.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,6 @@ manager: nitinme
77
ms.service: azure-ai-content-understanding
88
ms.topic: quickstart
99
ms.date: 05/19/2025
10-
ms.custom: ignite-2024-understanding-release
1110
---
1211

1312
# Use Content Understanding in the Azure AI Foundry

articles/ai-services/content-understanding/service-limits.md

Lines changed: 20 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,6 @@ manager: nitinme
77
ms.service: azure-ai-content-understanding
88
ms.topic: conceptual
99
ms.date: 05/19/2025
10-
ms.custom: ignite-2024-understanding-release
1110
ms.author: lajanuar
1211
---
1312

@@ -89,15 +88,33 @@ The following limits apply as of version 2024-12-01-preview.
8988

9089
### Classification fields
9190

91+
> [!NOTE]
92+
> This classification field is the one within the extraction capability and not the separate [Content Understanding classifier](concepts/classifiers.md) itself.
93+
9294
Classification fields can be defined to return either a single category (single-label classification) or multiple categories (multi-label classification).
9395

9496
* **Single-label classification**: Defined using a string field with the `classify` method. It can be a top-level basic field or a subfield within a group or table.
9597
* **Multi-label classification**: Represented as a list of string fields with the `classify` method. In the [REST API](/rest/api/contentunderstanding/operation-groups?view=rest-contentunderstanding-2024-12-01-preview&preserve-view=true), `method=classify` and `enum` are specified on the inner string field and can only be a top-level field.
9698

97-
*Note: Document analyzers currently don't support classification fields.*
98-
9999

100100
## Training limits
101101
| File type| Max training data |
102102
| ---| --- |
103103
| Document | 1 GB total<br>50k pages/images |
104+
105+
## Classifier limits
106+
107+
The following limits apply as of version 2025-05-01-preview.
108+
109+
### Input File Limits (Documents only)
110+
111+
| Supported File Types | File Size | Length |
112+
| --- | --- | --- |
113+
|`.pdf`<br> ✓ `.tiff`<br> ✓ `.jpg`<br> ✓ `.png`<br> ✓ `.bmp`<br> ✓ `.heif` | ≤ 200 MB | ≤ 300 pages |
114+
|`.txt` | ≤ 1 MB | ≤ 1M characters |
115+
116+
### Category Limits
117+
118+
* **Category Name and Description**: Limit of total 120 characters for each category name and description combined.
119+
* **Category Name**: Category name can't start with `$`.
120+
* **Number of categories**: Minimum 1 to maximum 50 categories per classifier.

articles/ai-services/content-understanding/toc.yml

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -65,6 +65,9 @@ items:
6565
- name: Accuracy and confidence
6666
displayName: accuracy, confidence, analyzers, optimization, fields, scores
6767
href: concepts/accuracy-confidence.md
68+
- name: 🆕 Classifiers
69+
displayName: classifiers, text, images, video, audio, multimodal, visual, structured, content, field, extraction
70+
href: concepts/classifiers.md
6871
- name: Retrieval-augmented generation (RAG)
6972
displayName: RAG, retrieval, augmented, generation, knowledge, base, search, index, vector
7073
href: concepts/retrieval-augmented-generation.md

0 commit comments

Comments
 (0)