Skip to content

Commit 9ec8284

Browse files
committed
add paul hsu updates
1 parent c22e58f commit 9ec8284

File tree

9 files changed

+188
-141
lines changed

9 files changed

+188
-141
lines changed

articles/ai-services/content-understanding/audio/overview.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@ Content Understanding serves as a cornerstone for Media Asset Management solutio
3838

3939
* **Transcription**. Converts conversational audio into searchable and analyzable text-based transcripts in WebVTT format. Customizable fields can be generated from transcription data. Sentence-level and word-level timestamps are available upon request.
4040

41-
* **Diarization**. Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers.
41+
* **`Diarization`**. Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers.
4242

4343
* **Speaker role detection**. Identifies agent and customer roles within contact center call data.
4444

@@ -53,11 +53,11 @@ Content Understanding offers advanced audio capabilities, including:
5353

5454
* **Customizable data extraction**. Tailor the output to your specific needs by modifying the field schema, allowing for precise data generation and extraction.
5555

56-
* **Generative models**. Utilize generative AI models to specify in natural language the content you want to extract, and the service will generate the desired output.
56+
* **Generative models**. Utilize generative AI models to specify in natural language the content you want to extract, and the service generates the desired output.
5757

5858
* **Integrated pre-processing**. Benefit from built-in preprocessing steps like transcription, diarization, and role detection, providing rich context for generative models.
5959

60-
* **Scenario adaptability**. Adapt the service to your requirements by generating custom fields to extract relevant data.
60+
* **Scenario adaptability**. Adapt the service to your requirements by generating custom fields and extract relevant data.
6161

6262
## Content Understanding audio analyzer templates
6363

articles/ai-services/content-understanding/document/overview.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ ms.custom: ignite-2024-understanding-release
2121
2222
Content Understanding is a cloud-based [Azure AI Service](../../what-are-ai-services.md) designed to efficiently extract content and structured fields from documents and forms. It provides a comprehensive suite of APIs and an intuitive UX experience for optimal efficiency.
2323

24-
Using Content Understanding, organizations can streamline data collection and processing, enhancing operational efficiency, data-driven decision-making, and innovation. With customizable analyzers, Content Understanding allows for easy extraction of content or fields from documents and forms, tailored to specific business needs.
24+
Content Understanding enables organization to streamline data collection and processing, enhance operational efficiency, optimize data-driven decision making, and empower innovation. With customizable analyzers, Content Understanding allows for easy extraction of content or fields from documents and forms, tailored to specific business needs.
2525

2626
## Business use cases
2727

@@ -41,7 +41,7 @@ Content extraction enables the extraction of both printed and handwritten text f
4141

4242
### Add-on capabilities
4343

44-
Enhance your document extraction with optional add-on features, which may incur additional costs. These features can be enabled or disabled based on your needs. Currently supported add-ons include:
44+
Enhance your document extraction with optional add-on features, which can incur added costs. These features can be enabled or disabled based on your needs. Currently supported add-ons include:
4545

4646
* **Layout**: Extracts layout information such as paragraphs, sections, tables, and more.
4747
* **Barcode**: Identifies and decodes all barcodes in the documents.

articles/ai-services/content-understanding/glossary.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: laujan
66
manager: nitinme
77
ms.service: azure
88
ms.topic: conceptual
9-
ms.date: 11/5/2024
9+
ms.date: 11/19/2024
1010
ms.author: lajanuar
1111
---
1212

@@ -19,10 +19,10 @@ ms.author: lajanuar
1919
| **Analyzer** | A component that processes and extracts content and structured fields from files. Content Understanding offers a few analyzer templates for common scenarios. |
2020
| **Analyzer template** | A predefined configuration and field schema for an analyzer. It simplifies creating analyzers by allowing modifications to a template instead of starting from scratch. This feature is available only in AI Foundry, not via REST API/SDKs. |
2121
| **Analyzer result** | The output generated by an analyzer after processing input data. It typically includes extracted content in Markdown, extracted fields, and optional modality-specific details. |
22-
| **Add-ons** | Additional features that enhance content extraction results, such as layout elements, barcodes, and figures in documents. |
22+
| **Add-ons** | Added features that enhance content extraction results, such as layout elements, barcodes, and figures in documents. |
2323
| **Fields** | List of structured key-value pairs derived from the content, as defined by the field schema. [Learn more about supported field value types.](service-limits.md) |
2424
| **Field schema** | A formal description of the fields to extract from the input. It specifies the name, description, value type, generation method, and more for each field. |
2525
| **Generation method** | The process of determining the extracted value of a specified field. Content Understanding supports: <br/> &bullet; **Extract**: Directly extract values from the input content, such as dates from receipts or item details from invoices. <br/> &bullet; **Classify**: Classify content into predefined categories, such as call sentiment or chart type. <br/> &bullet; **Generate**: Generate values from input data, such as summarizing an audio conversation or generating scene descriptions from videos. |
26-
| **Span** | A reference indicating the location of an element (e.g., field, word) within the extracted Markdown content. It is represented by a character offset and length. Different programming languages use various character encodings, which may affect the exact offset and length values for Unicode text. To avoid confusion, spans are only returned if the desired encoding is explicitly specified in the request. Some elements may map to multiple spans if they are not contiguous in the markdown (e.g., page). |
26+
| **Span** | A reference indicating the location of an element (for example, field, word) within the extracted Markdown content. A character offset and length represent a span. Different programming languages use various character encodings, which can affect the exact offset and length values for Unicode text. To avoid confusion, spans are only returned if the desired encoding is explicitly specified in the request. Some elements can map to multiple spans if they aren't contiguous in the markdown (for example, page). |
2727
| **Grounding source** | The specific regions in content where a value was generated. It has different representations depending on the file type: <br>&bullet; **Image** - A polygon in the image, often an axis-aligned rectangle (bounding box). <br>&bullet; **PDF/TIFF** - A polygon on a specific page, often a quadrilateral. <br>&bullet; **Audio** - A start and end time range. <br>&bullet; **Video** - A start and end time range with an optional polygon in each frame, often a bounding box.|
2828
| **Confidence score** | The level of certainty that the extracted data is accurate. |

articles/ai-services/content-understanding/image/overview.md

Lines changed: 34 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -19,15 +19,15 @@ ms.custom: ignite-2024-understanding-release
1919
> * Features, approaches, and processes may change or have constrained capabilities, prior to General Availability (GA).
2020
> * For more information, *see* [**Supplemental Terms of Use for Microsoft Azure Previews**](https://azure.microsoft.com/support/legal/preview-supplemental-terms).
2121
22-
Content Understanding standardizes the extraction of data from images, making it easier to analyze large volumes of unstructured data. This speeds up time-to-value and simplifies integration into downstream analytical workflows. With the Content Understanding APIs, you can define schema to specify the fields, descriptions, and output types for extraction. The service then analyses the images and provides structured data, which can be applied in various use cases, such as:
22+
Azure AI Content Understanding standardizes the extraction of data from images, making it easier to analyze large volumes of unstructured data. Standardized extraction speeds up time-to-value and simplifies integration into downstream analytical workflows. With the Content Understanding APIs, you can define schema to specify the fields, descriptions, and output types for extraction. The service then analyses the images and provides structured data, which can be applied in various use cases, such as:
2323

2424
* **Retrieval-augmented generation (RAG) applications:** Extract key details from your images to build a robust index that powers user-facing chat experiences. This index enables users to ask questions and receive accurate answers based on the content of your images.
2525

2626
* **Financial analysis and business intelligence:** Analyze business performance charts and trends to generate real-time reports that help analysts, managers, and executives make faster, more informed decisions.
2727

2828
* **Manufacturing quality control:** Automate the detection of defects and anomalies, such as scratches, cracks, or misalignments, in production lines and manufacturing environments.
2929

30-
* **Shelf analysis and inventory management:** Detect, count, and extract specific details about retail products, optimizing operations and improving customer satisfaction by ensuring products are well-stocked and properly organized.
30+
* **Shelf analysis and inventory management:** Detect, count, and extract specific details about retail products, optimizing operations, and improving customer satisfaction by ensuring products are well-stocked and properly organized.
3131

3232
## Key Benefits
3333

@@ -37,7 +37,7 @@ Content Understanding offers several key benefits for extracting information fro
3737

3838
* **Improved accuracy for specific use cases:** Content Understanding enables targeted data extraction that aligns directly with your unique requirements, helping to improve model accuracy by focusing on the most important data points.
3939

40-
* **Faster and more cost-effective automation:** By extracting only the necessary fields, Content Understanding streamlines automation, allowing organizations to scale their data processing workflows efficiently and reduce the storage and processing of irrelevant data.
40+
* **Faster and more cost-effective automation:** The extracting of only the necessary fields enables Content Understanding to streamlines automation. Thus allowing organizations to scale their data processing workflows efficiently and reduce the storage and processing of irrelevant data.
4141

4242

4343
## Input requirements
@@ -51,12 +51,39 @@ For detailed information on supported input document formats, refer to our [Serv
5151
## Supported languages and regions
5252
For a detailed list of supported languages and regions, visit our [Language and region support](../language-region-support.md) page.
5353

54+
## Supported file formats
55+
56+
> [!NOTE]
57+
> For best results, image schema should only be used to process non-document-based images.
58+
> Text heavy images of documents should be processed using a document schema.
59+
> Use cases that require extraction of text from document images or scanned documents in image formats should be processed using a document field extraction schema.
60+
61+
Content Understanding supports the following image file formats in preview:
62+
63+
|Supported file types| File size| Resolution (pixels)|
64+
|---|---|---|
65+
.jpg, .png, .bmp, .heif|≤ 20 MB | Minimum: 50 x 50 px</be>Maximum: 10,000 x 10,000 px|
66+
67+
## Supported field types
68+
69+
| Data type|Supported format|Schema limits|Example|
70+
| --- | --- |---|
71+
| **String**| √ Plain Text||
72+
|**Date** | √ Normalized to ISO 8601 (YYYY-MM-DD) format|2023-10-31|
73+
| **Time**| √ Normalized to ISO 8601 (hh:mm:ss) format|14:30:00|
74+
| **number**| √ Float number normalized to double precision floating point|1.14159|
75+
| **Integer**| √ Integer number, normalized to 64-bit signed integer|42, 1024|
76+
| **Boolean**| √ Boolean value, normalized to `true` or `false`||
77+
| **array**| √ List of subfields of the same type||
78+
| **Object**| √ Named list of subfields of potentially different types. ||
79+
5480
## Data privacy and security
55-
Developers using Content Understanding should review Microsoft's policies on customer data. For more information, visit our [Data, protection, and privacy](https://www.microsoft.com/trust-center/privacy) page.
81+
82+
As with all the Azure AI services, developers using the Content Understanding service should be aware of Microsoft's policies on customer data. See our [**Data, protection and privacy**](https://www.microsoft.com/trust-center/privacy) page to learn more.
5683

5784
> [!IMPORTANT]
5885
> If you are using Microsoft products or services to process Biometric Data, you are responsible for: (i) providing notice to data subjects, including with respect to retention periods and destruction; (ii) obtaining consent from data subjects; and (iii) deleting the Biometric Data, all as appropriate and required under applicable Data Protection Requirements. "Biometric Data" will have the meaning set forth in Article 4 of the GDPR and, if applicable, equivalent terms in other data protection requirements. For related information, see [Data and Privacy for Face](/legal/cognitive-services/face/data-privacy-security).
5986
60-
## Next step
61-
* Try processing your image content using Content Understanding in [Azure AI Foundry](https://ai.azure.com/).
62-
* Learn more about image [**analyzer templates**](../quickstart/use-ai-foundry.md).
87+
## Next steps
88+
89+
Try processing your content and data using Content Understanding in the [Azure AI Studio](https://ai.azure.com/?tid=888d76fa-54b2-4ced-8ee5-aac1585adee7).

articles/ai-services/content-understanding/overview.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ Content Understanding offers a streamlined process to reason over large amounts
2727

2828
## Why process with Content Understanding?
2929

30-
* **Simplify and streamline workflows**. Content Understanding standardizes the extraction of content, structure, and insights from various content types into a unified process.
30+
* **Simplify and streamline workflows**. Azure AI Content Understanding standardizes the extraction of content, structure, and insights from various content types into a unified process.
3131

3232
* **Simplify field extraction**. Content Understanding's field extraction makes it easier to generate structured output from unstructured content. Define a schema to extract, classify, or generate field values with no complex prompt engineering
3333

@@ -60,15 +60,15 @@ See [Quickstart](quickstart/use-ai-foundry.md) for more examples.
6060
|Component|Description|
6161
|:---------|:----------|
6262
|Analyzer|The analyzer is the core component of Content Understanding. It allows customers to configure content extraction settings and field extraction schema. Once configured, the analyzer consistently applies these settings to process all incoming data.|
63-
|Content extraction|Content extraction enables users to specify the types of information to be identified and extracted from incoming content. This includes options such as OCR for text, layout analysis, barcodes, tables, and more, allowing users to focus on the most relevant content elements.|
64-
|Add-ons| Content Understanding add-ons enhance content extraction by incorporating additional elements like barcodes, tables, and detected faces.|
65-
|Field extraction|Field extraction allows users to define the structure and schema of the desired fields to extract from input files. See [service limits](service-limits.md) for a complete list of field types supported. Fields can be generated via one of the following methods:</br></br>&bullet; **Extract**: Directly extract values as they appear in the input content, such as dates from receipts or item details from invoices.</br></br>&bullet; **Classify**: Classify content from a predefined set of categories, such as call sentiment or chart type.</br></br>&bullet; **Generate**: Generate values freely from input data, such as summarizing an audio conversation or creating scene descriptions from videos.|
66-
|Grounding source| Content Understanding identifies the specific regions in the content where the value was generated from. This allows users in automation scenarios to quickly verify the correctness of the field values, leading to higher confidence in the extracted data. |
63+
|Content extraction|Content extraction enables users to specify the types of information to be identified and extracted from incoming content. User-specified information includes options such as `OCR` for text, layout analysis, barcodes, tables, and more, allowing users to focus on the most relevant content elements.|
64+
|Add-ons| Content Understanding add-ons enhance content extraction by incorporating added elements like barcodes, tables, and detected faces.|
65+
|Field extraction|Field extraction allows users to define the structure and schema of the desired fields to extract from input files. See [service limits](service-limits.md) for a complete list of field types supported. Fields can be generated via one of the following methods:</br></br> &bullet; **Extract**: Directly extract values as they appear in the input content, such as dates from receipts or item details from invoices.</br></br>&bullet; **Classify**: Classify content from a predefined set of categories, such as call sentiment or chart type.</br></br>&bullet; **Generate**: Generate values freely from input data, such as summarizing an audio conversation or creating scene descriptions from videos.|
66+
|Grounding source| Content Understanding identifies the specific regions in the content where the value was generated from. Source grounding allows users in automation scenarios to quickly verify the correctness of the field values, leading to higher confidence in the extracted data. |
6767
|Confidence score | Content Understanding provides confidence scores from 0 to 1 to estimate the reliability of the results. High scores indicate accurate data extraction, enabling straight-through processing in automation workflows.|
6868

6969

7070
## Responsible AI
71-
Azure AI Content Understanding is designed to guard against processing harmful content. For more information, *see* our [**Transparency Note**]() and our [**Code of Conduct**](/legal/cognitive-services/openai/code-of-conduct).
71+
Azure AI Content Understanding is designed to guard against processing harmful content. For more information, *see* our **Transparency Note** and our [**Code of Conduct**](/legal/cognitive-services/openai/code-of-conduct).
7272

7373
## Data privacy and security
7474
Developers using the Content Understanding service should review Microsoft's policies on customer data. For more information, visit our [**Data, protection and privacy**](https://www.microsoft.com/trust-center/privacy) page.

0 commit comments

Comments
 (0)