Merge pull request #1317 from laujan/5-content-understanding

denrea · web-flow · commit a3ddeec9e531 · 2024-11-06T09:44:34.000-08:00
5 content understanding
diff --git a/articles/ai-services/content-understanding/faq.yml b/articles/ai-services/content-understanding/faq.yml
@@ -11,14 +11,74 @@ metadata:
 title: Frequently asked questions
 summary: |
 
- Azure AI Content Understanding is a cloud-based data solution designed to process both structured and unstructured content across various modalities, including documents, images, videos, and audio.
-
+ Find answers to commonly asked questions about Azure AI Content Understanding
 sections:
   - name: Overview
     questions:
       - question: |
-         Can I continue to use Document Intelligence v4.0 capabilities?
+         What is Content Understanding
+        answer: |
+          Content Understanding is a new Azure AI Service designed to generate structured insights from unstructured content using artificial intelligence. It provides consistent experience to extract content or a structured schema from audio, video, images, documents, or text inputs.
+      - question: |
+         How does Content Understanding work?
+        answer: |
+          Content Understanding utilizes Generative AI models to analyze and interpret various forms of unstructured content. It integrates data from different modalities (for example, text, images, audio) to generate a cohesive and structured output. The service uses machine learning models trained on diverse datasets and generative AI models to ensure high accuracy and relevance in the insights provided.
+      - question: |
+         What types of unstructured content can Content Understanding process?
+        answer: |
+          Content Understanding can process a wide range of unstructured content, including but not limited to:
+          * Audio recordings
+          * Video content
+          * Documents
+          * Text content
+          * Images
+      - question: |
+         What are the key benefits of using Content Understanding?
+        answer: |
+          The key benefits of using Content Understanding include:
+          * Confidence scores: Ensure the accuracy of extracted values while minimizing the cost of human review.
+          * Defined schema: Define a schema to ensure the extracted values align with intended use.
+          * Quality improvements over time: The service provides capabilities to improve the quality of the schema extracted.
+          * Improved decision-making: Structured insights help organizations make informed decisions quickly and effectively.
+          * Increased efficiency: Automating the analysis of unstructured content saves time and reduces the manual effort required.
+          * Scalability: The service can handle large volumes of data, making it suitable for organizations of all sizes.
+      - question: |
+         How can businesses use Content Understanding?
+        answer: |
+          Businesses can use Content Understanding in various ways, such as:
+          * Automation: Automate processing of content to extract a defined schema. Call center, documents, and other similar scenarios.
+          * Content cataloging: managing a large corpus of digital assets.
+          * Customer sentiment analysis: Understanding customer feedback from reviews, social media, and support interactions.
+          * Market research: Analyzing trends and patterns from diverse data sources to inform business strategies.
+          * Operational insights: Gain insights from internal documents, emails, and other unstructured data to improve operations.
+      - question: |
+         Is Content Understanding easy to integrate with existing systems?
+        answer: |
+          Yes, Content Understanding easily integrates with existing systems and workflows. The service offers a set of easy-to-use APIs that can be integrated into any application.
+      - question: |
+         What security measures are in place to protect data processed by Content Understanding?
+        answer: |
+         Azure AI Services, including Content Understanding, adheres to strict security and compliance standards to ensure data protection. These measures include data encryption, secure access controls, and compliance with industry regulations such as GDPR and HIPAA. The service also adheres to Microsoft’s responsible use of AI.
+      - question: |
+         How do the capabilities of Azure AI Content Understanding compare to Document Intelligence
+        answer: |
+          Azure AI Content Understanding and Document Intelligence are both powerful tools, but they serve different purposes and have distinct capabilities.
+          Azure AI Content Understanding integrates various data types like text, images, videos, and audio, providing comprehensive analysis and insights using Generative AI. The service is ideal for applications needing diverse data integration for automation, Search, and Retrieval-Augmented Generation (RAG), analytics, and reporting.
+          Conversely, Document Intelligence focuses on extracting and processing key data from documents, such as invoices, forms, and contracts, converting unstructured data into structured, usable information.
+      - question: |
+         How can I migrate from Document Intelligence to Azure AI Content Understanding?
+        answer: |
+          Currently, migration from Document Intelligence to Content Understanding is unavailable.
+      - question: |
+         What base models does Azure AI Content Understanding use?
+        answer: |
+          Content Understanding uses various models and capabilities from Azure OpenAI, Azure AI Speech, Vision, and Language to support single- modality and multi-modal scenarios. The service determines the selection of base models appropriate for each scenario.
+      - question: |
+         What are the pricing tier options for Content Understanding
+        answer: |
+          Content Understanding only supports Standard S0 pricing tier. See more details on the pricing page.
+      - question: |
+         How can I get started with Content Understanding?
         answer: |
-          **Yes.**
+          To get started with Content Understanding, visit the Azure AI Studio and get started with Content Understanding. Azure AI Studio provides comprehensive guides, tutorials, and customer support to help you set up and utilize Content Understanding effectively.
 
-          Current users of Document Intelligence can continue using the service during the preview development phase of the Multimodal service. whats-new.md Document Intelligence v4.0 becomes generally available (GA), its features are integrated with the Content Understanding service. Future enhancements related to document scenarios are then accessible via the Content Understanding service. Existing customers can transition to Content Understanding with minimal disruption.
diff --git a/articles/ai-services/content-understanding/image/overview.md b/articles/ai-services/content-understanding/image/overview.md
@@ -65,6 +65,13 @@ Content Understanding supports the following image file formats in preview:
 | **array**| √ List of subfields of the same type||
 | **Object**| √ Named list of subfields of potentially different types. ||
 
+## Data privacy and security
+
+As with all the Azure AI services, developers using the Content Understanding service should be aware of Microsoft's policies on customer data. See our [**Data, protection and privacy**](https://www.microsoft.com/trust-center/privacy) page to learn more.
+
+> [!IMPORTANT]
+> If you are using Microsoft products or services to process Biometric Data, you are responsible for: (i) providing notice to data subjects, including with respect to retention periods and destruction; (ii) obtaining consent from data subjects; and (iii) deleting the Biometric Data, all as appropriate and required under applicable Data Protection Requirements. "Biometric Data" will have the meaning set forth in Article 4 of the GDPR and, if applicable, equivalent terms in other data protection requirements. For related information, see [Data and Privacy for Face](/legal/cognitive-services/face/data-privacy-security).
+
 ## Next steps
 
 Try processing your content and data using Content Understanding in the [Azure AI Studio](https://ai.azure.com/?tid=888d76fa-54b2-4ced-8ee5-aac1585adee7).
diff --git a/articles/ai-services/content-understanding/media/overview/content-understanding-overview.png b/articles/ai-services/content-understanding/media/overview/content-understanding-overview.png
diff --git a/articles/ai-services/content-understanding/overview.md b/articles/ai-services/content-understanding/overview.md
@@ -15,22 +15,28 @@ ms.custom: ignite-2024-understanding-release
 
 Azure AI Content Understanding is a cloud-based solution within [**Azure AI services**](../what-are-ai-services.md), designed to process/ingest various data modalities such as documents, images, videos, and audio into customizable output formats using Generative AI, Larger Language models (LLM), and Small Language Models (SLM) within a unified workflow.
 
-Built on the success of Document Intelligence, Content Understanding offers a streamlined process to reason over large amounts of unstructured data, build customizable workflows, ultimately accelerating time-to-value (TTV), while varied AI models.
+Content Understanding offers a streamlined process to reason over large amounts of unstructured data, build customizable workflows, ultimately accelerating time-to-value (TTV), while varied AI models.
 
-:::image type="content" source="media/overview/content-understanding-process.png" lightbox="media/overview/content-understanding-process.png" alt-text="Screenshot of accepted media input files.":::
+:::image type="content" source="media/overview/content-understanding-overview.png" lightbox="media/overview/content-understanding-process.png" alt-text="Screenshot of accepted media input files.":::
 
-### Benefits of using Content Understanding
+### Why use Content Understanding?
 
-* **Simplified and streamlined workflows**. Content Understanding simplifies data extraction from mixed modality and unstructured content by eliminating the need for separate workflows.
+* **Simplified and streamlined workflows**. Content Understanding unifies the process for extracting data from any modality or combination of modalities, creating a unified approach to processing all types of content.
 
-   :::image type="content" source="media/overview/content-understanding-workflow.png" alt-text="Screenshot comparing Content Understanding workflows.":::
+* **Simplified Content Extraction**. Content Understanding's schema definition streamlines the generation of structured output from various content types. Users are enabled to define schemas where fields can be extracted, inferred, or abstracted without requiring complex prompt engineering.
 
 * **Efficiency and Cost Reduction**. Automating the ingestion and analysis of large amounts of data from varied sources reduces the cost associated with building Generative AI automation solutions.
 
 * **Enhanced Accuracy**. Content Understanding uses multiple data modalities to simultaneously analyze and cross-validate information, leading to more accurate and reliable results.
 
 ### Content Understanding use cases
 
+* **Automation**. Content Understanding can significantly enhance automation by transforming unstructured content into structured data, which can then be seamlessly integrated into various downstream workflows and applications. For example, it can automate procurement and payment processes by extracting fields from invoices.
+
+* **Search and Retrieval Augmented Generation**. Content Understanding enhances Search and Retrieval-Augmented Generation (RAG) by processing diverse unstructured content. The output can be added to a search index and RAG applications, enhancing the search experience with more accurate and relevant results.
+
+* **Analytics and Reporting**: Content Understanding's extracted schema outputs enhance analytics and reporting, allowing businesses to gain valuable insights, conduct deeper analysis and make informed decisions from accurate reports.
+
 *    **Business leaders and c-suite executives**. Decision makers gain actionable insights from Content Understanding solutions. Generative
 AI powered results and high confidence scores lead to enlightened data-driven decisions and minimize the need for human review.
 
@@ -66,7 +72,7 @@ At Microsoft, we prioritize advancing AI with a people-first approach. Generativ
 As with all the Azure AI services, developers using the Content Understanding service should be aware of Microsoft's policies on customer data. See our [**Data, protection and privacy**](https://www.microsoft.com/trust-center/privacy) page to learn more.
 
 > [!IMPORTANT]
-> if you are using Microsoft products or services to process Biometric Data, you are responsible for: (i) providing notice to data subjects, including with respect to retention periods and destruction; (ii) obtaining consent from data subjects; and (iii) deleting the Biometric Data, all as appropriate and required under applicable Data Protection Requirements. "Biometric Data" will have the meaning set forth in Article 4 of the GDPR and, if applicable, equivalent terms in other data protection requirements. For related information, see [Data and Privacy for Face](/legal/cognitive-services/face/data-privacy-security).
+> If you are using Microsoft products or services to process Biometric Data, you are responsible for: (i) providing notice to data subjects, including with respect to retention periods and destruction; (ii) obtaining consent from data subjects; and (iii) deleting the Biometric Data, all as appropriate and required under applicable Data Protection Requirements. "Biometric Data" will have the meaning set forth in Article 4 of the GDPR and, if applicable, equivalent terms in other data protection requirements. For related information, see [Data and Privacy for Face](/legal/cognitive-services/face/data-privacy-security).
 
 ## Getting started
 Before you get started using Content Understanding, you need an [**Azure AI services multi-service resource**](how-to/create-multi-service-resource.md). The multi-service resource enables access to multiple Azure AI services with a single set of credentials.
diff --git a/articles/ai-services/content-understanding/service-limits.md b/articles/ai-services/content-understanding/service-limits.md
@@ -1,7 +1,7 @@
 ---
-title: Service quotas and limits - Multimodal Intelligence
+title: Service quotas and limits - Content Understanding
 titleSuffix: Azure AI services
-description: Quick reference, detailed description, and best practices for working within Azure AI Multimodal Intelligence service Quotas and Limits
+description: Quick reference, detailed description, and best practices for working within Azure AI Content Understanding service Quotas and Limits
 #services: cognitive-services
 author: laujan
 manager: nitinme
@@ -15,7 +15,7 @@ ms.author: lajanuar
 
 # Service limits and quotas
 
-This article provides both a quick reference and detailed description of Azure AI Multimodal Intelligence service quotas and limits.
+This article provides both a quick reference and detailed description of Azure AI Content Understanding service quotas and limits.
 
 ## File limits
 
@@ -38,18 +38,18 @@ Each modality covers a set of Multipurpose Internet Mail Extensions (MIME) file
 
 |Modality| Supported File Types | File Size | Resolution | Length |
 |--- | --- | --- | --- | --- |
-|**Audio** |   √  .wav (PCM, ALAW, MULAW) </br>√  .mp3 </br>√.opus, .ogg (Opus)</br>√.flac </br>√  .wma </br>√  .aac </br>√  .amr (AMR-NB, AMR-WB) </br>√.webm (Opus, Vorbis) </br>√  .m4a (AAC, ALAC)</br>√.spx | asynchronous:</br>≤ 200 MB |  | asynchronous:</br> ≤ 2 h |
+|**Audio** |   √  .wav (`PCM`, `ALAW`, M`ULAW`) </br>√  .mp3 </br>√.opus, .ogg (Opus)</br>√.flac </br>√  .wma </br>√  .aac </br>√  .amr (AMR-NB, AMR-WB) </br>√.webm (Opus, Vorbis) </br>√  .m4a (`AAC`, `ALAC`)</br>√.spx | asynchronous:</br>≤ 200 MB |  | asynchronous:</br> ≤ 2 h |
 
 ### Video
 
 |Modality| Supported File Types | File Size | Resolution | Length |
 |--- | --- | --- | --- | --- |
-|**Video** | √  .mp4, .m4v </br>√  .flv (with H.264 and AAC codecs) </br>√  .wmv, .asf </br>√  .avi (Uncompressed 8bit/10bit) </br>√  .mkv </br>√  .mov  | asynchronous:</br>≤2 GB (body) asynchronous:</br>≤20 GB (URL)| Min:</br>320 x 240</br></br>Max:</br>1920 x 1080 | asynchronous:</br>≤30 m (body)</br></br> asynchronous:</br>≤30 m (URL) |
+|**Video** | √  .mp4, .m4v </br>√  .flv (with H.264 and `AAC` codecs) </br>√  .wmv, .asf </br>√  .avi (Uncompressed 8bit/10bit) </br>√  .mkv </br>√  .mov  | asynchronous:</br>≤2 GB (body) asynchronous:</br>≤20 GB (URL)| Min: 320 x 240</br></br>Max:</br>1920 x 1080 | asynchronous:</br>≤30 m (body)</br></br> asynchronous:</br>≤30 m (URL) |
 
 
 ## Field Schema Limits
 
-A schema in Multimodal Intelligence refers to a defined structure specifying the types of data to be extracted from various types of unstructured content. Unstructured content types include documents, images, videos, and audio. This structured representation of data is crucial for enabling downstream applications to process and analyze the extracted information effectively.
+A schema in Content Understanding refers to a defined structure specifying the types of data to be extracted from various types of unstructured content. Unstructured content types include documents, images, videos, and audio. This structured representation of data is crucial for enabling downstream applications to process and analyze the extracted information effectively.
 
 This section details the limits of the field inputs for schema definition.
 
@@ -64,17 +64,15 @@ This section details the limits of the field inputs for schema definition.
 | **array**| √ List of subfields of the same type||
 | **Object**| √ Named list of subfields of potentially different types. | 10 (audio, image, video), 50 (document) |
 
-## Analyzer limits per resource
-
-Analyzers in Multimodal Intelligence are specialized components designed to process and extract structured data from various types of unstructured content, such as textual documents, audio, images, and video. These analyzers are tailored to handle specific types of data and tasks, ensuring that the extracted information is accurate and useful for downstream applications.
-
+## Training limits for Custom Document
 | Quota | Standard (S0) |
 | --- | --- |
-| Max models | 100k |
-| Max analysis/min | 1000 pages/images four, (4) hours of audio, 1 hour of video  |
-| Max operations/min | 3000 |
-| Free trainings / month | 10 hours |
 | Max training file size | 1 GB |
 | Max training length | 50k pages/images |
-| Max fields | 100 (document), 10(image, audio, video) |
-| Max enum values | 300 per schema |
+
+## Resource limits
+| Quota | Standard (S0) |
+| --- | --- |
+| Max analyzers | 100k |
+| Max analysis/min | 1000 pages/images four, (4) hours of audio, 1 hour of video  |
+| Max operations/min | 3000 |
diff --git a/articles/ai-services/content-understanding/toc.yml b/articles/ai-services/content-understanding/toc.yml
@@ -31,6 +31,7 @@ items:
     href: audio/overview.md
   - name: Video
     displayName: video, audio, voice, recognition, synthesis, speaker, identification, verification, diarization, transcription, translation, language, understanding, sentiment, analysis, emotion, detection, pronunciation, model
+    href: video/overview.md
   - name: Image
     displayName: image, OCR, optical character recognition, text, extraction, analysis, detection, recognition, model
     href: image/overview.md
diff --git a/articles/ai-services/content-understanding/video/overview.md b/articles/ai-services/content-understanding/video/overview.md