Skip to content

Commit 9a477ea

Browse files
committed
rag concepts
1 parent e4c0602 commit 9a477ea

File tree

7 files changed

+39
-40
lines changed

7 files changed

+39
-40
lines changed

articles/ai-services/content-understanding/concepts/retrieval-augmented-generation.md

Lines changed: 27 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -1,26 +1,24 @@
11
---
2-
title: Azure AI Content Understanding Retrieval Augmented Generation Concept
2+
title: Azure AI Content Understanding retrieval-augmented generation Concept
33
titleSuffix: Azure AI services
4-
description: Learn about Retrieval Augmented Generation
4+
description: Learn about retrieval-augmented generation
55
author: laujan
66
ms.author: tonyeiyalla
77
manager: nitinme
88
ms.service: azure-ai-content-understanding
99
ms.topic: overview
10-
ms.date: 03/16/2025
11-
ms.custom: 2025-understanding-release
10+
ms.date: 04/23/2025
1211
---
13-
# Creating a Multimodal Retrieval Augmented Generation Solution with Content Understanding
1412

15-
# Introduction
13+
# Retrieval-augmented generation with Content Understanding
1614

17-
Retrieval Augmented Generation (RAG) enhances Generative AI models by grounding their responses in external knowledge sources, significantly improving accuracy, relevance, and reliability. A key challenge in RAG is effectively extracting and preparing multimodal content – documents, images, audio, and video – so that it can be accurately retrieved and used to inform the LLM's responses.
15+
retrieval-augmented generation (RAG) enhances Generative AI models by grounding their responses in external knowledge sources, significantly improving accuracy, relevance, and reliability. A key challenge in RAG is effectively extracting and preparing multimodal content – documents, images, audio, and video – so that it can be accurately retrieved and used to inform the LLM's responses.
1816

1917
Azure AI Content Understanding addresses these challenges by providing sophisticated extraction capabilities across all content modalities, preserving semantic integrity and contextual relationships that traditional extraction methods often lose. This unified approach eliminates the need to manage separate workflows and models for different content types, streamlining implementation while ensuring optimal representation for retrieval and generation.
2018

2119
## Why Does Multimodal Data Matter for RAG?
2220

23-
In traditional content processing, simple text extraction was sufficient for many use cases. However, modern enterprise environments contain rich, diverse information spread across multiple formats—documents with complex layouts, images conveying visual insights, audio recordings of crucial conversations, and videos that combine all these elements. For truly comprehensive Retrieval Augmented Generation (RAG) systems, all of this content must be accurately processed and made available to generative AI models. This ensures that when users pose questions, the underlying RAG system can retrieve relevant information regardless of its original format—whether it's a complex table in a financial report, a technical diagram in a manual, insights from a recorded conference call, or explanations from a training video.
21+
In traditional content processing, simple text extraction was sufficient for many use cases. However, modern enterprise environments contain rich, diverse information spread across multiple formats—documents with complex layouts, images conveying visual insights, audio recordings of crucial conversations, and videos that combine all these elements. For truly comprehensive retrieval-augmented generation (RAG) systems, all of this content must be accurately processed and made available to generative AI models. This ensures that when users pose questions, the underlying RAG system can retrieve relevant information regardless of its original format—whether it's a complex table in a financial report, a technical diagram in a manual, insights from a recorded conference call, or explanations from a training video.
2422

2523
## Capabilities of Content Understanding for Multimodal RAG
2624

@@ -50,7 +48,7 @@ A high level summary of RAG implementation pattern looks like this:
5048
3. Store embedded vectors in database or search index.
5149
4. Use Generative AI chat models to query and generate responses from retrieval systems.
5250

53-
Heres an overview of the implementation process, beginning with data extraction using Azure AI Content Understanding as the foundation for transforming raw multimodal data into structured, searchable formats optimized for RAG workflows:
51+
Here's an overview of the implementation process, beginning with data extraction using Azure AI Content Understanding as the foundation for transforming raw multimodal data into structured, searchable formats optimized for RAG workflows:
5452

5553
### 1. Content Extraction: The Foundation for RAG with Content Understanding
5654

@@ -337,16 +335,16 @@ Below is an example showcasing the results of content and field extraction using
337335
"valueString": "Maria Smith contacted Contoso to inquire about her current point balance. Agent John Doe confirmed her identity and informed her that she has 599 points. Maria did not require any further information and the call ended on a positive note."
338336
},
339337
"TrainingTopics": {
340-
"type": "array",
341-
"valueArray": [
342-
{
343-
"type": "string",
344-
"valueString": "Compliance"
345-
},
346-
{
347-
"type": "string",
348-
"valueString": "Risk mitigation"
349-
},]
338+
"type": "array",
339+
"valueArray": [
340+
{
341+
"type": "string",
342+
"valueString": "Compliance"
343+
},
344+
{
345+
"type": "string",
346+
"valueString": "Risk mitigation"
347+
},]
350348
},
351349
"People": {
352350
"type": "array",
@@ -416,16 +414,16 @@ Below is an example showcasing the results of content and field extraction using
416414
"valueString": "The video begins with a view from a glass floor, showing a person's feet in white sneakers standing on it. The scene captures a downward view of a structure, possibly a tower, with a grid pattern on the floor and a clear view of the ground below. The lighting is bright, suggesting a sunny day, and the colors are dominated by the orange of the structure and the gray of the floor."
417415
},
418416
"KeyTopics": {
419-
"type": "array",
420-
"valueArray": [
421-
{
422-
"type": "string",
423-
"valueString": "Flight delay"
424-
},
425-
{
426-
"type": "string",
427-
"valueString": "Customer service"
428-
},
417+
"type": "array",
418+
"valueArray": [
419+
{
420+
"type": "string",
421+
"valueString": "Flight delay"
422+
},
423+
{
424+
"type": "string",
425+
"valueString": "Customer service"
426+
},
429427
]
430428
}
431429
},

articles/ai-services/content-understanding/overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,7 @@ Content Understanding offers a streamlined process to reason over large amounts
4141

4242
* **Automation**. Content Understanding supports automation scenarios by converting unstructured content into structured data, which can be integrated into various workflows and applications. Confidence scores minimize human review and lower costs. For example, automate procurement and payment processes by extracting fields from invoices.
4343

44-
* **Search and retrieval augmented generation (RAG)**. Content Understanding enables ingestion of content of any modality into the search index. The structured output representation improves the relevance for RAG scenarios.
44+
* **Search and retrieval-augmented generation (RAG)**. Content Understanding enables ingestion of content of any modality into the search index. The structured output representation improves the relevance for RAG scenarios.
4545

4646
* **Analytics and reporting**: Content Understanding's extracted field outputs enhance analytics and reporting, allowing businesses to gain valuable insights, conduct deeper analysis, and make informed decisions based on accurate reports.
4747

articles/ai-services/content-understanding/toc.yml

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -65,13 +65,14 @@ items:
6565
- name: Accuracy and confidence
6666
displayName: accuracy, confidence, analyzers, optimization, fields, scores
6767
href: concepts/accuracy-confidence.md
68-
- name: Retrieval Augmented Generation (RAG)
68+
- name: Retrieval-augmented generation (RAG)
6969
displayName: RAG, retrieval, augmented, generation, knowledge, base, search, index, vector
7070
href: concepts/retrieval-augmented-generation.md
7171
- name: Tutorials
7272
items:
73-
- name: Retrieval Augmented Generation Tutorial
74-
href: tutorial/RAG-tutorial.md
73+
- name: Build a retrieval-augmented solution
74+
displayName: RAG, retrieval, augmented, generation, knowledge, base, search, index, vector
75+
href: tutorial/rag-tutorial.md
7576
- name: Responsible AI
7677
items:
7778
- name: Transparency note

articles/ai-services/content-understanding/tutorial/RAG-tutorial.md renamed to articles/ai-services/content-understanding/tutorial/retrieval-augmented-generation.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
2-
title: Azure AI Content Understanding Retrieval Augmented Generation Tutorial
2+
title: Azure AI Content Understanding retrieval-augmented generation Tutorial
33
titleSuffix: Azure AI services
4-
description: Learn about Retrieval Augmented Generation
4+
description: Learn about retrieval-augmented generation
55
author: laujan
66
ms.author: tonyeiyalla
77
manager: nitinme
@@ -11,9 +11,9 @@ ms.date: 04/05/2025
1111
ms.custom: 2025-understanding-release
1212
---
1313

14-
# Tutorial: Building a Multimodal Retrieval Augmented Generation (RAG) Solution with Content Understanding
14+
# Tutorial: Building a Multimodal retrieval-augmented generation (RAG) Solution with Content Understanding
1515

16-
This tutorial provides a comprehensive guide to building a Retrieval Augmented Generation (RAG) solution using Azure AI Content Understanding. It explains the essential components required to design and implement a robust RAG system, highlights best practices for optimizing relevance and accuracy, and outlines the integration points with other Azure services. By the end of this tutorial, you will have a clear understanding of how to leverage Content Understanding to process multimodal data, enhance retrieval precision, and enable generative AI models to deliver contextually rich and accurate responses.
16+
This tutorial provides a comprehensive guide to building a retrieval-augmented generation (RAG) solution using Azure AI Content Understanding. It explains the essential components required to design and implement a robust RAG system, highlights best practices for optimizing relevance and accuracy, and outlines the integration points with other Azure services. By the end of this tutorial, you will have a clear understanding of how to leverage Content Understanding to process multimodal data, enhance retrieval precision, and enable generative AI models to deliver contextually rich and accurate responses.
1717

1818
## Exercises Covered in This Tutorial
1919

articles/ai-services/document-intelligence/concept/analyze-document-response.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -167,7 +167,7 @@ When *output=figures* is specified during the initial `Analyze` operation, the s
167167

168168
#### Sections
169169

170-
Hierarchical document structure analysis is pivotal in organizing, comprehending, and processing extensive documents. This approach is vital for semantically segmenting long documents to boost comprehension, facilitate navigation, and improve information retrieval. The advent of [Retrieval Augmented Generation (RAG)](../concept/retrieval-augmented-generation.md) in document generative AI underscores the significance of hierarchical document structure analysis. The Layout model supports sections and subsections in the output, which identifies the relationship of sections and object within each section. The hierarchical structure is maintained in `elements` of each section.
170+
Hierarchical document structure analysis is pivotal in organizing, comprehending, and processing extensive documents. This approach is vital for semantically segmenting long documents to boost comprehension, facilitate navigation, and improve information retrieval. The advent of [retrieval-augmented generation (RAG)](../concept/retrieval-augmented-generation.md) in document generative AI underscores the significance of hierarchical document structure analysis. The Layout model supports sections and subsections in the output, which identifies the relationship of sections and object within each section. The hierarchical structure is maintained in `elements` of each section.
171171

172172
```json
173173
{

articles/ai-services/document-intelligence/faq.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -56,7 +56,7 @@ sections:
5656
5757
- With Azure AI Document Intelligence and Azure OpenAI combined, you can build an enterprise application to seamlessly interact with your documents using natural language. You can easily find answers, gain valuable insights, and generate new and engaging content from existing documents.
5858
59-
- You can find more details on the [retrieval augmented generation pattern here](concept/retrieval-augmented-generation.md).
59+
- You can find more details on the [retrieval-augmented generation pattern here](concept/retrieval-augmented-generation.md).
6060
6161
- question: |
6262
Can Document Intelligence help with semantic chunking within documents for retrieval-augmented generation?

articles/ai-services/document-intelligence/prebuilt/layout.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -526,7 +526,7 @@ if result.figures:
526526

527527
### Sections
528528

529-
Hierarchical document structure analysis is pivotal in organizing, comprehending, and processing extensive documents. This approach is vital for semantically segmenting long documents to boost comprehension, facilitate navigation, and improve information retrieval. The advent of [Retrieval Augmented Generation (RAG)](../concept/retrieval-augmented-generation.md) in document generative AI underscores the significance of hierarchical document structure analysis. The Layout model supports sections and subsections in the output, which identifies the relationship of sections and object within each section. The hierarchical structure is maintained in `elements` of each section. You can use [output response to markdown format](#output-response-to-markdown-format) to easily get the sections and subsections in markdown.
529+
Hierarchical document structure analysis is pivotal in organizing, comprehending, and processing extensive documents. This approach is vital for semantically segmenting long documents to boost comprehension, facilitate navigation, and improve information retrieval. The advent of [retrieval-augmented generation (RAG)](../concept/retrieval-augmented-generation.md) in document generative AI underscores the significance of hierarchical document structure analysis. The Layout model supports sections and subsections in the output, which identifies the relationship of sections and object within each section. The hierarchical structure is maintained in `elements` of each section. You can use [output response to markdown format](#output-response-to-markdown-format) to easily get the sections and subsections in markdown.
530530

531531
#### [Sample code](#tab/sample-code)
532532

0 commit comments

Comments
 (0)