+ One-to-many [parsing modes](/rest/api/searchservice/indexers/create?view=rest-searchservice-2025-05-01-preview&tabs=HTTP#blobindexerparsingmode), such as `delimitedText`, `jsonArray`, `jsonLines`, and `markdown` with sub-mode `oneToMany`

+ One-to-many [parsing modes](/rest/api/searchservice/indexers/create?view=rest-searchservice-2025-05-01-preview&preserve-view=true#blobindexerparsingmode), such as `delimitedText`, `jsonArray`, `jsonLines`, and `markdown` with sub-mode `oneToMany`
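To make the one-to-many modes above concrete, here is a hedged sketch of where they sit in a blob indexer definition. Only `parsingMode`, `markdownParsingSubmode`, and the mode values come from the text; every other field value (data source and index names) is illustrative.

```python
def blob_indexer_body(name: str, parsing_mode: str) -> dict:
    """Minimal blob indexer body using a one-to-many parsing mode (sketch)."""
    allowed = {"delimitedText", "jsonArray", "jsonLines", "markdown"}
    if parsing_mode not in allowed:
        raise ValueError(f"unsupported one-to-many parsing mode: {parsing_mode}")
    return {
        "name": name,
        "dataSourceName": "my-blob-datasource",  # illustrative
        "targetIndexName": "my-index",           # illustrative
        "parameters": {
            "configuration": {
                "parsingMode": parsing_mode,
                # markdown additionally takes the oneToMany sub-mode:
                **({"markdownParsingSubmode": "oneToMany"}
                   if parsing_mode == "markdown" else {}),
            }
        },
    }
```

With `markdown`, each section becomes its own search document; with the other modes, each row, array element, or line does.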
articles/search/tutorial-multimodal-index-embeddings-skill.md (12 additions, 14 deletions)

@@ -1,7 +1,7 @@
 ---
 title: 'Tutorial: Index multimodal content using multimodal embedding and document layout skill'
 titleSuffix: Azure AI Search
-description: Learn how to extract, index, and search both text and images from Azure Blob Storage for multimodal scenarios using the Azure AI Search REST APIs.
+description: Learn how to extract, index, and search multimodal content using the Document Layout skill for chunking and Azure AI Vision for embeddings.
 
 manager: arjagann
 author: rawan
@@ -13,26 +13,24 @@ ms.date: 05/05/2025
 ---
 
-# Tutorial: Index multimodal content using multimodal embedding and document layout skill
+# Tutorial: Index mixed content using multimodal embeddings and the Document Layout skill
 
-Multimodal plays an essential role in generative AI apps and the user experience as it enables the extraction of information not only from text but also from complex images embedded within documents. In this Azure AI Search tutorial, learn how to build a multimodal retrieval pipeline that chunks data based on document structure, and uses a multimodal embedding model to vectorize text and images in a searchable index.
-
-You’ll work with a 36-page PDF document that combines rich visual content—such as charts, infographics, and scanned pages—with traditional text. Using the [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) (currently in public preview), you’ll extract both text and normalized images with its locationMetadata. Each modality is then embedded using the same [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates dense vector representations suitable for semantic and hybrid search scenarios.
-
-You'll use:
-
-+ The [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) for extracting text and normalized images.
-
-+ Vectorization using the [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates embeddings from both text and images. The same skill is used for both modalities, with text inputs processed into embeddings for semantic search, and images processed into vector representations using Azure AI Vision models.
-
-This tutorial demonstrates a solution for indexing multi-modal content using Document Layout skill. Document Layout skill
-enables extraction both text and image with its locational metadata from various documents, such as page numbers or bounding regions. However, [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) has limited region availability and is bound to Azure AI services and requires [a billable resource](cognitive-search-attach-cognitive-services.md) for transactions that exceed 20 documents per indexer per day
-
-For a lower-cost solution that indexing multi-modal content, see [Index multi-modal content using embedding and document extraction skill](https://aka.ms/azs-multimodal).
-
-This tutorial shows you how to index such data, using a REST client and the [Search REST APIs](/rest/api/searchservice/) to:
+<!-- Multimodal plays an essential role in generative AI apps and the user experience as it enables the extraction of information not only from text but also from complex images embedded within documents. -->
+In this Azure AI Search tutorial, learn how to build a multimodal indexing pipeline that chunks data based on document structure and uses a multimodal embedding model to vectorize text and images in a searchable index.
+
+In this tutorial, you use:
+
++ A 36-page PDF document that combines rich visual content—such as charts, infographics, and scanned pages—with traditional text.
+
++ The [Document Layout skill (preview)](cognitive-search-skill-document-intelligence-layout.md) for extracting text and normalized images, with locationMetadata such as page numbers or bounding regions, from various documents.
+
++ Vectorization using the [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates embeddings for both text and images.
+
++ A search index configured to store text and image embeddings and support vector-based similarity search.
+
+The [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) has limited region availability, is bound to Azure AI services, and requires [a billable resource](cognitive-search-attach-cognitive-services.md) for transactions that exceed 20 documents per indexer per day. For a lower-cost solution for indexing multimodal content, see [Index multimodal content using image verbalization and document extraction skill](https://aka.ms/azs-multimodal).
+
+Using a REST client and the [Search REST APIs](/rest/api/searchservice/), you will:
 
 > [!div class="checklist"]
 > + Set up sample data and configure an `azureblob` data source
@@ -614,4 +612,4 @@ Now that you're familiar with a sample implementation of a multimodal indexing s
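The Document Layout plus Azure AI Vision pairing described in this file can be sketched as a skillset payload for the Create Skillset REST API. This is a hypothetical shape, not the tutorial's exact definition: the `@odata.type` values, context paths, and input/output names below are assumptions to check against the REST reference.

```python
import json

def build_skillset(name: str) -> dict:
    """Minimal skillset body chaining layout-based chunking into vectorization (sketch)."""
    return {
        "name": name,
        "skills": [
            {
                # Chunks documents by structure; emits text and normalized images
                # with locationMetadata (page numbers, bounding regions).
                # Type name assumed from the preview docs.
                "@odata.type": "#Microsoft.Skills.Util.DocumentIntelligenceLayoutSkill",
                "context": "/document",
                "inputs": [{"name": "file_data", "source": "/document/file_data"}],
                "outputs": [{"name": "text_sections", "targetName": "text_sections"}],
            },
            {
                # The same multimodal embedding skill handles both text and images;
                # a second instance over the image path would mirror this one.
                "@odata.type": "#Microsoft.Skills.Vision.VectorizeSkill",
                "context": "/document/text_sections/*",
                "inputs": [{"name": "text", "source": "/document/text_sections/*/content"}],
                "outputs": [{"name": "vector", "targetName": "text_vector"}],
            },
        ],
    }

payload = build_skillset("doc-layout-embeddings-skillset")
body = json.dumps(payload)  # PUT this to /skillsets('{name}') with your api-key
```

The point of the sketch is the chaining: the layout skill's `targetName` becomes the source path the embedding skill reads from.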
articles/search/tutorial-multimodal-index-image-verbalization-skill.md (16 additions, 14 deletions)

@@ -1,7 +1,7 @@
 ---
 title: 'Tutorial: Index multimodal content using image verbalization and document layout skill'
 titleSuffix: Azure AI Search
-description: Learn how to extract, describe, and index text and images from Azure Blob Storage using GenAI Prompt skill and Azure AI Search REST APIs to support multimodal scenarios.
+description: Learn how to extract, index, and search multimodal content using the Document Layout skill for chunking and the GenAI Prompt skill for image verbalizations.
 
 manager: arjagann
 author: rawan
@@ -13,28 +13,30 @@ ms.date: 05/05/2025
 ---
 
-# Tutorial: Index multimodal content using image verbalization and document layout skill
+# Tutorial: Index mixed content using image verbalizations and the Document Layout skill
 
-Multi-modality plays an essential role in generative AI apps and the user experience as it enables the extraction of information not only from text but also from complex images embedded within documents. "In this Azure AI Search tutorial, learn how to build a multimodal retrieval pipeline that that chunks data based on document structure, and =uses image verbalization to describe images. Cropped images are stored in a knowledge store, and visual content is described in natural language and ingested alongside text in a searchable index.
-
-You’ll work with a 36-page PDF document that combines rich visual content—such as charts, infographics, and scanned pages—with traditional text. Using the [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md)(currently in public preview), you’ll extract both text and normalized images with its locationMetadata. Each image is passed to the [GenAI Prompt skill](cognitive-search-skill-genai-prompt.md) (currently in public preview) to generate a concise textual description. These descriptions, along with the original document text, are then embedded into vector representations using Azure OpenAI’s text-embedding-3-large model. The result is a single index containing semantically searchable content from both modalities—text and verbalized images.
-
-You'll use:
-
-+ The [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) for extracting text and normalized images.
-
-+ The [GenAI Prompt skill](cognitive-search-skill-genai-prompt.md) to generate image captions — text-based descriptions of visual content — for search and grounding.
-
-+ Vectorization using the [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates embeddings from both text and images. The same skill is used for both modalities, with text inputs processed into embeddings for semantic search, and images processed into vector representations using Azure AI Vision models.
-
-+ A search index configured to store text and image embeddings and support vector-based similarity search.
-
-This tutorial demonstrates a solution for indexing multi-modal content using Document Layout skill. Document Layout skill
-enables extraction both text and image with its locational metadata from various documents, such as page numbers or bounding regions. However, [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) has limited region availability and is bound to Azure AI services and requires [a billable resource](cognitive-search-attach-cognitive-services.md) for transactions that exceed 20 documents per indexer per day
+In this Azure AI Search tutorial, learn how to build a multimodal indexing pipeline that chunks data based on document structure and uses image verbalization to describe images. Cropped images are stored in a knowledge store, and visual content is described in natural language and ingested alongside text in a searchable index.
+
+From the source document, each image is passed to the [GenAI Prompt skill (preview)](cognitive-search-skill-genai-prompt.md) to generate a concise textual description. These descriptions, along with the original document text, are then embedded into vector representations using Azure OpenAI’s text-embedding-3-large model. The result is a single index containing semantically searchable content from both modalities—text and verbalized images.
+
+In this tutorial, you use:
+
++ A 36-page PDF document that combines rich visual content—such as charts, infographics, and scanned pages—with traditional text.
+
++ The [Document Layout skill (preview)](cognitive-search-skill-document-intelligence-layout.md) for extracting text and normalized images, with locationMetadata such as page numbers or bounding regions, from various documents.
+
++ The [GenAI Prompt skill (preview)](cognitive-search-skill-genai-prompt.md) to generate image captions — text-based descriptions of visual content — for search and grounding.
+
++ Vectorization using the [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates embeddings from both text and images. The same skill is used for both modalities, with text inputs processed into embeddings for semantic search, and images processed into vector representations using Azure AI Vision models.
+
++ A search index configured to store text and image embeddings and support vector-based similarity search.
+
+The [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) has limited region availability, is bound to Azure AI services, and requires [a billable resource](cognitive-search-attach-cognitive-services.md) for transactions that exceed 20 documents per indexer per day. For a lower-cost solution for indexing multimodal content, see [Index multimodal content using image verbalization and document extraction skill](https://aka.ms/azs-multimodal).
 
 > [!NOTE]
-> Setting `imageAction` to `generateNormalizedImages`as is required for this tutorial will incur an additional charge for image extraction according to [Azure AI Search pricing](https://azure.microsoft.com/pricing/details/search/).
+> Setting `imageAction` to `generateNormalizedImages` is required for this tutorial and incurs an additional charge for image extraction according to [Azure AI Search pricing](https://azure.microsoft.com/pricing/details/search/).
 
-Using a REST client and the [Search REST APIs](/rest/api/searchservice/) you will:
+Using a REST client and the [Search REST APIs](/rest/api/searchservice/), you will:
 
 > [!div class="checklist"]
 > + Set up sample data and configure an `azureblob` data source
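The image-verbalization step this file describes — each normalized image sent to a chat-completion model to produce a caption that is indexed as text — can be sketched as a skill definition. Everything here is a hypothetical shape: the `@odata.type`, input names, and prompt text are assumptions, not the tutorial's exact GenAI Prompt skill contract.

```python
def genai_prompt_skill(chat_completion_uri: str) -> dict:
    """A GenAI Prompt-style skill turning each normalized image into a caption (sketch)."""
    return {
        # Type name assumed; verify against the GenAI Prompt skill reference.
        "@odata.type": "#Microsoft.Skills.Custom.ChatCompletionSkill",
        "context": "/document/normalized_images/*",
        "uri": chat_completion_uri,  # caller-supplied chat-completion endpoint
        "inputs": [
            # A fixed instruction plus the image bytes for each normalized image.
            {"name": "systemMessage", "source": "='Describe this image for search.'"},
            {"name": "image", "source": "/document/normalized_images/*/data"},
        ],
        # The caption lands alongside the document text for embedding and indexing.
        "outputs": [{"name": "response", "targetName": "verbalizedImage"}],
    }
```

Downstream, `verbalizedImage` would be embedded with text-embedding-3-large just like any other text chunk.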
articles/search/tutorial-multimodal-indexing-with-embedding-and-doc-extraction.md (8 additions, 8 deletions)

@@ -1,7 +1,7 @@
 ---
 title: 'Tutorial: Index multimodal content using embedding and document extraction skill'
 titleSuffix: Azure AI Search
-description: Learn how to extract, index, and search both text and images from Azure Blob Storage for multimodal scenarios using the Azure AI Search REST APIs.
+description: Learn how to extract, index, and search multimodal content using the Document Extraction skill for chunking and Azure AI Vision for embeddings.
 
 manager: arjagann
 author: mdonovan
@@ -13,19 +13,19 @@ ms.date: 05/01/2025
 ---
 
-# Tutorial: Index multimodal content using embedding and document extraction skill
+# Tutorial: Index mixed content using multimodal embeddings and the Document Extraction skill
 
-Azure AI Search can extract and index both text and images from PDF documents stored in Azure Blob Storage. This tutorial shows how to build a multimodal retrieval pipeline by embedding both text and images into a unified semantic search index.
+Azure AI Search can extract and index both text and images from PDF documents stored in Azure Blob Storage. This tutorial shows you how to build a multimodal indexing pipeline by embedding both text and images into a unified semantic search index.
 
-You’ll work with a 36-page PDF document that combines rich visual content—such as charts, infographics, and scanned pages—with traditional text. Using the [Document Extraction skill](cognitive-search-skill-document-extraction.md), you’ll extract both text and normalized images. Each modality is then embedded using the same [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates dense vector representations suitable for semantic and hybrid search scenarios.
+In this tutorial, you use:
 
-You'll use:
++ A 36-page PDF document that combines rich visual content—such as charts, infographics, and scanned pages—with traditional text.
 
 + The [Document Extraction skill](cognitive-search-skill-document-extraction.md) for extracting text and normalized images.
 
-+ Vectorization using the [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates embeddings from both text and images. The same skill is used for both modalities, with text inputs processed into embeddings for semantic search, and images processed into vector representations using Azure AI Vision models.
++ Vectorization using the [Azure AI Vision multimodal embeddings skill](cognitive-search-skill-vision-vectorize.md), which generates embeddings for both text and images.
 
 + A search index configured to store text and image embeddings and support vector-based similarity search.
 
 This tutorial demonstrates a lower-cost approach for indexing multimodal content using Document Extraction skill and image captioning. It enables extraction and search over both text and images from documents in Azure Blob Storage. However, it does not include locational metadata for text, such as page numbers or bounding regions.
 
@@ -34,7 +34,7 @@ For a more comprehensive solution that includes structured text layout and spati
 
 > [!NOTE]
 > Setting `imageAction` to `generateNormalizedImages` as is required for this tutorial will incur an additional charge for image extraction according to [Azure AI Search pricing](https://azure.microsoft.com/pricing/details/search/).
 
-This tutorial shows you how to index such data, using a REST client and the [Search REST APIs](/rest/api/searchservice/) to:
+Using a REST client and the [Search REST APIs](/rest/api/searchservice/), you will:
 
 > [!div class="checklist"]
 > + Set up sample data and configure an `azureblob` data source
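The NOTE in this file says `imageAction` must be set to `generateNormalizedImages`. A minimal sketch of the indexer parameters carrying that setting follows; `imageAction` and its value come from the text, while the data source and index names are illustrative placeholders.

```python
def indexer_with_image_extraction(name: str) -> dict:
    """Minimal indexer body enabling normalized-image extraction (sketch)."""
    return {
        "name": name,
        "dataSourceName": "my-azureblob-datasource",  # illustrative
        "targetIndexName": "doc-extraction-index",    # illustrative
        "parameters": {
            "configuration": {
                "dataToExtract": "contentAndMetadata",
                # Required for this tutorial; image extraction is billed
                # per Azure AI Search pricing.
                "imageAction": "generateNormalizedImages",
            }
        },
    }
```

With this setting, extracted images appear under `/document/normalized_images/*` in the enrichment tree, which is where the embedding skill reads them.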