You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/search-get-started-portal-image-search.md
+55-13Lines changed: 55 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,14 +6,14 @@ author: haileytap
6
6
ms.author: haileytapia
7
7
ms.service: azure-ai-search
8
8
ms.topic: quickstart
9
-
ms.date: 05/21/2025
9
+
ms.date: 05/22/2025
10
10
ms.custom:
11
11
- references_regions
12
12
---
13
13
14
14
# Quickstart: Search for multimodal content in the Azure portal
15
15
16
-
In this quickstart, you use the **Import and vectorize data** wizard in the Azure portal to get started with [multimodal search](multimodal-search-overview.md). The wizard simplifies the process of extracting page text and inline images from documents, describing images in natural language, vectorizing both text and image descriptions, and storing images for later retrieval.
16
+
In this quickstart, you use the **Import and vectorize data** wizard in the Azure portal to get started with [multimodal search](multimodal-search-overview.md). The wizard simplifies the process of extracting page text and inline images from documents, describing images in natural language, vectorizing image descriptions and text, and storing images for later retrieval.
17
17
18
18
The sample data consists of a multimodal PDF in the [azure-search-sample-data](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/sustainable-ai-pdf) repo, but you can use different files and still follow this quickstart.
19
19
@@ -23,7 +23,7 @@ The sample data consists of a multimodal PDF in the [azure-search-sample-data](h
23
23
24
24
+ An [Azure Storage account](/azure/storage/common/storage-account-create). Use Azure Blob Storage or Azure Data Lake Storage Gen2 (storage account with a hierarchical namespace) on a standard performance (general-purpose v2) account. Access tiers can be hot, cool, or cold.
25
25
26
-
+ An [Azure AI services multi-service account](/azure/ai-services/multi-service-resource#azure-ai-services-resource-for-azure-ai-search-skills) in East US, West Europe, or North Central US.
26
+
+ An [Azure AI services multi-service account](/azure/ai-services/multi-service-resource#azure-ai-multi-services-resource-for-azure-ai-search-skills) in East US, West Europe, or North Central US.
27
27
28
28
+ An [Azure AI Search service](search-create-service-portal.md) in the same region as your Azure AI multi-service account.
29
29
@@ -37,17 +37,59 @@ All of the preceding resources must have public access enabled so that the Azure
37
37
38
38
If private endpoints are already present and you can't disable them, the alternative is to run the respective end-to-end flow from a script or program on a virtual machine. The virtual machine must be on the same virtual network as the private endpoint. [Here's a Python code sample](https://github.com/Azure/azure-search-vector-samples/tree/main/demo-python/code/integrated-vectorization) for integrated vectorization. The same [GitHub repo](https://github.com/Azure/azure-search-vector-samples/tree/main) has samples in other programming languages.
39
39
40
-
### Role-based access
40
+
### Check for space
41
41
42
-
A free search service supports role-based access control on connections to Azure AI Search, but it doesn't support managed identities on outbound connections to Azure Storage or Azure AI Vision. This level of support means you must use key-based authentication on connections between a free search service and other Azure services. For more secure connections:
42
+
If you're starting with the free service, you're limited to three indexes, three data sources, three skillsets, and three indexers. Make sure you have room for extra items before you begin. This quickstart creates one of each object.
43
43
44
-
+ Use the Basic tier or higher.
44
+
## Configure access
45
45
46
-
+[Configure a system-assigned managed identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity)and role assignments to admit requests from Azure AI Search on other Azure services.
46
+
Before you begin, make sure you have permissions to access content and operations. We recommend Microsoft Entra ID authentication and role-based access for authorization. You must be an **Owner** or **User Access Administrator**to assign roles. If roles aren't feasible, you can use [key-based authentication](search-security-api-keys.md) instead.
47
47
48
-
### Check for space
48
+
Configure access to each resource identified in this section.
49
49
50
-
If you're starting with the free service, you're limited to three indexes, three data sources, three skillsets, and three indexers. Make sure you have room for extra items before you begin. This quickstart creates one of each object.
50
+
### [**Azure AI Search**](#tab/search-perms)
51
+
52
+
Azure AI Search provides the multimodal pipeline. Configure access for yourself and your search service to read data, run the pipeline, and interact with other Azure resources.
1.[Configure a system-assigned managed identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
59
+
60
+
1.[Assign the following roles](search-security-rbac.md) to yourself.
61
+
62
+
+**Search Service Contributor**
63
+
64
+
+**Search Index Data Contributor**
65
+
66
+
+**Search Index Data Reader**
67
+
68
+
### [**Azure Storage**](#tab/storage-perms)
69
+
70
+
Azure Storage is both the data source for your documents and the destination for extracted images. Your search service requires access to these storage containers, which you create in the next section of this quickstart.
71
+
72
+
On your Azure Storage account:
73
+
74
+
+ Assign **Storage Blob Data Contributor** to your [search service identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
75
+
76
+
### [**Azure AI services**](#tab/ai-services-perms)
77
+
78
+
An Azure AI multi-service account provides multiple Azure AI services, including [Azure AI Document Intelligence](/azure/ai-services/document-intelligence/overview) for content extraction and semantic chunking. Your search service requires access to call the [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md).
79
+
80
+
On your Azure AI multi-service account:
81
+
82
+
+ Assign **Cognitive Services User** to your [search service identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
83
+
84
+
### [**Azure OpenAI**](#tab/openai-perms)
85
+
86
+
Azure OpenAI provides large language models (LLMs) for image verbalization and embedding models for text and image vectorization. Your search service requires access to call the [GenAI Prompt skill](cognitive-search-skill-genai-prompt.md) and [Azure OpenAI Embedding skill](cognitive-search-skill-azure-openai-embedding.md).
87
+
88
+
On your Azure OpenAI resource:
89
+
90
+
+ Assign **Cognitive Services OpenAI User** to your [search service identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
91
+
92
+
---
51
93
52
94
## Prepare sample data
53
95
@@ -65,7 +107,7 @@ To prepare the sample data for this quickstart:
65
107
66
108
## Deploy models
67
109
68
-
The wizard requires a large language model (LLM) to verbalize images and an embedding model to generate vector representations of text and verbalized text content. Both models are available through Azure OpenAI.
110
+
The wizard requires an LLM to verbalize images and an embedding model to generate vector representations of text and verbalized text content. Both models are available through Azure OpenAI.
69
111
70
112
To deploy the models for this quickstart:
71
113
@@ -125,7 +167,7 @@ To connect to your data:
125
167
126
168
The next step is to select a method for document cracking and chunking.
127
169
128
-
Your Azure AI multi-service account provides access to the [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md), which extracts page numbers, bounding polygons, and other location metadata from both text and images. The Document Layout skill also breaks large documents into smaller, more manageable chunks.
170
+
Your Azure AI multi-service account provides access to the [Document Layout skill](cognitive-search-skill-document-intelligence-layout.md), which extracts page numbers, bounding polygons, and other location metadata from both text and images. The Document Layout skill also breaks documents into smaller, more manageable chunks.
129
171
130
172
To use the Document Layout skill:
131
173
@@ -143,7 +185,7 @@ To use the Document Layout skill:
143
185
144
186
## Embed your content
145
187
146
-
During this step, the wizard calls two skills to generate both descriptive text for images (image verbalization) and vector embeddings for text and images.
188
+
During this step, the wizard calls two skills to generate descriptive text for images (image verbalization) and vector embeddings for text and images.
147
189
148
190
For image verbalization, the [GenAI Prompt skill](cognitive-search-skill-genai-prompt.md) uses the LLM you deployed to analyze each extracted image and produce a natural-language description.
149
191
@@ -183,7 +225,7 @@ To use the GenAI Prompt skill and Azure OpenAI Embedding skill:
183
225
184
226
## Store the extracted images
185
227
186
-
The next step is to save any images extracted from your documents in Azure Storage. In Azure AI Search, this is known as a knowledge store.
228
+
The next step is to send images extracted from your documents to Azure Storage. In Azure AI Search, this secondary storage is known as a [knowledge store](knowledge-store-concept-intro.md).
Copy file name to clipboardExpand all lines: articles/search/search-get-started-portal-import-vectors.md
+14-6Lines changed: 14 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,7 @@ ms.custom:
9
9
- build-2024
10
10
- ignite-2024
11
11
ms.topic: quickstart
12
-
ms.date: 05/21/2025
12
+
ms.date: 05/22/2025
13
13
---
14
14
15
15
# Quickstart: Vectorize text in the Azure portal
@@ -47,7 +47,7 @@ For integrated vectorization, you must use one of the following embedding models
47
47
| Provider | Supported models |
48
48
|--|--|
49
49
|[Azure OpenAI in Azure AI Foundry Models](/azure/ai-services/openai/how-to/create-resource) <sup>1, 2</sup> | text-embedding-ada-002<br>text-embedding-3-small<br>text-embedding-3-large |
50
-
|[Azure AI services multi-service resource](/azure/ai-services/multi-service-resource#azure-ai-services-resource-for-azure-ai-search-skills) <sup>3</sup> | For text and images: [Azure AI Vision multimodal](/azure/ai-services/computer-vision/how-to/image-retrieval) <sup>4</sup></li> |
50
+
|[Azure AI services multi-service resource](/azure/ai-services/multi-service-resource#azure-ai-multi-services-resource-for-azure-ai-search-skills) <sup>3</sup> | For text and images: [Azure AI Vision multimodal](/azure/ai-services/computer-vision/how-to/image-retrieval) <sup>4</sup></li> |
51
51
|[Azure AI Foundry model catalog](/azure/ai-foundry/what-is-azure-ai-foundry)| For text:<br>Cohere-embed-v3-english<br>Cohere-embed-v3-multilingual<br><br>For images:<br>Facebook-DinoV2-Image-Embeddings-ViT-Base<br>Facebook-DinoV2-Image-Embeddings-ViT-Giant |
52
52
53
53
<sup>1</sup> The endpoint of your Azure OpenAI resource must have a [custom subdomain](/azure/ai-services/cognitive-services-custom-subdomains), such as `https://my-unique-name.openai.azure.com`. If you created your resource in the [Azure portal](https://portal.azure.com/), this subdomain was automatically generated during resource setup.
@@ -74,6 +74,14 @@ To configure the recommended role-based access:
74
74
75
75
1. On your search service, [enable roles](search-security-enable-roles.md) and [configure a system-assigned managed identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
76
76
77
+
1.[Assign the following roles](search-security-rbac.md) to yourself.
78
+
79
+
+**Search Service Contributor**
80
+
81
+
+**Search Index Data Contributor**
82
+
83
+
+**Search Index Data Reader**
84
+
77
85
1. On your data source platform and embedding model provider, create role assignments that allow your search service to access data and models. See [Prepare sample data](#prepare-sample-data) and [Prepare embedding models](#prepare-embedding-model).
78
86
79
87
> [!NOTE]
@@ -101,7 +109,7 @@ This section points you to the content that works for this quickstart. Before yo
101
109
102
110
1. Select **Add** > **Add role assignment**.
103
111
104
-
1. Under **Job function roles**, select **[Storage Blob Data Reader](search-howto-managed-identities-data-sources.md#assign-a-role)**, and then select **Next**.
112
+
1. Under **Job function roles**, select **Storage Blob Data Reader**, and then select **Next**.
105
113
106
114
1. Under **Members**, select **Managed identity**, and then select **Select members**.
107
115
@@ -127,7 +135,7 @@ This section points you to the content that works for this quickstart. Before yo
127
135
128
136
1. Select **Add** > **Add role assignment**.
129
137
130
-
1. Under **Job function roles**, select **[Storage Blob Data Reader](search-howto-managed-identities-data-sources.md#assign-a-role)**, and then select **Next**.
138
+
1. Under **Job function roles**, select **Storage Blob Data Reader**, and then select **Next**.
131
139
132
140
1. Under **Members**, select **Managed identity**, and then select **Select members**.
133
141
@@ -188,7 +196,7 @@ The wizard supports text-embedding-ada-002, text-embedding-3-large, and text-emb
188
196
189
197
1. Select **Add** > **Add role assignment**.
190
198
191
-
1. Under **Job function roles**, select **[Cognitive Services OpenAI User](/azure/ai-services/openai/how-to/role-based-access-control#azure-openai-roles)**, and then select **Next**.
199
+
1. Under **Job function roles**, select **Cognitive Services OpenAI User**, and then select **Next**.
192
200
193
201
1. Under **Members**, select **Managed identity**, and then select **Select members**.
194
202
@@ -371,7 +379,7 @@ In this step, you specify an embedding model to vectorize chunked data. Chunking
371
379
372
380
+ Azure AI Foundry model catalog
373
381
374
-
+ An Azure AI Vision multimodal resource in the same region as Azure AI Search. If there's no [Azure AI services multi-service account](/azure/ai-services/multi-service-resource#azure-ai-services-resource-for-azure-ai-search-skills) in the same region, this option isn't available.
382
+
+ An Azure AI Vision multimodal resource in the same region as Azure AI Search. If there's no [Azure AI services multi-service account](/azure/ai-services/multi-service-resource#azure-ai-multi-services-resource-for-azure-ai-search-skills) in the same region, this option isn't available.
0 commit comments