You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/search-get-started-portal-import-vectors.md
+27-25Lines changed: 27 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,8 +22,8 @@ This quickstart helps you get started with [integrated vectorization (preview)](
22
22
Key points about the wizard:
23
23
24
24
+ Source data is either Azure Blob Storage or OneLake files and shortcuts.
25
-
+ Document parsing mode is nonconfigurable. It's the default parsing mode (one search document per blob or file).
26
-
+ Index schema is nonconfigurable. The schema provides vector and nonvector fields for chunked data.
25
+
+ Document parsing mode is the default (one search document per blob or file).
26
+
+ Index schema is nonconfigurable. It provides vector and nonvector fields for chunked data.
27
27
+ Chunking is nonconfigurable. The effective settings are:
28
28
29
29
```json
@@ -32,53 +32,55 @@ Key points about the wizard:
32
32
pageOverlapLength: 500
33
33
```
34
34
35
-
For fewer limitations or more data source options, try a code-based approach. For more information, see the [integrated vectorization sample](https://github.com/Azure/azure-search-vector-samples/blob/main/demo-python/code/integrated-vectorization/azure-search-integrated-vectorization-sample.ipynb).
36
-
37
35
## Prerequisites
38
36
39
37
+ An Azure subscription. [Create one for free](https://azure.microsoft.com/free/).
40
38
41
-
+ For data, either [Azure Blob Storage](/azure/storage/common/storage-account-overview) or a [OneLake lakehouse](search-how-to-index-onelake-files.md).
42
-
43
-
Azure Storage must be a standard performance (general-purpose v2) account. Access tiers can be hot, cool, and cold. Don't use Azure Data Lake Storage Gen2 (a storage account with a hierarchical namespace). This version of the wizard doesn't support Data Lake Storage Gen2.
39
+
+[Azure AI Search service](search-create-service-portal.md) in the same region as Azure AI. We recommend the Basic tier or higher.
44
40
45
-
+For vectorization, have an [Azure AI services multiservice account](/azure/ai-services/multi-service-resource) or [Azure OpenAI Service](https://aka.ms/oai/access) endpoint with deployments.
41
+
+[Azure Blob Storage](/azure/storage/common/storage-account-overview) or a [OneLake lakehouse](search-how-to-index-onelake-files.md).
46
42
47
-
For [multimodal with Azure AI Vision](/azure/ai-services/computer-vision/how-to/image-retrieval), create an Azure AI service in SwedenCentral, EastUS, NorthEurope, WestEurope, WestUS, SoutheastAsia, KoreaCentral, FranceCentral, AustraliaEast, WestUS2, SwitzerlandNorth, or JapanEast. [Check the documentation](/azure/ai-services/computer-vision/how-to/image-retrieval?tabs=csharp) for an updated list.
43
+
Azure Storage must be a standard performance (general-purpose v2) account. Access tiers can be hot, cool, and cold. Don't use Azure Data Lake Storage Gen2 (a storage account with a hierarchical namespace). This version of the wizard doesn't support Data Lake Storage Gen2.
48
44
49
-
You can also use an [Azure AI Studio model catalog](/azure/ai-studio/what-is-ai-studio) (and hub and project) with model deployments.
45
+
+ An embedding model on a supported platform. [Deployment instructions](#set-up-embedding-models) are provided in this article.
50
46
51
-
+ For indexing and queries, have an [Azure AI Search service](search-create-service-portal.md). It must be in the same region as your Azure AI service. We recommend the Basic tier or higher.
47
+
| Provider | Supported models |
48
+
|---|---|
49
+
|[Azure OpenAI Service](https://aka.ms/oai/access)| text-embedding-ada-002, text-embedding-3-large, or text-embedding-3-small. |
50
+
|[Azure AI Studio model catalog](/azure/ai-studio/what-is-ai-studio)| Azure, Cohere, and Facebook embedding models. |
51
+
|[Azure AI services multiservice account](/azure/ai-services/multi-service-resource)|[Azure AI Vision multimodal](/azure/ai-services/computer-vision/how-to/image-retrieval) for image and text vectorization. Azure AI Vision multimodal is available in selected regions: East US, West US, West US2, North Europe, West Europe, France Central, Sweden Central, Switzerland North, Southeast Asia, Korea Central, Australia East, or Japan East. [Check the documentation](/azure/ai-services/computer-vision/how-to/image-retrieval?tabs=csharp) for an updated list. |
52
52
53
-
+ Role assignments or API keys for connections to embedding models and data sources. This article provides instructions for role-based access control (RBAC).
53
+
### Public endpoint requirements
54
54
55
55
All of the preceding resources must have public access enabled so that the portal nodes can access them. Otherwise, the wizard fails. After the wizard runs, you can enable firewalls and private endpoints on the integration components for security. For more information, see [Secure connections in the import wizards](search-import-data-portal.md#secure-connections).
56
56
57
57
If private endpoints are already present and you can't disable them, the alternative option is to run the respective end-to-end flow from a script or program on a virtual machine. The virtual machine must be on the same virtual network as the private endpoint. [Here's a Python code sample](https://github.com/Azure/azure-search-vector-samples/tree/main/demo-python/code/integrated-vectorization) for integrated vectorization. The same [GitHub repo](https://github.com/Azure/azure-search-vector-samples/tree/main) has samples in other programming languages.
58
58
59
-
A free search service supports RBAC on connections to Azure AI Search, but it doesn't support managed identities on outbound connections to Azure Storage or Azure AI Vision. This level of support means you must use key-based authentication on connections between a free search service and other Azure services. For connections that are more secure:
59
+
### Role-based access control requirements
60
60
61
-
+ Use the Basic tier or higher.
62
-
+[Configure a managed identity](search-howto-managed-identities-data-sources.md) and role assignments to admit requests from Azure AI Search on other Azure services.
61
+
We recommend role assignments for search service connections to other resources.
63
62
64
-
> [!NOTE]
65
-
> If you can't progress through the wizard because options aren't available (for example, you can't select a data source or an embedding model), revisit the role assignments. Error messages indicate that models or deployments don't exist, when in fact the real problem is that the search service doesn't have permission to access them.
63
+
1. On Azure AI Search, [enable roles](search-security-enable-roles.md).
66
64
67
-
## Check for space
65
+
1. Configure your search service to [use a managed identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
68
66
69
-
If you're starting with the free service, you're limited to three indexes, three data sources, three skillsets, and three indexers. Make sure you have room for extra items before you begin. This quickstart creates one of each object.
67
+
1. On your data source platform and embedding model provider, create role assignments that allow search service to access data and models. [Prepare sample data](#prepare-sample-data) provides instructions for setting up roles.
70
68
71
-
## Check for service identity
69
+
A free search service supports RBAC on connections to Azure AI Search, but it doesn't support managed identities on outbound connections to Azure Storage or Azure AI Vision. This level of support means you must use key-based authentication on connections between a free search service and other Azure services.
72
70
73
-
We recommend role assignments for search service connections to other resources.
71
+
For more secure connections:
74
72
75
-
1. On Azure AI Search, [enable RBAC](search-security-enable-roles.md).
73
+
+ Use the Basic tier or higher.
74
+
+[Configure a managed identity](search-howto-managed-identities-data-sources.md) and use roles for authorized access.
75
+
76
+
> [!NOTE]
77
+
> If you can't progress through the wizard because options aren't available (for example, you can't select a data source or an embedding model), revisit the role assignments. Error messages indicate that models or deployments don't exist, when in fact the real cause is that the search service doesn't have permission to access them.
76
78
77
-
1. Configure your search service to [use a system-assigned or user-assigned managed identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
79
+
### Check for space
78
80
79
-
In the following sections, you can assign the search service's managed identity to roles in other services. The sections provide steps for role assignments where applicable.
81
+
If you're starting with the free service, you're limited to 3 indexes, data sources, skillsets, and indexers. Basic limits you to 15. Make sure you have room for extra items before you begin. This quickstart creates one of each object.
80
82
81
-
## Check for semantic ranking
83
+
###Check for semantic ranking
82
84
83
85
The wizard supports semantic ranking, but only on the Basic tier and higher, and only if semantic ranking is already [enabled on your search service](semantic-how-to-enable-disable.md). If you're using a billable tier, check whether semantic ranking is enabled.
0 commit comments