Skip to content

Commit b376d16

Browse files
Merge pull request #281341 from HeidiSteen/heidist-uuf
refreshed vector quickstart
2 parents 50319fb + 941f05d commit b376d16

File tree

1 file changed

+27
-25
lines changed

1 file changed

+27
-25
lines changed

articles/search/search-get-started-portal-import-vectors.md

Lines changed: 27 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -22,8 +22,8 @@ This quickstart helps you get started with [integrated vectorization (preview)](
2222
Key points about the wizard:
2323

2424
+ Source data is either Azure Blob Storage or OneLake files and shortcuts.
25-
+ Document parsing mode is nonconfigurable. It's the default parsing mode (one search document per blob or file).
26-
+ Index schema is nonconfigurable. The schema provides vector and nonvector fields for chunked data.
25+
+ Document parsing mode is the default (one search document per blob or file).
26+
+ Index schema is nonconfigurable. It provides vector and nonvector fields for chunked data.
2727
+ Chunking is nonconfigurable. The effective settings are:
2828

2929
```json
@@ -32,53 +32,55 @@ Key points about the wizard:
3232
pageOverlapLength: 500
3333
```
3434

35-
For fewer limitations or more data source options, try a code-based approach. For more information, see the [integrated vectorization sample](https://github.com/Azure/azure-search-vector-samples/blob/main/demo-python/code/integrated-vectorization/azure-search-integrated-vectorization-sample.ipynb).
36-
3735
## Prerequisites
3836

3937
+ An Azure subscription. [Create one for free](https://azure.microsoft.com/free/).
4038

41-
+ For data, either [Azure Blob Storage](/azure/storage/common/storage-account-overview) or a [OneLake lakehouse](search-how-to-index-onelake-files.md).
42-
43-
Azure Storage must be a standard performance (general-purpose v2) account. Access tiers can be hot, cool, and cold. Don't use Azure Data Lake Storage Gen2 (a storage account with a hierarchical namespace). This version of the wizard doesn't support Data Lake Storage Gen2.
39+
+ [Azure AI Search service](search-create-service-portal.md) in the same region as Azure AI. We recommend the Basic tier or higher.
4440

45-
+ For vectorization, have an [Azure AI services multiservice account](/azure/ai-services/multi-service-resource) or [Azure OpenAI Service](https://aka.ms/oai/access) endpoint with deployments.
41+
+ [Azure Blob Storage](/azure/storage/common/storage-account-overview) or a [OneLake lakehouse](search-how-to-index-onelake-files.md).
4642

47-
For [multimodal with Azure AI Vision](/azure/ai-services/computer-vision/how-to/image-retrieval), create an Azure AI service in SwedenCentral, EastUS, NorthEurope, WestEurope, WestUS, SoutheastAsia, KoreaCentral, FranceCentral, AustraliaEast, WestUS2, SwitzerlandNorth, or JapanEast. [Check the documentation](/azure/ai-services/computer-vision/how-to/image-retrieval?tabs=csharp) for an updated list.
43+
Azure Storage must be a standard performance (general-purpose v2) account. Access tiers can be hot, cool, and cold. Don't use Azure Data Lake Storage Gen2 (a storage account with a hierarchical namespace). This version of the wizard doesn't support Data Lake Storage Gen2.
4844

49-
You can also use an [Azure AI Studio model catalog](/azure/ai-studio/what-is-ai-studio) (and hub and project) with model deployments.
45+
+ An embedding model on a supported platform. [Deployment instructions](#set-up-embedding-models) are provided in this article.
5046

51-
+ For indexing and queries, have an [Azure AI Search service](search-create-service-portal.md). It must be in the same region as your Azure AI service. We recommend the Basic tier or higher.
47+
| Provider | Supported models |
48+
|---|---|
49+
| [Azure OpenAI Service](https://aka.ms/oai/access) | text-embedding-ada-002, text-embedding-3-large, or text-embedding-3-small. |
50+
| [Azure AI Studio model catalog](/azure/ai-studio/what-is-ai-studio) | Azure, Cohere, and Facebook embedding models. |
51+
| [Azure AI services multiservice account](/azure/ai-services/multi-service-resource) | [Azure AI Vision multimodal](/azure/ai-services/computer-vision/how-to/image-retrieval) for image and text vectorization. Azure AI Vision multimodal is available in selected regions: East US, West US, West US2, North Europe, West Europe, France Central, Sweden Central, Switzerland North, Southeast Asia, Korea Central, Australia East, or Japan East. [Check the documentation](/azure/ai-services/computer-vision/how-to/image-retrieval?tabs=csharp) for an updated list. |
5252

53-
+ Role assignments or API keys for connections to embedding models and data sources. This article provides instructions for role-based access control (RBAC).
53+
### Public endpoint requirements
5454

5555
All of the preceding resources must have public access enabled so that the portal nodes can access them. Otherwise, the wizard fails. After the wizard runs, you can enable firewalls and private endpoints on the integration components for security. For more information, see [Secure connections in the import wizards](search-import-data-portal.md#secure-connections).
5656

5757
If private endpoints are already present and you can't disable them, the alternative option is to run the respective end-to-end flow from a script or program on a virtual machine. The virtual machine must be on the same virtual network as the private endpoint. [Here's a Python code sample](https://github.com/Azure/azure-search-vector-samples/tree/main/demo-python/code/integrated-vectorization) for integrated vectorization. The same [GitHub repo](https://github.com/Azure/azure-search-vector-samples/tree/main) has samples in other programming languages.
5858

59-
A free search service supports RBAC on connections to Azure AI Search, but it doesn't support managed identities on outbound connections to Azure Storage or Azure AI Vision. This level of support means you must use key-based authentication on connections between a free search service and other Azure services. For connections that are more secure:
59+
### Role-based access control requirements
6060

61-
+ Use the Basic tier or higher.
62-
+ [Configure a managed identity](search-howto-managed-identities-data-sources.md) and role assignments to admit requests from Azure AI Search on other Azure services.
61+
We recommend role assignments for search service connections to other resources.
6362

64-
> [!NOTE]
65-
> If you can't progress through the wizard because options aren't available (for example, you can't select a data source or an embedding model), revisit the role assignments. Error messages indicate that models or deployments don't exist, when in fact the real problem is that the search service doesn't have permission to access them.
63+
1. On Azure AI Search, [enable roles](search-security-enable-roles.md).
6664

67-
## Check for space
65+
1. Configure your search service to [use a managed identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
6866

69-
If you're starting with the free service, you're limited to three indexes, three data sources, three skillsets, and three indexers. Make sure you have room for extra items before you begin. This quickstart creates one of each object.
67+
1. On your data source platform and embedding model provider, create role assignments that allow search service to access data and models. [Prepare sample data](#prepare-sample-data) provides instructions for setting up roles.
7068

71-
## Check for service identity
69+
A free search service supports RBAC on connections to Azure AI Search, but it doesn't support managed identities on outbound connections to Azure Storage or Azure AI Vision. This level of support means you must use key-based authentication on connections between a free search service and other Azure services.
7270

73-
We recommend role assignments for search service connections to other resources.
71+
For more secure connections:
7472

75-
1. On Azure AI Search, [enable RBAC](search-security-enable-roles.md).
73+
+ Use the Basic tier or higher.
74+
+ [Configure a managed identity](search-howto-managed-identities-data-sources.md) and use roles for authorized access.
75+
76+
> [!NOTE]
77+
> If you can't progress through the wizard because options aren't available (for example, you can't select a data source or an embedding model), revisit the role assignments. Error messages indicate that models or deployments don't exist, when in fact the real cause is that the search service doesn't have permission to access them.
7678
77-
1. Configure your search service to [use a system-assigned or user-assigned managed identity](search-howto-managed-identities-data-sources.md#create-a-system-managed-identity).
79+
### Check for space
7880

79-
In the following sections, you can assign the search service's managed identity to roles in other services. The sections provide steps for role assignments where applicable.
81+
If you're starting with the free service, you're limited to 3 indexes, data sources, skillsets, and indexers. Basic limits you to 15. Make sure you have room for extra items before you begin. This quickstart creates one of each object.
8082

81-
## Check for semantic ranking
83+
### Check for semantic ranking
8284

8385
The wizard supports semantic ranking, but only on the Basic tier and higher, and only if semantic ranking is already [enabled on your search service](semantic-how-to-enable-disable.md). If you're using a billable tier, check whether semantic ranking is enabled.
8486

0 commit comments

Comments
 (0)