Skip to content

Commit e99f6e2

Browse files
committed
Initial integrated vectorization improvements
1 parent 0136294 commit e99f6e2

File tree

3 files changed

+22
-6
lines changed

3 files changed

+22
-6
lines changed

app/backend/requirements.in

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ tiktoken
77
tenacity
88
azure-ai-documentintelligence
99
azure-cognitiveservices-speech
10-
azure-search-documents==11.6.0b1
10+
azure-search-documents==11.6.0b5
1111
azure-storage-blob
1212
azure-storage-file-datalake
1313
uvicorn

docs/data_ingestion.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -65,11 +65,13 @@ You can also remove individual documents by using the `--remove` flag. Open eith
6565

6666
## Overview of Integrated Vectorization
6767

68-
Azure AI search recently introduced an [integrated vectorization feature in preview mode](https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/announcing-the-public-preview-of-integrated-vectorization-in/ba-p/3960809#:~:text=Integrated%20vectorization%20is%20a%20new%20feature%20of%20Azure,pull-indexers%2C%20and%20vectorization%20of%20text%20queries%20through%20vectorizers). This feature is a cloud-based approach to data ingestion, which takes care of document format cracking, data extraction, chunking, vectorization, and indexing, all with Azure technologies.
68+
Azure AI Search includes an [integrated vectorization feature](https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/announcing-the-public-preview-of-integrated-vectorization-in/ba-p/3960809#:~:text=Integrated%20vectorization%20is%20a%20new%20feature%20of%20Azure,pull-indexers%2C%20and%20vectorization%20of%20text%20queries%20through%20vectorizers), a cloud-based approach to data ingestion. Integrated vectorization takes care of document format cracking, data extraction, chunking, vectorization, and indexing, all with Azure technologies.
6969

7070
See [this notebook](https://github.com/Azure/azure-search-vector-samples/blob/main/demo-python/code/integrated-vectorization/azure-search-integrated-vectorization-sample.ipynb) to understand the process of setting up integrated vectorization.
7171
We have integrated that code into our `prepdocs` script, so you can use it without needing to understand the details.
7272

73+
You must first explicitly [enable integrated vectorization](./deploy_features.md#enabling-integrated-vectorization) in the `azd` environment to use this feature.
74+
7375
This feature cannot be used on existing index. You need to create a new index or drop and recreate an existing index.
7476
In the newly created index schema, a new field 'parent_id' is added. This is used internally by the indexer to manage life cycle of chunks.
7577

docs/deploy_features.md

Lines changed: 18 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -192,12 +192,26 @@ Azure AI search recently introduced an [integrated vectorization feature in prev
192192

193193
To enable integrated vectorization with this sample:
194194

195-
1. If you've previously deployed, delete the existing search index.
196-
2. Run `azd env set USE_FEATURE_INT_VECTORIZATION true`
197-
3. Run `azd up` to update system and user roles
195+
1. If you've previously deployed, delete the existing search index. 🗑️
196+
2. To enable the use of integrated vectorization, run:
197+
198+
```shell
199+
azd env set USE_FEATURE_INT_VECTORIZATION true
200+
```
201+
202+
3. If you've already deployed your app, then you can run just the `provision` step:
203+
204+
```shell
205+
azd provision
206+
```
207+
208+
That will set up necessary RBAC roles and configure the integrated vectorization feature on your search service.
209+
210+
If you haven't deployed your app yet, then you should run the full `azd up` after configuring all optional features.
211+
198212
4. You can view the resources such as the indexer and skillset in Azure Portal and monitor the status of the vectorization process.
199213
200-
This feature is not currently compatible with GPT4-vision or the newer text-embedding-3 models.
214+
⚠️ This feature is not currently compatible with the [GPT vision integration](./gpt4v.md).
201215
202216
## Enabling authentication
203217

0 commit comments

Comments
 (0)