Commit c3bfffe

Merge pull request #284711 from HeidiSteen/heidist-august
[azure search] Robert's feedback on stored false effect on reindexing
2 parents 5ef39c2 + 54499a8 commit c3bfffe

2 files changed: +11 -5 lines changed


articles/search/search-howto-reindex.md

Lines changed: 4 additions & 2 deletions
@@ -9,7 +9,7 @@ ms.author: heidist
 
 ms.service: cognitive-search
 ms.topic: how-to
-ms.date: 07/01/2024
+ms.date: 08/14/2024
 ---
 
 # Update or rebuild an index in Azure AI Search
@@ -43,10 +43,12 @@ Queries continue to run, but if you're updating or removing existing fields, you
 
 + [Indexers automate incremental indexing](search-indexer-overview.md). If you can use an indexer, and if the data source supports change tracking, you can run the indexer on a recurring schedule to add, update, or overwrite searchable content so that it's synchronized to your external data.
 
-+ If you're making index calls directly, use `mergeOrUpload` as the search action.
++ If you're making index calls directly through the [push API](search-what-is-data-import.md#pushing-data-to-an-index), use `mergeOrUpload` as the search action.
 
 + The payload must include the keys or identifiers of every document you want to add, update, or delete.
 
++ If your index includes vector fields and you set the [`stored` property to false](vector-search-how-to-configure-compression-storage.md#option-3-set-the-stored-property-to-remove-retrievable-storage), make sure you provide the vector in your partial document update, even if the value is unchanged. A side effect of setting `stored` to false is that vectors are dropped on a reindexing operation. Providing the vector in the documents payload prevents this from happening.
+
 + To update the contents of simple fields and subfields in complex types, list only the fields you want to change. For example, if you only need to update a description field, the payload should consist of the document key and the modified description. Omitting other fields retains their existing values.
 
 + To merge inline changes into string collection, provide the entire value. Recall the `tags` field example from the previous section. New values overwrite the old values for an entire field, and there's no merging within the content of a field.
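
Reviewer note: the collection rule in the last bullet above can be sketched as follows. Because new values overwrite the whole field, the old and new tags have to be combined on the client before sending. This is a minimal sketch, not code from the commit: the helper name `tags_update_action`, the key field `id`, and the sample tag values are assumptions for illustration. The `value` array and `@search.action` property follow the push API request body format.

```python
import json


def tags_update_action(doc_key, existing_tags, new_tags, key_field="id"):
    """Build one mergeOrUpload action that replaces the whole tags collection.

    Hypothetical helper: the service does not merge inside a collection,
    so the full list of old plus new values is sent.
    """
    combined = list(existing_tags)
    for tag in new_tags:
        if tag not in combined:  # keep order, skip duplicates
            combined.append(tag)
    return {"@search.action": "mergeOrUpload", key_field: doc_key, "tags": combined}


# Only the key and the changed field are listed; omitted fields keep their values.
action = tags_update_action("1", ["budget", "pool"], ["pool", "view"])
body = {"value": [action]}
print(json.dumps(body, indent=2))
```

Sending only `["view"]` in `tags` would replace the existing collection rather than extend it, which is why the helper takes the existing values as input.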

articles/search/vector-search-how-to-configure-compression-storage.md

Lines changed: 7 additions & 3 deletions
@@ -7,7 +7,7 @@ author: heidisteen
 ms.author: heidist
 ms.service: cognitive-search
 ms.topic: how-to
-ms.date: 08/05/2024
+ms.date: 08/14/2024
 ---
 
 # Reduce vector size through quantization, narrow data types, and storage options
@@ -40,7 +40,7 @@ Quantization applies to vector fields receiving float-type vectors. In the examp
 
 Two types of quantization are supported:
 
-- Scalar quantization compresses floats into narrower data types. AI Search currently supports int8, which is 8 bits, reducing vector index size fourfold.
+- Scalar quantization compresses float values into narrower data types. AI Search currently supports int8, which is 8 bits, reducing vector index size fourfold.
 
 - Binary quantization converts floats into binary bits, which takes up 1 bit. This results in up to 28 times reduced vector index size.
 
@@ -228,7 +228,11 @@ An easy way to reduce vector size is to store embeddings in a smaller data forma
 
 The `stored` property is a boolean on a vector field definition that determines whether storage is allocated for retrievable vector field content. The `stored` property is true by default. If you don't need vector content in a query response, you can save up to 50 percent storage per field by setting `stored` to false.
 
-When evaluating whether to set this property, consider whether you need vectors in the response. Because vectors aren't human readable, they can be omitted in a query response that's rendered on a search page. Keep them, however, if you're using vectors in downstream process that consumes vector content.
+Considerations for setting `stored` to false:
+
+- Because vectors aren't human readable, you can omit them from results sent to LLMs in RAG scenarios, and from results that are rendered on a search page. Keep them, however, if you're using vectors in a downstream process that consumes vector content.
+
+- However, if your indexing strategy includes [partial document updates](search-howto-reindex.md#update-content), such as "merge" or "mergeOrUpload" on a document, be aware that setting `stored` to false will cause vectors in the non-stored field to be omitted during the merge. On each "merge" or "mergeOrUpload" operation, you must provide the vector fields in addition to other nonvector fields that you're updating, or the vector will be dropped.
 
 Remember that the `stored` attribution is irreversible. It's set during index creation on vector fields when physical data structures are created. If you want retrievable vector content later, you must drop and rebuild the index, or create and load a new field that has the new attribution.
 
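
Reviewer note: the partial-update rule added in this commit (resend the vector on every "merge" or "mergeOrUpload" when `stored` is false) might look like the sketch below. It is an illustration under stated assumptions, not the documented sample: the helper `build_merge_action` and the field names `id`, `description`, and `descriptionVector` are hypothetical, and the three-element vector stands in for a real embedding. Only the `value` array and the `@search.action` property come from the push API request format.

```python
import json


def build_merge_action(key_field, doc_key, changed_fields, vector_fields):
    """Return one mergeOrUpload action that always carries the vector fields.

    Hypothetical helper: it refuses to build an action without vectors,
    because a non-stored vector left out of the payload is dropped on merge.
    """
    if not vector_fields:
        raise ValueError("resend every non-stored vector field, even if unchanged")
    action = {"@search.action": "mergeOrUpload", key_field: doc_key}
    action.update(changed_fields)  # only the nonvector fields being modified
    action.update(vector_fields)   # unchanged vectors are resent so they survive the merge
    return action


payload = {
    "value": [
        build_merge_action(
            key_field="id",
            doc_key="42",
            changed_fields={"description": "Renovated in 2024"},
            vector_fields={"descriptionVector": [0.12, -0.03, 0.57]},
        )
    ]
}
print(json.dumps(payload, indent=2))
```

Because a non-stored vector can't be read back from the index, the caller would need to keep the original embedding (or be able to regenerate it) in order to resend it.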