articles/search/search-howto-reindex.md
Lines changed: 4 additions & 2 deletions
@@ -9,7 +9,7 @@ ms.author: heidist
 
 ms.service: cognitive-search
 ms.topic: how-to
-ms.date: 07/01/2024
+ms.date: 08/14/2024
 ---
 
 # Update or rebuild an index in Azure AI Search
@@ -43,10 +43,12 @@ Queries continue to run, but if you're updating or removing existing fields, you
 
 +[Indexers automate incremental indexing](search-indexer-overview.md). If you can use an indexer, and if the data source supports change tracking, you can run the indexer on a recurring schedule to add, update, or overwrite searchable content so that it's synchronized to your external data.
 
-+ If you're making index calls directly, use `mergeOrUpload` as the search action.
++ If you're making index calls directly through the [push API](search-what-is-data-import.md#pushing-data-to-an-index), use `mergeOrUpload` as the search action.
 
 + The payload must include the keys or identifiers of every document you want to add, update, or delete.
 
++ If your index includes vector fields and you set the [`stored` property to false](vector-search-how-to-configure-compression-storage.md#option-3-set-the-stored-property-to-remove-retrievable-storage), make sure you provide the vector in your partial document update, even if the value is unchanged. A side effect of setting `stored` to false is that vectors are dropped on a reindexing operation. Providing the vector in the documents payload prevents this from happening.
+
 + To update the contents of simple fields and subfields in complex types, list only the fields you want to change. For example, if you only need to update a description field, the payload should consist of the document key and the modified description. Omitting other fields retains their existing values.
 
 + To merge inline changes into string collection, provide the entire value. Recall the `tags` field example from the previous section. New values overwrite the old values for an entire field, and there's no merging within the content of a field.
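
Reviewer note: the hunk above says a `mergeOrUpload` payload must carry the document key, only the fields being changed, and the vector for any field whose `stored` property is false. A minimal Python sketch of such a payload follows. It only builds the JSON body and makes no network calls. The helper `build_merge_batch` and the field names `HotelId`, `Description`, and `DescriptionVector` are hypothetical and chosen for illustration; the `value` array and `@search.action` key follow the REST documents-index request shape.

```python
import json


def build_merge_batch(updates, key_field, unstored_vector_fields=()):
    """Build a request body that applies mergeOrUpload to every document.

    Raises ValueError if a document lacks its key, or lacks a vector field
    that the caller has flagged as non-stored (hypothetical safety check).
    """
    actions = []
    for doc in updates:
        if key_field not in doc:
            raise ValueError(f"document is missing key field {key_field!r}")
        missing = [name for name in unstored_vector_fields if name not in doc]
        if missing:
            raise ValueError(
                f"document {doc[key_field]!r} omits non-stored vector fields: {missing}"
            )
        # Only the fields present in `doc` are sent; omitted fields keep their values.
        actions.append({"@search.action": "mergeOrUpload", **doc})
    return {"value": actions}


batch = build_merge_batch(
    [
        {
            "HotelId": "1",  # document key
            "Description": "Updated description",  # the field being changed
            "DescriptionVector": [0.1, 0.2, 0.3],  # resent even though unchanged
        }
    ],
    key_field="HotelId",
    unstored_vector_fields=["DescriptionVector"],
)
print(json.dumps(batch, indent=2))
```

The check on `unstored_vector_fields` is a design choice for this sketch, not something the service enforces: it fails fast on the client rather than letting a merge silently drop a vector.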
articles/search/vector-search-how-to-configure-compression-storage.md
Lines changed: 7 additions & 3 deletions
@@ -7,7 +7,7 @@ author: heidisteen
 ms.author: heidist
 ms.service: cognitive-search
 ms.topic: how-to
-ms.date: 08/05/2024
+ms.date: 08/14/2024
 ---
 
 # Reduce vector size through quantization, narrow data types, and storage options
@@ -40,7 +40,7 @@ Quantization applies to vector fields receiving float-type vectors. In the examp
 
 Two types of quantization are supported:
 
-- Scalar quantization compresses floats into narrower data types. AI Search currently supports int8, which is 8 bits, reducing vector index size fourfold.
+- Scalar quantization compresses float values into narrower data types. AI Search currently supports int8, which is 8 bits, reducing vector index size fourfold.
 
 - Binary quantization converts floats into binary bits, which takes up 1 bit. This results in up to 28 times reduced vector index size.
 
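
Reviewer note: the "fourfold" figure in the hunk above can be checked with simple arithmetic, assuming the source vectors are 32-bit floats (an assumption; the text only says "float-type"). The sketch below computes raw per-vector sizes only. The 1536-dimension figure is a hypothetical example, and the helper `raw_vector_bytes` is not part of any SDK.

```python
def raw_vector_bytes(dimensions, bits_per_component):
    """Raw size in bytes of one vector, ignoring any index overhead."""
    return dimensions * bits_per_component // 8


DIMENSIONS = 1536  # hypothetical embedding size

float32_bytes = raw_vector_bytes(DIMENSIONS, 32)  # assumed source type
int8_bytes = raw_vector_bytes(DIMENSIONS, 8)      # scalar quantization
binary_bytes = raw_vector_bytes(DIMENSIONS, 1)    # binary quantization

print(float32_bytes / int8_bytes)    # raw reduction for int8
print(float32_bytes / binary_bytes)  # raw reduction for binary
```

Under these assumptions the raw ratio for int8 is 4, which matches the text. The raw ratio for 1-bit components is 32, while the text quotes "up to 28 times" for the vector index as a whole; the gap presumably reflects index overhead that this sketch does not model.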
@@ -228,7 +228,11 @@ An easy way to reduce vector size is to store embeddings in a smaller data forma
 
 The `stored` property is a boolean on a vector field definition that determines whether storage is allocated for retrievable vector field content. The `stored` property is true by default. If you don't need vector content in a query response, you can save up to 50 percent storage per field by setting `stored` to false.
 
-When evaluating whether to set this property, consider whether you need vectors in the response. Because vectors aren't human readable, they can be omitted in a query response that's rendered on a search page. Keep them, however, if you're using vectors in downstream process that consumes vector content.
+Considerations for setting `stored` to false:
+
+- Because vectors aren't human readable, you can omit them from results sent to LLMs in RAG scenarios, and from results that are rendered on a search page. Keep them, however, if you're using vectors in a downstream process that consumes vector content.
+
+- However, if your indexing strategy includes [partial document updates](search-howto-reindex.md#update-content), such as "merge" or "mergeOrUpload" on a document, be aware that setting `stored` to false will cause vectors in the non-stored field to be omitted during the merge. On each "merge" or "mergeOrUpload" operation, you must provide the vector fields in addition to other nonvector fields that you're updating, or the vector will be dropped.
 
 Remember that the `stored` attribution is irreversible. It's set during index creation on vector fields when physical data structures are created. If you want retrievable vector content later, you must drop and rebuild the index, or create and load a new field that has the new attribution.
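
Reviewer note: because `stored` is fixed at index creation, it may help to know which fields are affected before writing any partial updates. The following Python sketch scans an index definition, held as a plain dictionary, for vector fields with `stored` set to false. The index name, field names, and profile name are hypothetical, and treating any field that declares `dimensions` as a vector field is an assumption made for this sketch.

```python
def non_stored_vector_fields(index_definition):
    """Return names of vector fields whose `stored` property is false.

    `stored` defaults to true when absent, per the text above.
    """
    names = []
    for field in index_definition.get("fields", []):
        is_vector = "dimensions" in field  # assumption: only vector fields set this
        if is_vector and field.get("stored", True) is False:
            names.append(field["name"])
    return names


# Hypothetical index definition used only for illustration.
index_definition = {
    "name": "hotels-sample",
    "fields": [
        {"name": "HotelId", "type": "Edm.String", "key": True},
        {"name": "Description", "type": "Edm.String", "searchable": True},
        {
            "name": "DescriptionVector",
            "type": "Collection(Edm.Single)",
            "dimensions": 1536,
            "retrievable": False,
            "stored": False,
            "vectorSearchProfile": "my-profile",
        },
    ],
}

print(non_stored_vector_fields(index_definition))
```

Any field this returns has to be resent in full on every "merge" or "mergeOrUpload" call that touches the document.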