Skip to content

Commit bc0610b

Browse files
committed
more cleanup
1 parent cebcd36 commit bc0610b

File tree

2 files changed

+10
-13
lines changed

2 files changed

+10
-13
lines changed

articles/search/vector-search-how-to-query.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -133,9 +133,9 @@ api-key: {{admin-api-key}}
133133
}
134134
```
135135

136-
### [**2024-05-01-preview**](#tab/query-2024-05-01-preview)
136+
### [**2025-08-01-preview**](#tab/query-2025-08-01-preview)
137137

138-
[**2024-05-01-preview**](/rest/api/searchservice/search-service-api-versions#2024-05-01-preview) is the latest preview API version of [Search - POST](/rest/api/searchservice/documents/search-post?view=rest-searchservice-2024-05-01-preview&tabs=HTTP&preserve-view=true). It supports the same vector query syntax as **2025-09-01**, but it has extra parameters for hybrid search and minimum thresholds for excluding weaker results.
138+
[**2025-08-01-preview**](/rest/api/searchservice/search-service-api-versions#2025-08-01-preview) is the latest preview API version of [Search - POST](/rest/api/searchservice/documents/search-post?view=rest-searchservice-2025-08-01-preview&tabs=HTTP&preserve-view=true). It supports the same vector query syntax as **2025-09-01**, but it has extra parameters for hybrid search and minimum thresholds for excluding weaker results.
139139

140140
This preview adds:
141141

@@ -145,7 +145,7 @@ This preview adds:
145145
In the following example, the vector is a representation of this string: `"what Azure services support full text search"`. The query targets the `contentVector` field and returns `k` results. The actual vector has 1,536 embeddings, which are trimmed in this example for readability.
146146

147147
```http
148-
POST https://{{search-service-name}}.search.windows.net/indexes/{{index-name}}/docs/search?api-version=2024-05-01-preview
148+
POST https://{{search-service-name}}.search.windows.net/indexes/{{index-name}}/docs/search?api-version=2025-08-01-preview
149149
Content-Type: application/json
150150
api-key: {{admin-api-key}}
151151
{

articles/search/vector-search-how-to-storage-options.md

Lines changed: 7 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -88,22 +88,17 @@ Original full-precision vectors are used in rescoring operations over compressed
8888

8989
Rescoring requirements by quantization approach:
9090

91-
- Rscoring of scalar quantized vectors requires retention of the original full-precision vectors.
91+
- Rescoring of scalar quantized vectors requires retention of the original full-precision vectors.
9292

93-
- Rescoring of binary quantized vectors can use original full-precision vectors, or the dot product of the binary embedding, which produces high quality search results, without having to reference full-precision vectors in the index.
93+
- Rescoring of binary quantized vectors can use either the original full-precision vectors, or the dot product of the binary embedding, which produces high quality search results, without having to reference full-precision vectors in the index.
9494

95-
The `rescoreStorageMethod` property controls whether full-precision vectors are stored.
96-
97-
Recommendations:
95+
Rescoring recommendations:
9896

9997
- For scalar quantization, preserve original full-precision vectors in the index because they're required for rescore.
10098

101-
- For binary quantization, preserve original full-precision vectors for the highest quality of rescoring, or discard full-precision vectors if you want to rescore based on the dot product of the binary embeddings.
102-
103-
> [!NOTE]
104-
> Vector storage strategies have been evolving over the last several releases. Index creation date and API version determine your storage options. For example, in the 2024-11-01-preview, if you set discardOriginals to remove full-precision vectors, there was no rescoring for binary quantization because the dot product approach wasn't available. We recommend using the latest APIs for the best mitigation options.
99+
- For binary quantization, either preserve original full-precision vectors for the highest quality of rescoring, or discard full-precision vectors if you want to rescore based on the dot product of the binary embeddings.
105100

106-
In `vectorSearch.compressions`, the `rescoreStorageMethod` property is set to `preserveOriginals` by default, which retains full-precision vectors for [oversampling and rescoring capabilities](vector-search-how-to-quantization.md#add-compressions-to-a-search-index) to reduce the effect of lossy compression on the HNSW graph. If you don't need rescoring, of if you used binary quantization and the dot product for rescoring, you can reduce vector storage by setting `rescoreStorageMethod` to `discardOriginals`.
101+
The `rescoreStorageMethod` property controls whether full-precision vectors are stored. In `vectorSearch.compressions`, the `rescoreStorageMethod` property is set to `preserveOriginals` by default, which retains full-precision vectors for [oversampling and rescoring capabilities](vector-search-how-to-quantization.md#add-compressions-to-a-search-index) to reduce the effect of lossy compression on the HNSW graph. If you don't need rescoring, of if you used binary quantization and the dot product for rescoring, you can reduce vector storage by setting `rescoreStorageMethod` to `discardOriginals`.
107102

108103
> [!IMPORTANT]
109104
> Setting the `rescoreStorageMethod` property is irreversible and can adversely affect search quality, although the degree depends on the compression method and any mitigations you apply.
@@ -177,3 +172,5 @@ To set this property:
177172
}
178173
}
179174
```
175+
> [!NOTE]
176+
> Vector storage strategies have been evolving over the last several releases. Index creation date and API version determine your storage options. For example, in the 2024-11-01-preview, if you set `discardOriginals` to remove full-precision vectors, there was no rescoring for binary quantization because the dot product approach wasn't available. We recommend using the latest APIs for the best mitigation options.

0 commit comments

Comments
 (0)