You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: content/develop/ai/search-and-query/vectors/_index.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -161,7 +161,7 @@ Choose the `SVS-VAMANA` index type when all of the following requirements apply:
161
161
|`LEANVEC_DIM`| The dimension used when using `LeanVec4x8` or `LeanVec8x8` compression for dimensionality reduction. If a value is provided, it should be less than `DIM`. Lowering it can speed up search and reduce memory use. |`DIM / 2`|
162
162
163
163
{{< warning >}}
164
-
Intel's proprietary LVQ and LeanVec optimizations are not available on Redis Open Source. On non-Intel platforms and Redis Open Source platforms, `SVS-VAMANA` with `COMPRESSION` will fall back to Intel’s basic, 8-bit scalar quantization implementation: all values in a vector are scaled using the global minimum and maximum, and then each dimension is quantized independently into 256 levels using 8-bit precision.
164
+
Some advanced vector compression features may depend on hardware or Intel's proprietary optimizations. Intel's proprietary LVQ and LeanVec optimizations are not available in Redis Open Source. On non-Intel platforms and Redis Open Source platforms, `SVS-VAMANA` with `COMPRESSION` will fall back to basic, 8-bit scalar quantization implementation: all values in a vector are scaled using the global minimum and maximum, and then each dimension is quantized independently into 256 levels using 8-bit precision.
Copy file name to clipboardExpand all lines: content/develop/ai/search-and-query/vectors/svs-compression.md
+14-12Lines changed: 14 additions & 12 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,6 @@
1
1
---
2
+
aliases:
3
+
2
4
categories:
3
5
- docs
4
6
- develop
@@ -9,23 +11,23 @@ categories:
9
11
- oss
10
12
- kubernetes
11
13
- clients
12
-
description: Vector compression and quantization for efficient memory usage and search performance
13
-
linkTitle: Vector Compression & Quantization
14
-
title: Vector Compression and Quantization
14
+
description: Vector quantization and compression for efficient memory usage and search performance
15
+
linkTitle: Quantization and compression
16
+
title: Vector quantization and compression
15
17
weight: 2
16
18
---
17
19
18
-
Efficient management of high-dimensional vector data is crucial for scalable search and retrieval. Advanced methods for vector compression and quantization—such as LVQ (Locally-Adaptive Vector Quantization) and LeanVec—can dramatically optimize memory usage and improve search speed, without sacrificing too much accuracy. This page describes practical approaches to compressing and quantizing vectors for scalable search.
20
+
Efficient management of high-dimensional vector data is crucial for scalable search and retrieval. Advanced methods for vector quantization and compression, such as LVQ (Locally-adaptive Vector Quantization) and LeanVec, can dramatically optimize memory usage and improve search speed, without sacrificing much accuracy. This page describes practical approaches to quantizing and compressing vectors for scalable search.
19
21
20
22
{{< warning >}}
21
-
Some advanced vector compression features may depend on hardware or Intel's proprietary optimizations. Intel's proprietary LVQ and LeanVec optimizations are not available on Redis Open Source. On non-Intel platforms and Redis Open Source platforms, `SVS-VAMANA` with `COMPRESSION` will fall back to basic, 8-bit scalar quantization implementation: all values in a vector are scaled using the global minimum and maximum, and then each dimension is quantized independently into 256 levels using 8-bit precision.
23
+
Some advanced vector compression features may depend on hardware or Intel's proprietary optimizations. Intel's proprietary LVQ and LeanVec optimizations are not available in Redis Open Source. On non-Intel platforms and Redis Open Source platforms, `SVS-VAMANA` with `COMPRESSION` will fall back to basic, 8-bit scalar quantization implementation: all values in a vector are scaled using the global minimum and maximum, and then each dimension is quantized independently into 256 levels using 8-bit precision.
22
24
{{< /warning >}}
23
25
24
-
## Compression and Quantization Techniques
26
+
## Quantization and compression techniques
25
27
26
-
### LVQ (Locally-Adaptive Vector Quantization)
28
+
### LVQ (Locally-adaptive Vector Quantization)
27
29
28
-
***Method:** Applies per-vector normalization and scalar quantization, learning parameters directly from the data.
30
+
***Method:** Applies per-vector normalization and scalar quantization; learns parameters directly from the data.
29
31
***Advantages:**
30
32
* Enables fast, on-the-fly distance computations.
31
33
* SIMD-optimized layout for efficient search.
@@ -35,7 +37,7 @@ Some advanced vector compression features may depend on hardware or Intel's prop
35
37
***LVQ8:** Faster ingestion, slower search.
36
38
***LVQ4x8:** Two-level quantization for improved recall.
37
39
38
-
### LeanVec
40
+
### LeanVec (LVQ with dimensionality reduction)
39
41
40
42
***Method:** Combines dimensionality reduction with LVQ, applying quantization after reducing vector dimensions.
41
43
***Advantages:**
@@ -44,11 +46,11 @@ Some advanced vector compression features may depend on hardware or Intel's prop
44
46
***Variants:**
45
47
***LeanVec4x8:** Recommended for high-dimensional datasets, fastest search and ingestion.
46
48
***LeanVec8x8:** Improved recall when more granularity is needed.
47
-
***LeanVec Dimension:** For faster search and lower memory usage, reduce the dimension further by using the optional `REDUCE` argument. The default is typically `input dimension / 2`, but more aggressive reduction (such as `dimension / 4`) is possible for greater efficiency.
49
+
***LeanVec Dimension:** For faster search and lower memory usage, reduce the dimension further by using the optional `REDUCE` argument. The default is typically `input dimension / 2`, but more aggressive reduction (such as `input dimension / 4`) is possible for greater efficiency.
0 commit comments