Sq8 dist functions L2 [MOD-13169] #877

dor-forer · 2026-01-05T08:45:04Z

Describe the changes in the pull request

Added support to SQ8 to SQ8 L2 spaces.
Intel:
AVX512_F_BW_VL_VNNI - needs it to support uint8 and in8 operations.
ARM:
SVE
NEON

The cosine functions assume that the vectors are normlized therefore don't divide by the norm.
Which issues this PR fixes

MOD-13169

Main objects this PR modified

...
...

Mark if applicable

This PR introduces API changes
This PR introduces serialization changes

- Implemented inner product and cosine distance functions for SQ8-to-SQ8 vectors in SVE, NEON, and AVX512 architectures. - Added corresponding distance function selection logic in IP_space.cpp and function headers in IP_space.h. - Created benchmarks for SQ8-to-SQ8 distance functions to evaluate performance across different architectures. - Developed unit tests to validate the correctness of the new distance functions against expected results. - Ensured compatibility with existing optimization features for various CPU architectures.

…mproving performance

…ions

… SQ8-to-SQ8 calculations

… NEON and AVX512 headers

… function

- Implemented NEON, SVE, and AVX512F optimized functions for calculating L2 squared distance between SQ8 (scalar quantized 8-bit) vectors. - Introduced helper functions for processing vector elements using NEON and SVE intrinsics. - Updated L2_space.cpp and L2_space.h to include new distance function for SQ8-to-SQ8. - Enhanced AVX512F, NEON, and SVE function selectors to choose the appropriate implementation based on CPU features. - Added unit tests to validate the correctness of the new L2 squared distance functions. - Updated benchmark tests to include performance measurements for the new implementations.

…ocumentation accordingly

…tance assertion tolerance

…onsistency

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

…tions

…ation

… using AVX512 VNNI; add benchmarks and tests for new functionality

…pulation

…VE, and AVX512; add corresponding selection functions and update tests for consistency.

…update benchmarks and tests for new functionality

- Updated distance function declarations in IP_space.h to clarify that SQ8-to-SQ8 functions use precomputed sum/norm. - Removed precomputed distance function implementations for AVX512F, NEON, and SVE architectures from their respective source files. - Adjusted benchmark tests to remove references to precomputed distance functions and ensure they utilize the updated quantization methods. - Modified utility functions to support the creation of SQ8 quantized vectors with precomputed sum and norm. - Updated unit tests to reflect changes in the quantization process and removed tests specifically for precomputed distance functions.

…nsistency - Updated include paths in AVX512F_BW_VL_VNNI.cpp to reflect new naming conventions. - Modified unit tests in test_spaces.cpp to streamline vector initialization and quantization processes. - Replaced repetitive code with utility functions for populating and quantizing vectors. - Enhanced assertions in tests to ensure optimized distance functions are correctly chosen and validated. - Removed unnecessary parameters from utility functions to simplify their interfaces. - Improved test coverage for edge cases, including zero and constant vectors, ensuring accuracy across various scenarios.

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

…proved clarity

…ions

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

…d accuracy

… ARM architecture

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

…to dorer-sq8-dist-functions-l2

…implementation

Copilot

Pull request overview

This PR implements L2 squared distance functions for SQ8-to-SQ8 vector comparisons (where both vectors are scalar quantized to 8-bit integers). The implementation leverages the mathematical identity ||x - y||² = ||x||² + ||y||² - 2*IP(x, y) to efficiently compute L2 distance by reusing existing inner product implementations.

Key changes:

Added SQ8-to-SQ8 L2 distance functions with SIMD optimizations for multiple architectures (SVE, SVE2, NEON, NEON_DOTPROD, AVX512)
Refactored SQ8_SQ8_InnerProduct to extract a common implementation function that returns raw inner product values, enabling reuse for L2 calculations
Added comprehensive unit tests and edge case tests for the new L2 functionality
Updated benchmarks to include L2 distance measurements for SQ8-to-SQ8 operations

Reviewed changes

Copilot reviewed 23 out of 23 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
src/VecSim/spaces/L2_space.h	Added declaration for L2_SQ8_SQ8_GetDistFunc dispatcher
src/VecSim/spaces/L2_space.cpp	Implemented L2_SQ8_SQ8_GetDistFunc to select appropriate L2 implementation based on CPU features
src/VecSim/spaces/L2/L2.h	Added SQ8_SQ8_L2Sqr function declaration
src/VecSim/spaces/L2/L2.cpp	Implemented base SQ8_SQ8_L2Sqr using the L2 identity formula
src/VecSim/spaces/L2/L2_SVE_SQ8_SQ8.h	Added SVE-optimized L2 implementation
src/VecSim/spaces/L2/L2_NEON_SQ8_SQ8.h	Added NEON-optimized L2 implementation
src/VecSim/spaces/L2/L2_NEON_DOTPROD_SQ8_SQ8.h	Added NEON DOTPROD-optimized L2 implementation
src/VecSim/spaces/L2/L2_AVX512F_BW_VL_VNNI_SQ8_SQ8.h	Added AVX512-optimized L2 implementation
src/VecSim/spaces/functions/SVE.h	Added Choose_SQ8_SQ8_L2_implementation_SVE declaration
src/VecSim/spaces/functions/SVE.cpp	Implemented SVE L2 chooser function
src/VecSim/spaces/functions/SVE2.h	Added Choose_SQ8_SQ8_L2_implementation_SVE2 declaration
src/VecSim/spaces/functions/SVE2.cpp	Implemented SVE2 L2 chooser function
src/VecSim/spaces/functions/NEON.h	Added Choose_SQ8_SQ8_L2_implementation_NEON declaration
src/VecSim/spaces/functions/NEON.cpp	Implemented NEON L2 chooser function
src/VecSim/spaces/functions/NEON_DOTPROD.h	Added Choose_SQ8_SQ8_L2_implementation_NEON_DOTPROD declaration
src/VecSim/spaces/functions/NEON_DOTPROD.cpp	Implemented NEON_DOTPROD L2 chooser function
src/VecSim/spaces/functions/AVX512F_BW_VL_VNNI.h	Added Choose_SQ8_SQ8_L2_implementation_AVX512F_BW_VL_VNNI declaration
src/VecSim/spaces/functions/AVX512F_BW_VL_VNNI.cpp	Implemented AVX512 L2 chooser function
src/VecSim/spaces/IP/IP.h	Added SQ8_SQ8_InnerProduct_Impl declaration for shared implementation
src/VecSim/spaces/IP/IP.cpp	Refactored SQ8_SQ8_InnerProduct to extract common implementation returning raw inner product
tests/utils/tests_utils.h	Added SQ8_SQ8_NotOptimized_L2Sqr helper for testing non-optimized L2 calculation
tests/unit/test_spaces.cpp	Added comprehensive L2 tests including optimization tests and edge cases (self-distance, symmetry, zero/constant vectors, extreme values)
tests/benchmark/spaces_benchmarks/bm_spaces_sq8_sq8.cpp	Updated benchmarks to include L2 measurements alongside existing IP benchmarks

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/VecSim/spaces/L2_space.cpp

Copilot

Pull request overview

Copilot reviewed 23 out of 23 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/benchmark/spaces_benchmarks/bm_spaces_sq8_sq8.cpp

…rics

codecov · 2026-01-05T12:56:34Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 97.06%. Comparing base (4925862) to head (3b38d8e).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #877      +/-   ##
==========================================
+ Coverage   97.05%   97.06%   +0.01%     
==========================================
  Files         127      128       +1     
  Lines        7560     7586      +26     
==========================================
+ Hits         7337     7363      +26     
  Misses        223      223

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

meiravgri · 2026-01-06T09:06:15Z

src/VecSim/spaces/L2/L2.cpp

+
+    // Get precomputed sum of squares from both vectors
+    // Layout: [uint8_t values (dim)] [min_val] [delta] [sum] [sum_of_squares]
+    const float sum_sq_1 = *reinterpret_cast<const float *>(pVect1 + dimension + 3 * sizeof(float));


We should have a macro/ enum of the metadata indexes instead of hardcoding them

I was just thinking about it.
Maybe I will add it in the renaming pr

dor-forer added 30 commits December 28, 2025 09:37

Add SQ8-to-SQ8 benchmark tests and update related scripts

8697a3e

Format

e0ce268

Orgnizing

ab6b077

Add full sq8 bencharks

931e339

Optimize the sq8 sq8

a56474d

Optimize SQ8 distance functions for NEON by reducing operations and i…

a25f45c

…mproving performance

format

0ad941e

Add NEON DOTPROD-optimized distance functions for SQ8-to-SQ8 calculat…

68cd068

…ions

PR

0b4b568

Remove NEON DOTPROD-optimized distance functions for INT8, UINT8, and…

d0fd2e4

… SQ8-to-SQ8 calculations

Fix vector layout documentation by removing inv_norm from comments in…

9de6163

… NEON and AVX512 headers

Remove 'constexpr' from ones vector declaration in NEON inner product…

63a46a1

… function

Change the name

5bef023

Add full range tests for SQ8 distance functions with SIMD optimizations

72053af

Refactor distance functions to remove inv_norm parameter and update d…

525f8da

…ocumentation accordingly

Update SQ8 Cosine test to normalize both input vectors and adjust dis…

13a477b

…tance assertion tolerance

Rename 'compressed' to 'quantized' in SQ8 functions for clarity and c…

c18000e

…onsistency

Merge branch 'dorer-sq8-dist-functions-ip-cosine' of https://github.c…

b58f8ef

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

Rename 'compressed' to 'quantized' in SQ8 distance tests for clarity

286990a

Refactor quantization function to remove unused normalization calcula…

8cdc3fc

…tions

Add TODO to store vector's norm and sum in L2 squared distance calcul…

189290e

…ation

Implement SQ8-to-SQ8 distance functions with precomputed sum and norm…

bbf810e

… using AVX512 VNNI; add benchmarks and tests for new functionality

Add edge case tests for SQ8-to-SQ8 precomputed cosine distance functions

dbbb7d9

Refactor SQ8 test cases to use CreateSQ8QuantizedVector for vector po…

36ab068

…pulation

Implement SQ8-to-SQ8 precomputed distance functions using ARM NEON, S…

00617d7

…VE, and AVX512; add corresponding selection functions and update tests for consistency.

Implement SQ8-to-SQ8 precomputed inner product and cosine functions; …

4331d91

…update benchmarks and tests for new functionality

dor-forer added 12 commits January 4, 2026 15:42

Merge branch 'dorer-sq8-dist-functions-ip-cosine' of https://github.c…

a0796db

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

Refactor SQ8 cosine function to utilize inner product function for im…

a4ff5d0

…proved clarity

Remove redundant inner product edge case tests for SQ8 distance funct…

c22158f

…ions

Add SVE2 support to SQ8-to-SQ8 Inner Product distance function

4c19d9e

Merge branch 'dorer-sq8-dist-functions-ip-cosine' of https://github.c…

e2ad287

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

Fix SQ8_Cosine to call the correct inner product function for improve…

668315b

…d accuracy

Remove SVE2 and other optimizations from SQ8 cosine function test for…

5c22af8

… ARM architecture

Merge branch 'dorer-sq8-dist-functions-ip-cosine' of https://github.c…

ad515ba

…om/RedisAI/VectorSimilarity into dorer-sq8-dist-functions-l2

Merge branch 'main' of https://github.com/RedisAI/VectorSimilarity in…

695bbc0

…to dorer-sq8-dist-functions-l2

Add L2 distance function without optimizations for testing purposes

cae2dd6

Refactor L2 distance function and update test assertions for precision

b2506b9

Update L2 squared distance functions to support 64 residuals in NEON …

59784db

…implementation

dor-forer requested a review from Copilot January 5, 2026 08:45

Copilot started reviewing on behalf of dor-forer January 5, 2026 08:45 View session

Copilot AI reviewed Jan 5, 2026

View reviewed changes

src/VecSim/spaces/L2_space.cpp Outdated Show resolved Hide resolved

src/VecSim/spaces/L2_space.cpp Outdated Show resolved Hide resolved

Refactor L2 distance function conditions for NEON optimizations

8d24786

dor-forer requested a review from Copilot January 5, 2026 12:32

Copilot started reviewing on behalf of dor-forer January 5, 2026 12:32 View session

Adjust NEON_DOTPROD benchmark initialization to use a dimension of 16

0dde4d5

dor-forer marked this pull request as ready for review January 5, 2026 12:36

Copilot AI reviewed Jan 5, 2026

View reviewed changes

tests/benchmark/spaces_benchmarks/bm_spaces_sq8_sq8.cpp Outdated Show resolved Hide resolved

dor-forer added the bm-spaces-sq8-full label Jan 5, 2026

dor-forer requested a review from meiravgri January 5, 2026 12:37

Update NEON benchmarks to support 64 dimensions for L2 and Cosine met…

3b38d8e

…rics

meiravgri approved these changes Jan 6, 2026

View reviewed changes

dor-forer added this pull request to the merge queue Jan 6, 2026

Merged via the queue into main with commit df8dbe2 Jan 6, 2026
24 checks passed

dor-forer deleted the dorer-sq8-dist-functions-l2 branch January 6, 2026 09:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sq8 dist functions L2 [MOD-13169] #877

Sq8 dist functions L2 [MOD-13169] #877

Uh oh!

dor-forer commented Jan 5, 2026 •

edited by atlassian bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

codecov bot commented Jan 5, 2026 •

edited

Loading

Uh oh!

meiravgri Jan 6, 2026

Uh oh!

dor-forer Jan 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Sq8 dist functions L2 [MOD-13169] #877

Sq8 dist functions L2 [MOD-13169] #877

Uh oh!

Conversation

dor-forer commented Jan 5, 2026 • edited by atlassian bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

codecov bot commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

meiravgri Jan 6, 2026

Choose a reason for hiding this comment

Uh oh!

dor-forer Jan 6, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dor-forer commented Jan 5, 2026 •

edited by atlassian bot

Loading

codecov bot commented Jan 5, 2026 •

edited

Loading