Build tiny segments on the CPU rather than the GPU #136387

ChrisHegarty · 2025-10-10T13:00:33Z

This commit builds tiny segments on the CPU rather than on the GPU.

The default threshold is 10k vectors: for segments with fewer vectors than this, the graph is built on the CPU. Ultimately I'd prefer to just brute-force below this threshold, but the Lucene reader does not yet support this. It should in the yet-to-be-released Lucene 10.4.
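
As a rough illustration of the threshold rule in the description, the CPU/GPU decision could be sketched as below. The class name `TinySegmentPolicy`, the constant name, and the method name are hypothetical and are not taken from this PR; only the 10k default and the "fewer vectors than the threshold means CPU" rule come from the description.

```java
// Hypothetical sketch, not code from this PR.
final class TinySegmentPolicy {
    /** Default from the PR description: 10k vectors. The constant name is an assumption. */
    static final int DEFAULT_TINY_SEGMENT_THRESHOLD = 10_000;

    private final int threshold;

    TinySegmentPolicy(int threshold) {
        if (threshold < 0) {
            throw new IllegalArgumentException("threshold must be non-negative: " + threshold);
        }
        this.threshold = threshold;
    }

    /** Returns true when a segment is small enough that its graph should be built on the CPU. */
    boolean buildOnCpu(int numVectors) {
        return numVectors < threshold;
    }
}
```

With a threshold of `Integer.MAX_VALUE`, practically every segment takes the CPU path, which is one way a test could force CPU-only graph building.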

elasticsearchmachine · 2025-10-10T13:00:58Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)


ldematte

Looks very good!
Some nitpicks and only one real comment, on the duplication between flush and merge.

x-pack/plugin/gpu/src/main/java/org/elasticsearch/xpack/gpu/codec/ES92GpuHnswVectorsFormat.java

ldematte · 2025-10-13T07:58:36Z

x-pack/plugin/gpu/src/main/java/org/elasticsearch/xpack/gpu/codec/ES92GpuHnswVectorsWriter.java

+
+    OnHeapHnswGraph buildGraphWithTheCPU(RandomVectorScorerSupplier scorerSupplier, int numVectors) throws IOException {
+        assert numVectors > 0;
+        var hnswGraphBuilder = HnswGraphBuilder.create(scorerSupplier, M, beamWidth, HnswGraphBuilder.randSeed);


For a follow-up: I saw discussion about adjusting M for CPU vs GPU; should we somehow adjust it?
(Just a reminder, I don't think this should go in this PR)

I think that the graphs here are quite small, so it should be fine, but any references or hints would be greatly appreciated.

I'll let @mayya-sharipova chime in here -- it's way out of my comfort zone :)

x-pack/plugin/gpu/src/main/java/org/elasticsearch/xpack/gpu/codec/ES92GpuHnswVectorsWriter.java

ldematte · 2025-10-13T08:07:59Z

x-pack/plugin/gpu/src/main/java/org/elasticsearch/xpack/gpu/codec/ES92GpuHnswVectorsWriter.java

+        writeMeta(fieldInfo, vectorIndexOffset, vectorIndexLength, datasetSize, graph, graphLevelNodeOffsets);
+    }
+
+    void createGraphWithCPUAndWriteMeta(FieldInfo fieldInfo, IndexInput input, int size) throws IOException {


This looks very close to flushFieldBuildingGraphOnCPU/generateCPUGraphAndWriteMeta (besides supporting BYTE in merge) -- can we simplify? Or reorganize/give these more specific names? I got a bit lost and had to navigate to the caller to understand the differences.

Yeah, they are subtly different. I did try a few different options, but ultimately how we deal with things on the CPU is quite different from the GPU, and I felt that abstracting things out too much hurt readability. Though I do agree that it's a bit "windy" to follow! :-(

I'm talking more about how we deal with things in flush and merge.
If I got this correctly, we have
flush -> flushFieldBuildingGraphOnCPU -> generateCPUGraphAndWriteMeta
and
mergeOneField -> createGraphWithCPUAndWriteMeta

Both generateCPUGraphAndWriteMeta and createGraphWithCPUAndWriteMeta call buildGraphWithTheCPU + writeGraphAndMeta; createGraphWithCPUAndWriteMeta looks a lot like generateCPUGraphAndWriteMeta and flushFieldBuildingGraphOnCPU combined, but for the BYTE case.
Maybe they can be merged? Or at least renamed, to make clear one is for the flush case and the other for the merge case?
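
One possible shape for that suggestion, as a minimal sketch only: two clearly named entry points (one for flush, one for merge) that share a single build-and-write tail. Every name here (`CpuGraphPaths`, `flushFieldOnCpu`, `mergeFieldOnCpu`, `buildAndWrite`) is hypothetical, and the string log stands in for the real graph building and metadata writing, so this says nothing about how the actual writer is or should be implemented.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; these are not the real writer classes or method names.
final class CpuGraphPaths {
    /** Records the steps taken, standing in for real graph building and metadata writing. */
    final List<String> steps = new ArrayList<>();

    /** Flush entry point: named so the caller can tell it apart from the merge case. */
    void flushFieldOnCpu(String field, int numVectors) {
        steps.add("flush:" + field);
        buildAndWrite(field, numVectors);
    }

    /** Merge entry point: same tail as flush, different source of vectors. */
    void mergeFieldOnCpu(String field, int numVectors) {
        steps.add("merge:" + field);
        buildAndWrite(field, numVectors);
    }

    /** Shared tail, mirroring the buildGraphWithTheCPU + writeGraphAndMeta pair discussed above. */
    private void buildAndWrite(String field, int numVectors) {
        if (numVectors <= 0) {
            throw new IllegalArgumentException("numVectors must be positive: " + numVectors);
        }
        steps.add("buildGraph:" + numVectors);
        steps.add("writeGraphAndMeta:" + field);
    }
}
```

Whether the flush and merge paths are similar enough for this to pay off depends on the encoding-specific handling mentioned in the thread, which the sketch deliberately leaves out.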

...in/gpu/src/test/java/org/elasticsearch/xpack/gpu/codec/ES92GpuHnswVectorsFormatCPUTests.java

.../plugin/gpu/src/test/java/org/elasticsearch/xpack/gpu/codec/ThrowingCuVSResourceManager.java

Build tiny segments on the CPU rather than the CPU

c50bdb7

ChrisHegarty requested a review from ldematte October 10, 2025 13:00

ChrisHegarty added the :Search Relevance/Vectors, Team:Search Relevance, v9.2.1, and v9.3.0 labels Oct 10, 2025

ChrisHegarty added the >refactoring label Oct 10, 2025

ldematte changed the title ~~Build tiny segments on the CPU rather than the CPU~~ Build tiny segments on the CPU rather than the GPU Oct 10, 2025

ChrisHegarty added 2 commits October 12, 2025 16:53

itr

e259854

Merge branch 'main' into gpu_tiny_segments

18925f1

ChrisHegarty added the test-gpu label Oct 12, 2025

elasticsearchmachine and others added 6 commits October 12, 2025 16:06

[CI] Auto commit changes from spotless

a5c3ce4

itr

5b93627

Merge remote-tracking branch 'chegar/gpu_tiny_segments' into gpu_tiny_segments

3521568

itr

e4bb1fb

fix test

4194517

comment

9dcdf03

ldematte reviewed Oct 13, 2025


ChrisHegarty added 3 commits October 13, 2025 10:44

use Integer.MAX_VALUE for CPU test

4df2570

review comment - arg order

8ae6178

tests

d83b55f
