Optimize DirectIO prefetch for monotonically increasing access #136946

iverase · 2025-10-22T09:44:26Z

This PR proposes a new implementation for DirectIO prefetching that is optimised for access in monotonically increase order which is the typical access when doing vector rescoring.

elasticsearchmachine · 2025-10-22T09:44:51Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

iverase · 2025-10-22T10:25:58Z

Move to draft and there is something odd. We are in places aligning the data to the blockSize and my change align data to the prefetch block size.

I guess that's really the change in the approach. What it makes very difficult to maintain slots in the current approach is that we align data using the block size but then we try to fit it in prefetch blocks of different size so a byte can be loaded differently depending on the alignment. Aligning the data to the prefetch block size ensures a position on the file will be always add to the same position on a prefetch block. I guess everything works because in my machine the prefetch block size = 2 * block size.

benwtrent · 2025-10-22T21:41:40Z

I wonder if it's better to just check the 'floor' of already requested blocks when potentially adding a position slot.

I think the "grabbing oldest" is likely ok logic if you want to split it out of this PR

iverase · 2025-10-23T09:30:46Z

I made data to be blockSize align in 3f32209

benwtrent

I like this. have you benchmarked it to see if it helps?

iverase · 2025-10-30T11:56:19Z

I like this. have you benchmarked it to see if it helps?

I tried with some local examples and it helped quite a lot, of course it was a case the current code was doing very bad. I will try to get a realistic example.

benwtrent · 2025-11-04T14:31:36Z

@iverase I tried with vector rescoring and this PR seems to make things slower. Could you confirm? Maybe I have benchmarking bias here :)

benwtrent · 2025-11-06T20:58:47Z

I ran more benchmarks, this does seem faster.

this pr

min: 204.92
max: 355.87
50th_percentile: 271.39

baseline:

min: 163.13
max: 270.27
50th_percentile: 216.45

If you agree, I think its good to merge.

…ic#136946)

Optimize DirectIO prefetch for monotonically increasing access

507dc08

iverase requested a review from benwtrent October 22, 2025 09:44

iverase added >non-issue :Search Relevance/Vectors Vector search v9.3.0 labels Oct 22, 2025

iter

961f2fe

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Oct 22, 2025

iverase marked this pull request as draft October 22, 2025 10:24

iverase added 2 commits October 23, 2025 11:29

align to BlockSize

3f32209

Merge branch 'main' into DirectIOPrefetch

c8f3543

iverase marked this pull request as ready for review October 23, 2025 09:30

iverase added 2 commits October 23, 2025 11:56

improve test

c0c980f

iter

cffc513

benwtrent approved these changes Oct 29, 2025

View reviewed changes

Merge branch 'main' into DirectIOPrefetch

214e6b1

Merge branch 'main' into DirectIOPrefetch

0e9b60b

iverase merged commit 1d9ab12 into elastic:main Nov 7, 2025
34 checks passed

iverase deleted the DirectIOPrefetch branch November 7, 2025 08:53

Kubik42 pushed a commit to Kubik42/elasticsearch that referenced this pull request Nov 10, 2025

Optimize DirectIO prefetch for monotonically increasing access (elast…

e3ed6fe

…ic#136946)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize DirectIO prefetch for monotonically increasing access #136946

Optimize DirectIO prefetch for monotonically increasing access #136946

Uh oh!

iverase commented Oct 22, 2025

Uh oh!

elasticsearchmachine commented Oct 22, 2025

Uh oh!

iverase commented Oct 22, 2025 •

edited

Loading

Uh oh!

benwtrent commented Oct 22, 2025

Uh oh!

iverase commented Oct 23, 2025 •

edited

Loading

Uh oh!

benwtrent left a comment

Uh oh!

iverase commented Oct 30, 2025

Uh oh!

benwtrent commented Nov 4, 2025

Uh oh!

benwtrent commented Nov 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimize DirectIO prefetch for monotonically increasing access #136946

Optimize DirectIO prefetch for monotonically increasing access #136946

Uh oh!

Conversation

iverase commented Oct 22, 2025

Uh oh!

elasticsearchmachine commented Oct 22, 2025

Uh oh!

iverase commented Oct 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

benwtrent commented Oct 22, 2025

Uh oh!

iverase commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

benwtrent left a comment

Choose a reason for hiding this comment

Uh oh!

iverase commented Oct 30, 2025

Uh oh!

benwtrent commented Nov 4, 2025

Uh oh!

benwtrent commented Nov 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

iverase commented Oct 22, 2025 •

edited

Loading

iverase commented Oct 23, 2025 •

edited

Loading