ldematte
Contributor

This PR makes a small change to improve parallelism during graph build, addressing an issue we noticed together with NVIDIA in profiler traces.
If the resulting graph is "small enough" (the threshold is currently set to 128 MB), we copy the whole graph to host memory, release the cuVS resources, and then proceed. The alternative is to download the data from the device in pages and write each page to disk. That approach is more efficient, but it holds the resources until we have finished writing to disk, which can take a while on a busy system.
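
To make the trade-off concrete, here is a minimal sketch of the two paths described above. It is not the code in this PR: `DeviceGraph`, `InMemoryGraph`, `writeGraph`, and the `pageSize` parameter are hypothetical names introduced only for illustration, and treating a graph of exactly the threshold size as "small" is an assumption. The only detail taken from the description is the 128 MB threshold and the ordering of "release resources" relative to "write to disk".

```java
import java.io.IOException;
import java.io.OutputStream;
import java.util.List;

final class GraphDownloadSketch {
    // Threshold from the PR description: graphs up to 128 MB count as "small enough".
    static final long SMALL_GRAPH_THRESHOLD_BYTES = 128L * 1024 * 1024;

    /** Hypothetical stand-in for a graph living in device memory together with its GPU resources. */
    interface DeviceGraph extends AutoCloseable {
        long sizeInBytes();

        /** Copies len bytes starting at offset from the device into dst[0..len). */
        void read(long offset, byte[] dst, int len);

        /** Releases the GPU resources so another build can use them. */
        @Override
        void close();
    }

    /** Writes the graph to out, releasing the GPU resources as early as the chosen path allows. */
    static void writeGraph(DeviceGraph graph, OutputStream out, long threshold, int pageSize) throws IOException {
        long size = graph.sizeInBytes();
        if (size <= threshold) {
            // Small graph: one bulk copy to host memory, then release before any disk I/O.
            // Assumes the threshold fits in an int-sized array; toIntExact throws otherwise.
            byte[] host = new byte[Math.toIntExact(size)];
            try {
                graph.read(0, host, host.length);
            } finally {
                graph.close();
            }
            out.write(host, 0, host.length);
        } else {
            // Large graph: page through the device copy; resources stay held until the last write.
            try {
                byte[] page = new byte[pageSize];
                for (long offset = 0; offset < size; offset += pageSize) {
                    int len = (int) Math.min(pageSize, size - offset);
                    graph.read(offset, page, len);
                    out.write(page, 0, len);
                }
            } finally {
                graph.close();
            }
        }
    }

    /** In-memory fake used to observe the order of reads and the release. */
    static final class InMemoryGraph implements DeviceGraph {
        final byte[] data;
        final List<String> events;

        InMemoryGraph(byte[] data, List<String> events) {
            this.data = data;
            this.events = events;
        }

        @Override
        public long sizeInBytes() {
            return data.length;
        }

        @Override
        public void read(long offset, byte[] dst, int len) {
            events.add("read");
            System.arraycopy(data, (int) offset, dst, 0, len);
        }

        @Override
        public void close() {
            events.add("close");
        }
    }
}
```

In the small-graph branch the release happens before the first disk write, so a slow disk no longer delays the next build waiting on the same resources. The cost is a host buffer as large as the graph, which is presumably why the path is capped by a threshold.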

@elasticsearchmachine
Collaborator

Pinging @elastic/es-search-relevance (Team:Search Relevance)

@elasticsearchmachine added the Team:Search Relevance label (Meta label for the Search Relevance team in Elasticsearch) Oct 13, 2025
Contributor

@ChrisHegarty ChrisHegarty left a comment

Makes sense. LGTM.

@ldematte removed the test-gpu label (Run tests using a GPU) Oct 13, 2025
@ldematte
Contributor Author

Updated without the test-gpu label to verify that this works well with "regular" CI.

@ldematte ldematte enabled auto-merge (squash) October 13, 2025 10:43
@ldematte ldematte merged commit ca5ca98 into elastic:main Oct 13, 2025
34 checks passed
georgewallace pushed a commit to georgewallace/elasticsearch that referenced this pull request Oct 13, 2025
…ly (elastic#136464)


Labels

>non-issue, :Search Relevance/Vectors (Vector search), Team:Search Relevance (Meta label for the Search Relevance team in Elasticsearch), v9.2.1, v9.3.0


3 participants