Description
Spin-off from the discussion in #14708. One of the concerns with full precision (FP) re-ranking (for quantized vectors) is that if we use the off-heap vector reader, it will page in the FP vector data, which then competes for page cache with the quantized vector data used for HNSW graph search. Since HNSW search performance degrades greatly when the vectors are not in memory, for instance under limited memory, can we support a mode that loads the FP vectors with direct I/O? (Or is this already possible?)
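For context on the mechanism such a mode would build on: the JDK can open a file with `O_DIRECT` via `com.sun.nio.file.ExtendedOpenOption.DIRECT` (JDK 10+), provided the buffer, position, and length are aligned to the filesystem block size. A minimal, self-contained sketch (the file layout and all names here are hypothetical, just for illustration): it writes one float vector padded to a block, then reads it back through an aligned direct buffer, falling back to buffered I/O where the filesystem rejects `O_DIRECT` (e.g. tmpfs).

```java
import com.sun.nio.file.ExtendedOpenOption;

import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

public class DirectIoVectorRead {

  // Writes one 4-float vector padded to the block size, then reads it back,
  // preferring O_DIRECT and falling back to a buffered read if unsupported.
  public static float[] demo() throws IOException {
    Path file = Files.createTempFile("vectors", ".vec");
    int block = Math.toIntExact(Files.getFileStore(file).getBlockSize());

    // Write the vector [1, 2, 3, 4] padded to exactly one block.
    ByteBuffer out = ByteBuffer.allocate(block).order(ByteOrder.LITTLE_ENDIAN);
    for (float v : new float[] {1f, 2f, 3f, 4f}) {
      out.putFloat(v);
    }
    out.clear();
    try (FileChannel ch = FileChannel.open(file, StandardOpenOption.WRITE)) {
      ch.write(out);
    }

    // O_DIRECT needs the buffer address, file position, and length aligned to
    // the block size; alignedSlice gives us a suitably aligned direct buffer.
    ByteBuffer buf =
        ByteBuffer.allocateDirect(block * 2).alignedSlice(block).order(ByteOrder.LITTLE_ENDIAN);
    buf.limit(block);

    FileChannel ch;
    try {
      ch = FileChannel.open(file, StandardOpenOption.READ, ExtendedOpenOption.DIRECT);
    } catch (IOException | UnsupportedOperationException e) {
      // Some filesystems (e.g. tmpfs) reject O_DIRECT; fall back to buffered I/O.
      ch = FileChannel.open(file, StandardOpenOption.READ);
    }
    try (FileChannel c = ch) {
      c.read(buf, 0);
    }
    buf.flip();

    float[] vec = new float[4];
    for (int i = 0; i < 4; i++) {
      vec[i] = buf.getFloat();
    }
    Files.delete(file);
    return vec;
  }

  public static void main(String[] args) throws IOException {
    System.out.println(java.util.Arrays.toString(demo()));
  }
}
```

Running it prints `[1.0, 2.0, 3.0, 4.0]` on either path; a real reader would of course keep the channel open and read block-aligned regions per vector rather than re-opening per read.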
For integrating with the existing quantized vectors codec, is my understanding correct that we would need to create a new codec/vector reader that extends the existing reader and uses a different raw vector format?
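A possibly lower-touch alternative to a new codec might be routing only the raw-vector files through direct I/O at the `Directory` level, e.g. via `DirectIODirectory` from lucene-misc. A rough, untested sketch of the idea (the class name and the `.vec` suffix check are my assumptions; also note that `DirectIODirectory` by default only engages direct I/O for merge-like `IOContext`s, so its protected `useDirectIO` hook would likely need overriding):

```java
// Untested sketch against Lucene's Directory API (class name hypothetical).
class RawVectorDirectIODirectory extends FilterDirectory {
  private final Directory directIoDelegate; // e.g. a DirectIODirectory over the same path

  RawVectorDirectIODirectory(Directory mmapDelegate, Directory directIoDelegate) {
    super(mmapDelegate);
    this.directIoDelegate = directIoDelegate;
  }

  @Override
  public IndexInput openInput(String name, IOContext context) throws IOException {
    if (name.endsWith(".vec")) { // raw FP vectors, read only during re-ranking
      return directIoDelegate.openInput(name, context);
    }
    return in.openInput(name, context); // quantized vectors + HNSW graph stay as-is
  }
}
```

This would avoid touching the codec at all, at the cost of keying off file extensions rather than format-level knowledge.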
I can try this, but I'm wondering what the community thinks about it. Are there other use cases that would need an on-heap direct I/O vector reader as well?