Skip to content

Enable DirectIO for highest fidelity vectors for all quantized vector indices #134479

@benwtrent

Description

@benwtrent

We need to provide an index time configuration item for enabling DirectIO for all our quantized vector types. Currently, its possible to only enable direct IO through a java feature flag. This of course is difficult for users to enable and it applies to only BBQ vector types and does so indiscriminately.

We must be able to enable DirectIO for the high fidelity vectors provided when quantized search is enabled and reranking must take place.

Since this should be possible for all quantized values, and we want to be able to quantize from bytes, and have bfloat16 vectors, I am not sure a fully qualified format name will be scalable or usable for this.

I think we will need fully composable, pluggable, and backwards compatible formats: #134394

Metadata

Metadata

Assignees

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions