Add support for Qdrant Cloud Inference as an embedding provider #134

@bashandbone

Description

The Qdrant client and Qdrant Cloud now offer cloud inference with FastEmbed, so users can keep the same embedding models while offloading embedding computation to the cloud.

Feature request:

  • Integrate Qdrant Cloud Inference as an embedding provider in Codeweaver.
  • Users should be able to select Qdrant Cloud Inference as an embedding backend, in addition to currently supported local or self-hosted options.
  • Ensure configuration options allow switching easily between local inference and cloud inference.
  • Provide documentation for setup and usage, including API credentials, supported models, and usage examples.
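
To make the switching requirement concrete, here is a minimal sketch of what a backend setting could look like. Every name in it is hypothetical and not taken from the Codeweaver codebase: `EmbeddingSettings`, `settings_from_env`, the `CODEWEAVER_*` and `QDRANT_*` environment variables, and the default model string are all placeholders. It only illustrates the idea that a single field selects local or cloud inference and that cloud credentials are validated up front.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Hypothetical backend identifiers; the real option names are still to be decided.
VALID_BACKENDS = ("local", "qdrant_cloud")
# Placeholder model name; whether a given model is available in cloud
# inference must be checked against the Qdrant documentation.
DEFAULT_MODEL = "sentence-transformers/all-MiniLM-L6-v2"


@dataclass(frozen=True)
class EmbeddingSettings:
    """Illustrative settings object; field names are assumptions."""

    backend: str = "local"
    model: str = DEFAULT_MODEL
    qdrant_url: Optional[str] = None
    qdrant_api_key: Optional[str] = None

    def __post_init__(self) -> None:
        if self.backend not in VALID_BACKENDS:
            raise ValueError(f"unknown embedding backend: {self.backend!r}")
        # Cloud inference needs a cluster URL and an API key; local does not.
        if self.backend == "qdrant_cloud" and not (self.qdrant_url and self.qdrant_api_key):
            raise ValueError("qdrant_cloud backend requires qdrant_url and qdrant_api_key")


def settings_from_env(env: Mapping[str, str] = os.environ) -> EmbeddingSettings:
    """Build settings from environment variables (variable names are hypothetical)."""
    return EmbeddingSettings(
        backend=env.get("CODEWEAVER_EMBEDDING_BACKEND", "local"),
        model=env.get("CODEWEAVER_EMBEDDING_MODEL", DEFAULT_MODEL),
        qdrant_url=env.get("QDRANT_URL"),
        qdrant_api_key=env.get("QDRANT_API_KEY"),
    )
```

With a shape like this, moving between local and cloud inference is a one-variable change, and a missing credential fails at startup rather than on the first embedding request.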

Benefits:

  • Simplifies scaling: Users can leverage Qdrant's managed infrastructure to handle increased embedding workloads.
  • Consistency: Enables users to use the same models irrespective of local or cloud execution.
  • Reduced local resource consumption: Cloud inference offloads compute from client machines.

References:

  • Qdrant client documentation on Cloud Inference
  • Fastembed integration details

Acceptance criteria:

  • Qdrant Cloud Inference integration is available and easily selectable.
  • End-user documentation is updated and tested.
  • Basic tests validate provider selection and inference results.
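
The "basic tests" criterion could be covered by something like the sketch below. It is an assumption about structure, not existing Codeweaver code: `EmbeddingProvider`, `FakeProvider`, `REGISTRY`, and `select_provider` are hypothetical names. The fake provider stands in for both backends so the tests run without a network connection or a model download. Real tests would swap in the actual providers, likely behind an opt-in flag for the cloud case.

```python
import unittest
from typing import Callable, Dict, List, Protocol


class EmbeddingProvider(Protocol):
    """Hypothetical provider interface: a name plus a batch embed method."""

    name: str

    def embed(self, texts: List[str]) -> List[List[float]]: ...


class FakeProvider:
    """Stand-in that returns fixed-size vectors without any network or model."""

    def __init__(self, name: str, dim: int = 4) -> None:
        self.name = name
        self.dim = dim

    def embed(self, texts: List[str]) -> List[List[float]]:
        # One vector per input text, each of length `dim`.
        return [[float(len(t))] * self.dim for t in texts]


# Hypothetical registry mapping backend identifiers to provider factories.
REGISTRY: Dict[str, Callable[[], EmbeddingProvider]] = {
    "local": lambda: FakeProvider("local"),
    "qdrant_cloud": lambda: FakeProvider("qdrant_cloud"),
}


def select_provider(backend: str) -> EmbeddingProvider:
    """Return a provider for the requested backend or raise ValueError."""
    try:
        return REGISTRY[backend]()
    except KeyError:
        raise ValueError(f"unknown embedding backend: {backend!r}") from None


class ProviderSelectionTests(unittest.TestCase):
    def test_selects_requested_backend(self) -> None:
        for backend in ("local", "qdrant_cloud"):
            self.assertEqual(select_provider(backend).name, backend)

    def test_unknown_backend_is_rejected(self) -> None:
        with self.assertRaises(ValueError):
            select_provider("does-not-exist")

    def test_one_vector_per_input_with_consistent_size(self) -> None:
        vectors = select_provider("qdrant_cloud").embed(["def f(): pass", "x = 1"])
        self.assertEqual(len(vectors), 2)
        self.assertEqual({len(v) for v in vectors}, {4})
```

Saved as a test module, this runs under `python -m unittest`. The third test checks only the shape of the result (one vector per input, consistent dimension), which is the kind of assertion that should hold for either backend.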
