-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Labels
Description
Qdrant client and Qdrant Cloud now offer cloud inference with Fastembed, allowing users to utilize the same models while offloading embedding computations to the cloud.
Feature request:
- Integrate Qdrant Cloud Inference as an embedding provider in Codeweaver.
- Users should be able to select Qdrant Cloud Inference as an embedding backend, in addition to currently supported local or self-hosted options.
- Ensure configuration options allow switching easily between local inference and cloud inference.
- Provide documentation for setup and usage, including API credentials, supported models, and usage examples.
Benefits:
- Simplifies scaling: Users can leverage Qdrant's managed infrastructure to handle increased embedding workloads.
- Consistency: Enables users to use the same models irrespective of local or cloud execution.
- Reduced local resource consumption: Cloud inference offloads compute from client machines.
References:
- Qdrant client documentation on Cloud Inference
- Fastembed integration details
Acceptance criteria:
- Qdrant Cloud Inference integration is available and easily selectable.
- End-user documentation is updated and tested.
- Includes basic tests to validate provider selection and inference results.