Embedding Service by EmanueleDeRossi1 · Pull Request #1286 · rhesis-ai/rhesis

EmanueleDeRossi1 · 2026-02-09T10:36:00Z

This PR introduces changes from the feature/embedding-service branch.

📝 Summary

📁 Files Changed (17 files)

.github/workflows/backend-test.yml
apps/backend/pyproject.toml
apps/backend/src/rhesis/backend/alembic/versions/a1b2c3d4e5f8_create_embedding_table.py
apps/backend/src/rhesis/backend/app/models/__init__.py
apps/backend/src/rhesis/backend/app/models/embedding.py
apps/backend/src/rhesis/backend/app/models/mixins.py
apps/backend/src/rhesis/backend/app/models/source.py
apps/backend/src/rhesis/backend/app/models/test.py
apps/backend/src/rhesis/backend/app/services/async_service.py
apps/backend/src/rhesis/backend/app/services/initial_data.json
apps/backend/src/rhesis/backend/app/services/telemetry/enrichment/service.py
apps/backend/uv.lock
docker-compose.yml
docs/content/development/index.mdx
infrastructure/k8s/charts/rhesis/values.yaml
tests/README.md
tests/sdk/integration/docker-compose.yml

📋 Commit Details

ea914325f - refactor(telemetry): migrate EnrichmentService to AsyncService base (Emanuele De Rossi, 2026-02-09 11:35)
60b8f5515 - feat(services): add AsyncService base class for async/sync task orchestration (Emanuele De Rossi, 2026-02-09 11:33)
0bc7d2d43 - fix: remove ABC inheritance from EmbeddableMixin (Emanuele De Rossi, 2026-02-04 09:54)
90d40f2cc - feat: add EmbeddableMixin for searchable text generation (Emanuele De Rossi, 2026-02-04 09:03)
97c78e0bf - feat: enhance embedding table with multi-dimension support and full-text search (Emanuele De Rossi, 2026-02-03 15:44)
0bb1a9f08 - feat(db): enable pgvector extension for vector storage (Emanuele De Rossi, 2026-02-03 11:37)
616d94472 - feat: add multi-dimension embedding storage system (Emanuele De Rossi, 2026-02-02 09:14)

✅ Checklist

Code follows the project's style guidelines
Self-review of code has been performed
Code is commented, particularly in hard-to-understand areas
Corresponding changes to documentation have been made
Tests have been added/updated for new functionality
All tests pass locally

🧪 Testing

📸 Screenshots (if applicable)

🔗 Related Issues

- Rename get_model() → get_language_model() and get_embedder() → get_embedding_model()
- Rename ModelConfig → LanguageModelConfig and EmbedderConfig → EmbeddingModelConfig
- Keep deprecated aliases for backward compatibility
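The PR text does not show how the deprecated aliases are implemented. One common way to keep an old name working after a rename is a thin wrapper that emits a DeprecationWarning and delegates to the new function. The sketch below assumes that approach; the signature and the string return value are placeholders, not the SDK's real API.

```python
import warnings


def get_language_model(name: str = "default") -> str:
    """Stand-in for the renamed factory; returns a label, not a real model."""
    return f"language-model:{name}"


def get_model(name: str = "default") -> str:
    """Deprecated alias kept for backward compatibility (assumed shape)."""
    warnings.warn(
        "get_model() is deprecated; use get_language_model() instead",
        DeprecationWarning,
        stacklevel=2,  # point the warning at the caller, not at this wrapper
    )
    return get_language_model(name)
```

Existing callers of get_model() keep working and see a warning that names the replacement.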

… and backend

- Renamed get_model() to get_language_model() across SDK
- Renamed DEFAULT_MODEL_NAME to DEFAULT_LANGUAGE_MODEL_NAME in all providers
- Renamed PROVIDER_REGISTRY to LANGUAGE_MODEL_PROVIDER_REGISTRY

Rename DEFAULT_GENERATION_MODEL → DEFAULT_LANGUAGE_MODEL_PROVIDER and DEFAULT_MODEL_NAME → DEFAULT_LANGUAGE_MODEL_NAME across all services

- Rename model_type to purpose in _get_user_model and related functions to avoid confusion with the model_type terminology (which indicates whether a model is a language model or an embedding model)

Add Rhesis as the default embedding model provider, following the same pattern as the language model.

Backend changes:
- Update constants to use consistent naming (DEFAULT_EMBEDDING_MODEL_PROVIDER)
- Create default Rhesis embedding model during organization initialization
- Store both language_model_id and embedding_model_id in user settings
- Update generate/embedding endpoint to use new constants

SDK changes:
- Implement complete RhesisEmbedder class with generate() and generate_batch()
- Add factory function for Rhesis embedding model
- Register "rhesis" provider in EMBEDDING_MODEL_REGISTRY
- Update DEFAULT_EMBEDDING_MODEL_PROVIDER from "openai" to "rhesis"

This enables users to use Rhesis-hosted embeddings by default while still allowing custom embedding model configuration.

… connection

- Use correct import (DEFAULT_LANGUAGE_MODEL_PROVIDER) in tests
- Remove unused aliases (DEFAULT_MODELS, DEFAULT_PROVIDER)

Implement dedicated Embedding table with support for multiple vector dimensions (768, 1536, 3072) using separate columns for better performance and type safety. Embeddings have a polymorphic relationship pattern, now supporting Test and Source entities.

Key changes:
- Add Embedding model with vector_768, vector_1536, vector_3072 columns
- Add pgvector dependency for PostgreSQL vector operations
- Create Alembic migration with HNSW indexes
- Add polymorphic embeddings relationship to Test and Source models
- Add constraint ensuring exactly one vector column is populated per record
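The one-column-per-dimension layout and the "exactly one vector column" rule can be illustrated in plain Python. This is a sketch of the idea only: the column names come from the commit message, while the helper functions are hypothetical and stand in for what the model and the database constraint would enforce.

```python
from typing import Sequence

# Column names follow the commit message; the helpers below are hypothetical.
VECTOR_COLUMNS = {768: "vector_768", 1536: "vector_1536", 3072: "vector_3072"}


def column_for(vector: Sequence[float]) -> str:
    """Pick the storage column whose dimension matches the vector's length."""
    try:
        return VECTOR_COLUMNS[len(vector)]
    except KeyError:
        raise ValueError(f"unsupported embedding dimension: {len(vector)}") from None


def exactly_one_populated(row: dict) -> bool:
    """Plain-Python mirror of the 'exactly one vector column is populated' rule."""
    return sum(row.get(col) is not None for col in VECTOR_COLUMNS.values()) == 1
```

A later commit in this PR changes the supported dimensions, so the mapping above reflects only this commit's description.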

- Update PostgreSQL images from postgres:16-alpine to pgvector/pgvector:pg16 across all environments (local, k8s, CI/CD)

…ext search

Add support for 384, 768, 1024, and 1536-dimensional embeddings with pgvector HNSW indexes. Introduce embedding configuration tracking with config_hash, text_hash, status, and weight fields. Add EmbeddingConfig utility class and improved property accessors for dynamic column selection. Update initial data.

Add abstract mixin requiring entities to implement to_searchable_text() for embeddings and full-text search. Implement for Source and Test models.
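A minimal sketch of such a mixin follows. It uses a plain class that raises NotImplementedError rather than an ABC, and the SourceStub class with its title and description fields is hypothetical, since the real Source model is not shown in the PR text.

```python
from dataclasses import dataclass


class EmbeddableMixin:
    """Entities mix this in and override to_searchable_text()."""

    def to_searchable_text(self) -> str:
        raise NotImplementedError(
            f"{type(self).__name__} must implement to_searchable_text()"
        )


@dataclass
class SourceStub(EmbeddableMixin):
    # Hypothetical fields used only for illustration.
    title: str
    description: str = ""

    def to_searchable_text(self) -> str:
        # Join the non-empty fields into one string for embedding and search.
        return " ".join(part for part in (self.title, self.description) if part)
```

With this shape, a forgotten override surfaces when the method is called rather than when the class is instantiated.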

This fixes the metaclass conflict where Test inherited from both Base (which uses SQLAlchemy's DeclarativeMeta metaclass) and ABC (which uses the ABCMeta metaclass).
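The conflict can be reproduced without SQLAlchemy. In the sketch below, DeclarativeMetaStandIn is a hypothetical stand-in for the declarative metaclass; any custom metaclass unrelated to ABCMeta triggers the same error. The second half shows the fix described above: a plain mixin with no ABC in its bases.

```python
from abc import ABC


class DeclarativeMetaStandIn(type):
    """Hypothetical stand-in for SQLAlchemy's declarative metaclass."""


class Base(metaclass=DeclarativeMetaStandIn):
    pass


def define_conflicting_class():
    # Base uses DeclarativeMetaStandIn and ABC uses ABCMeta. Neither metaclass
    # derives from the other, so Python cannot choose one for the new class.
    class Broken(Base, ABC):
        pass

    return Broken


try:
    define_conflicting_class()
    conflict = None
except TypeError as exc:
    conflict = str(exc)


class EmbeddableMixin:  # plain class, no ABC in its bases
    def to_searchable_text(self) -> str:
        raise NotImplementedError


class Record(Base, EmbeddableMixin):  # defined without a metaclass conflict
    def to_searchable_text(self) -> str:
        return "record"
```

Dropping ABC means the "must implement" contract is enforced at call time instead of at instantiation, which is the trade-off of this fix.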

…stration

Add reusable base class that provides:
- Celery worker availability checking with timeout
- Automatic fallback from async to sync execution
- Batch processing with single worker check optimization
- Generic typing support for different return types

This pattern enables services to seamlessly work in both production (with Celery workers) and development (without workers) environments.
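The fallback pattern can be sketched without Celery by injecting the worker check as a callable. The method names execute_with_fallback and batch_execute appear in this PR's commit messages, but the signatures below are assumptions, and the timeout-based worker probe is replaced by the injected function.

```python
from typing import Callable, Generic, Iterable, List, TypeVar

T = TypeVar("T")


class AsyncServiceSketch(Generic[T]):
    """Illustrative only; not the real AsyncService from this PR."""

    def __init__(self, workers_available: Callable[[], bool]) -> None:
        # In production this would probe Celery workers with a timeout.
        self._workers_available = workers_available

    def execute_with_fallback(
        self, enqueue: Callable[..., T], run_sync: Callable[..., T], *args
    ) -> T:
        # Enqueue when workers are reachable, otherwise run in-process.
        if self._workers_available():
            return enqueue(*args)
        return run_sync(*args)

    def batch_execute(
        self, items: Iterable, enqueue: Callable[..., T], run_sync: Callable[..., T]
    ) -> List[T]:
        # Check worker availability once for the whole batch, not per item.
        runner = enqueue if self._workers_available() else run_sync
        return [runner(item) for item in items]
```

Checking once per batch avoids paying the probe's timeout for every item when no workers are running.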

- Extend AsyncService for standardized async/sync orchestration
- Replace custom worker checking with inherited implementation
- Simplify enqueue_enrichment using execute_with_fallback
- Use batch_execute for efficient multi-trace processing

- Add dimension column to Model table
- Auto-detect and store embedding dimensions during connection test
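One plausible way to auto-detect the dimension is to embed a short probe string during the connection test and take the length of the returned vector. The sketch below assumes that approach; detect_dimension and its embed parameter are hypothetical names, not functions from this PR.

```python
from typing import Callable, Sequence


def detect_dimension(
    embed: Callable[[str], Sequence[float]], probe: str = "test"
) -> int:
    """Embed a probe string and report the length of the resulting vector."""
    vector = embed(probe)
    if not vector:
        raise ValueError("embedding model returned an empty vector")
    return len(vector)
```

The detected value could then be stored in the new dimension column so later code knows which vector column to use.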

…ltiple identical active embeddings

…int param

The endpoint parameter was never used for embedding models, only for LLMs. Remove it from _test_embedding_connection.

Implement EmbeddingGenerator class to generate and manage embeddings for any embeddable entity in the system.

Key features:
- Generate embeddings using configurable embedding models
- Compute content and configuration hashes for deduplication
- Automatic detection and marking of stale embeddings
- Support for multiple entity types with dynamic model lookup
- Proper error handling for invalid entity types and missing entities
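Hash-based deduplication and stale detection can be sketched as follows. The text_hash and config_hash names match the fields mentioned elsewhere in this PR, but the use of SHA-256, the JSON canonicalization, and the is_stale helper are assumptions made for illustration.

```python
import hashlib
import json


def text_hash(text: str) -> str:
    """Hash the searchable text so unchanged content can be recognized."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def config_hash(config: dict) -> str:
    """Hash the embedding configuration; sort_keys ignores key order."""
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def is_stale(stored: dict, current_text: str, current_config: dict) -> bool:
    """An embedding is stale when either the text or the config has changed."""
    return (
        stored["text_hash"] != text_hash(current_text)
        or stored["config_hash"] != config_hash(current_config)
    )
```

If both hashes match a stored record, regeneration can be skipped; if either differs, the stored embedding can be marked stale and a new one generated.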

EmanueleDeRossi1 self-assigned this Feb 9, 2026

EmanueleDeRossi1 marked this pull request as draft February 9, 2026 10:37

EmanueleDeRossi1 force-pushed the feature/embedding-service branch from ea91432 to f3254fc February 11, 2026 16:13

EmanueleDeRossi1 had a problem deploying to test February 11, 2026 16:13 — with GitHub Actions Failure

EmanueleDeRossi1 force-pushed the feature/embedding-service branch from f3254fc to 79338f8 February 12, 2026 16:01

EmanueleDeRossi1 had a problem deploying to test February 12, 2026 16:01 — with GitHub Actions Failure

EmanueleDeRossi1 had a problem deploying to test February 12, 2026 16:05 — with GitHub Actions Failure

EmanueleDeRossi1 had a problem deploying to test February 13, 2026 13:30 — with GitHub Actions Failure

EmanueleDeRossi1 added 22 commits February 13, 2026 17:59

feat: consistently use 'language model' instead of 'llm model'

953fae2

feat: use consistent naming for language and embedding model in tests…

a6af4c1

… and backend

feat(sdk): get_model to get_language_model for clarity

4bdb571

- Renamed get_model() to get_language_model() across SDK
- Renamed DEFAULT_MODEL_NAME to DEFAULT_LANGUAGE_MODEL_NAME in all providers
- Renamed PROVIDER_REGISTRY to LANGUAGE_MODEL_PROVIDER_REGISTRY

feat(test): update tests for renaming in SDK

c683c1e

refactor: rename model config vars for clarity

90e98f6

Rename DEFAULT_GENERATION_MODEL → DEFAULT_LANGUAGE_MODEL_PROVIDER and DEFAULT_MODEL_NAME → DEFAULT_LANGUAGE_MODEL_NAME across all services

rename model_type to purpose

581e981

- Rename model_type to purpose in _get_user_model and related functions to avoid confusion with the model_type terminology (which indicates whether a model is a language model or an embedding model)

refactor(frontend): change model_type from llm -> language

f26b2e6

refactor(sdk): change llm to language in model_type param

e79df19

change from 'language_model' to 'model' in schemas, routers and model…

561d187

… connection

fix: import in tests and remove unused aliases

46fed56

- Use correct import (DEFAULT_LANGUAGE_MODEL_PROVIDER) in tests
- Remove unused aliases (DEFAULT_MODELS, DEFAULT_PROVIDER)

fix(test): use rhesis default embedding model

cbb1514

fix: import

62e6b2a

fix(test): 'get_model'-> 'get_language_model'

e87e8d9

feat(db): enable pgvector extension for vector storage

324d483

- Update PostgreSQL images from postgres:16-alpine to pgvector/pgvector:pg16 across all environments (local, k8s, CI/CD)

feat: add EmbeddableMixin for searchable text generation

0e37144

Add abstract mixin requiring entities to implement to_searchable_text() for embeddings and full-text search. Implement for Source and Test models.

fix: remove ABC inheritance from EmbeddableMixin

a8ffa1d

This fixes the metaclass conflict where Test inherited from both Base (which uses SQLAlchemy's DeclarativeMeta metaclass) and ABC (which uses the ABCMeta metaclass).

EmanueleDeRossi1 added 4 commits February 16, 2026 13:26

feat(models): add dimension auto-detection for embedding models

b0d0304

- Add dimension column to Model table
- Auto-detect and store embedding dimensions during connection test

feat(embedding): add unique constraint to embedding model to avoid mu…

cfad66a

…ltiple identical active embeddings

refactor: simplify embedding connection test by removing unused endpo…

fddfa45

…int param

The endpoint parameter was never used for embedding models, only for LLMs. Remove it from _test_embedding_connection.

EmanueleDeRossi1 force-pushed the feature/embedding-service branch from 734c69a to 24fe6d0 February 16, 2026 13:52

EmanueleDeRossi1 had a problem deploying to test February 16, 2026 13:52 — with GitHub Actions Failure

Embedding Service #1286
EmanueleDeRossi1 wants to merge 26 commits into main from feature/embedding-service
