-
Notifications
You must be signed in to change notification settings - Fork 11
Matching service #627
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
MaxiLein
wants to merge
63
commits into
main
Choose a base branch
from
Matching-Service
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Matching service #627
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…onment configuration
- Update dev script in package.json to watch for .env changes - Add dotenv dependency for environment variable management - Modify config to include ollamaBearerToken from environment variables - Ensure asynchronous model loading in ensureAllHuggingfaceModelsAreAvailable - Include ollamaBearerToken in Ollama instance headers
This comment has been minimized.
This comment has been minimized.
…r service - Introduced custom error classes for better error context and handling. - Updated middleware to handle database errors and validation errors gracefully. - Improved logging for worker management, model initialization, and semantic splitting tasks. - Added verbose logging options to provide detailed runtime information. - Refactored resource retrieval functions to throw specific errors for better debugging. - Enhanced the reasoning and semantic splitting tasks with detailed error logging. - Implemented error handling in worker management to capture and log worker failures. - Updated the server initialisation process to handle model availability checks with error handling.
This comment has been minimized.
This comment has been minimized.
…atching tasks - Updated default batch size for Ollama from 5 to 20. - Introduced new configuration options for embedding and matching workers. - Improved error handling and logging in worker processes. - Refactored worker manager to support static worker pools for embedding and matching tasks. - Added health check mechanism for worker responsiveness. - Implemented job processing logic to handle multiple tasks efficiently. - Enhanced logging for better traceability of worker actions and statuses.
- Replaced console logging with a centralized logger in model, ollama, worker, and embedder modules. - Introduced structured logging with log levels (DEBUG, INFO, WARN, ERROR) and log types (server, request, worker, etc.). - Enhanced worker context management to propagate request IDs and log worker activities. - Removed verbose flag usage and replaced it with appropriate logging levels. - Improved error handling and logging in worker pools and job processing. - Cleaned up deprecated log structures and ensured consistent logging practices throughout the codebase.
…ries and updating log file paths
…hance worker and error handler logging
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
|
✅ Successfully created Preview Deployment. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Matching Micro-Service
Overview
This PR introduces the first end‑to‑end version of our "Competence Matcher" microservice. It provides a REST API that lets clients:
Behind the scenes, we:
vec0extension. Each competence embedding is stored alongside its metadata; at query time, we issue a k‑NN search (cosine / L2) to find the closest matches.WorkerManagerwith configurable concurrency. Each job spins up a worker, updates its status in the DB (pending → preprocessing → pending → running → completed/failed), and closes when done.matching-workflow.pdf
TODOs:
X-Real-IPheader