
Conversation

@MaxiLein (Contributor) commented Jul 29, 2025

Matching Micro-Service

Overview
This PR introduces the first end‑to‑end version of our "Competence Matcher" microservice. It provides a REST API that lets clients:

  1. Create and store "competence lists" (each a snapshot of resources + associated competencies)
  2. Embed all competency descriptions into a vector database (SQLite + sqlite‑vec)
  3. Match arbitrary task descriptions against those stored competencies, returning the nearest neighbors
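
To make the three steps concrete, here is a minimal client-side sketch. The route names and payload shapes are illustrative assumptions, not the service's confirmed API:

```typescript
// Hypothetical client flow; endpoints and field names are assumptions.
const BASE = "http://localhost:3000";

// 1. + 2. Create a competence list; the service embeds all competency
// descriptions into the vector database as an asynchronous job.
const created = await fetch(`${BASE}/competence-lists`, {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    name: "team-a",
    resources: [
      { id: "alice", competencies: ["Statistical analysis of survey data"] },
    ],
  }),
}).then((res) => res.json());

// 3. Match a task description against the stored competencies.
const matches = await fetch(`${BASE}/competence-lists/${created.listId}/match`, {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ task: "Evaluate the results of our user survey" }),
}).then((res) => res.json());

console.log(matches); // nearest neighbours, ranked by similarity
```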

Behind the scenes, we:

  • Persist vectors in SQLite via the vec0 extension. Each competence embedding is stored alongside its metadata; at query time, we issue a k‑NN search (cosine / L2) to find the closest matches (a minimal query sketch follows after this list).
  • Semantically split long competency descriptions with a local LLM (via Ollama). This improves coverage by breaking large text blobs into coherent chunks before embedding.
  • Zero‑shot classify each candidate match to detect "semantic opposites" or contradictions (e.g. the task asks for "good in X" while the competence states "not good in X"). We down‑weight or filter out matches whose zero‑shot label ("contradicting" vs. "neutral" vs. "aligning") indicates low relevance (see the classification sketch below).
  • Offload heavy work (embedding and matching) into a pool of worker threads, coordinated by a simple WorkerManager with configurable concurrency. Each job spins up a worker, updates its status in the DB (pending → preprocessing → pending → running → completed/failed), and closes when done (a condensed sketch follows below).
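
For the vector persistence, a minimal sketch of the vec0 usage, assuming better-sqlite3 and the sqlite-vec npm package; the table name, embedding dimension (384), and the cosine metric option are illustrative, and metadata would be joined in via rowid:

```typescript
import Database from "better-sqlite3";
import * as sqliteVec from "sqlite-vec";

const db = new Database("competences.db");
sqliteVec.load(db); // registers the vec0 virtual table

db.exec(`CREATE VIRTUAL TABLE IF NOT EXISTS competence_vectors
         USING vec0(embedding float[384] distance_metric=cosine)`);

// Store one embedding (Float32Array serialised to a raw byte buffer).
function insertEmbedding(rowid: number, vector: Float32Array): void {
  db.prepare("INSERT INTO competence_vectors(rowid, embedding) VALUES (?, ?)")
    .run(rowid, Buffer.from(vector.buffer));
}

// k-NN search: vec0 exposes MATCH + ORDER BY distance for nearest neighbours.
function nearest(query: Float32Array, k: number) {
  return db
    .prepare(`SELECT rowid, distance FROM competence_vectors
              WHERE embedding MATCH ? ORDER BY distance LIMIT ?`)
    .all(Buffer.from(query.buffer), k);
}
```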
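The contradiction filter can be sketched with the transformers.js zero-shot pipeline; the model here is a hypothetical pick (any NLI-capable model would do) and the keep/drop logic is simplified:

```typescript
import { pipeline } from "@huggingface/transformers";

// Hypothetical model choice; the labels mirror the ones described above.
const classify = await pipeline(
  "zero-shot-classification",
  "Xenova/nli-deberta-v3-small"
);

const LABELS = ["contradicting", "neutral", "aligning"];

async function keepMatch(task: string, competence: string): Promise<boolean> {
  const result: any = await classify(
    `Task: ${task}\nCompetence: ${competence}`,
    LABELS
  );
  // result.labels is sorted by descending score; drop candidates whose
  // top label marks them as a semantic opposite.
  return result.labels[0] !== "contradicting";
}

// keepMatch("Needs someone good in X", "Is not good in X") → false (ideally)
```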

matching-workflow.pdf
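
Finally, a condensed sketch of the WorkerManager idea; the status lifecycle follows the description above, and `updateJobStatus` is a hypothetical stand-in for the actual DB status helper:

```typescript
import { Worker } from "node:worker_threads";

type JobStatus = "pending" | "preprocessing" | "running" | "completed" | "failed";

// Hypothetical stand-in for the DB helper that persists job status.
declare function updateJobStatus(jobId: string, status: JobStatus): void;

class WorkerManager {
  private active = 0;
  private queue: Array<{ jobId: string; payload: unknown }> = [];

  constructor(private script: string, private concurrency = 2) {}

  submit(jobId: string, payload: unknown): void {
    updateJobStatus(jobId, "pending");
    this.queue.push({ jobId, payload });
    this.drain();
  }

  private drain(): void {
    while (this.active < this.concurrency && this.queue.length > 0) {
      const { jobId, payload } = this.queue.shift()!;
      this.active++;
      const worker = new Worker(this.script, { workerData: { jobId, payload } });
      // Workers report intermediate transitions (preprocessing → pending → running).
      worker.on("message", (status: JobStatus) => updateJobStatus(jobId, status));
      worker.on("exit", (code) => {
        updateJobStatus(jobId, code === 0 ? "completed" : "failed");
        this.active--;
        this.drain(); // pick up the next queued job
      });
    }
  }
}
```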


TODOs:

  • Currently, each worker thread loads local models into RAM (CPU/GPU) separately, which could lead to the process running out of memory. Hence, I plan on writing a wrapper for the transformers.js pipeline. The WorkerManager in the main thread should manage not only a worker pool but also a (configurable) set of models. Most likely, SharedArrayBuffer can be used. This not only avoids holding redundant models in memory, but also ensures the models are available straight away, as the main thread loads them into memory. → opted for dedicated threads (one or multiple) per model
    • Make the matching worker use the embedding worker for embedding, rather than running model inference itself
  • While the current workflow checks for contradictions between task and competence, the creation of a competence list should also trigger a pairwise check for contradictions between the competences themselves. If a contradiction is found, a warning should be sent back to the client.
  • Proper Error responses
  • Before the server starts, availability checks for Ollama models as well as models used via Hugging Face are run, which trigger a model pull if a model is not already available. Since the Ollama instance sits behind an nginx proxy and pulling may take quite a long time, asking Ollama to pull a model can end in a timeout error (nginx assumes the upstream server, here Ollama, never answered and returns a 504 Gateway Timeout). This should be caught by the server instead of exiting with an error, as it does now (see the sketch after this list).
  • Quantise a bigger model for embedding (maybe this one)
  • Make semantic splitting deactivatable via env; the string size should be settable as well
  • Logging:
    • Track the client IP via the X-Real-IP header
    • Add time-based log deletion, settable via env
  • Create a figure showing the workflow of the worker threads
  • Simplify Ranking to give one clear result
  • Create small benchmark to compare different models
  • Worker lifecycle: kill-and-respawn after x (time/loads/...)? Or maybe alive checks/pings to monitor worker status
  • Add a holistic and comprehensive README with all instructions needed to run the service, including flow charts
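
Regarding the 504 issue above, a sketch of a tolerant startup check; the retry count and backoff are illustrative assumptions, while the endpoint and body follow Ollama's documented /api/pull API:

```typescript
// Treat a proxy timeout during a model pull as "still in progress"
// instead of crashing the server at startup.
async function ensureOllamaModel(baseUrl: string, model: string): Promise<void> {
  for (let attempt = 1; attempt <= 5; attempt++) {
    try {
      const res = await fetch(`${baseUrl}/api/pull`, {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({ model, stream: false }),
      });
      if (res.ok) return; // pull finished or model already present
      // nginx returns 504 when the pull outlives the proxy timeout, even
      // though Ollama keeps pulling in the background: wait and re-check.
      console.warn(`pull of ${model} returned ${res.status}, retry ${attempt}/5`);
    } catch (err) {
      console.warn(`pull of ${model} failed (${err}), retry ${attempt}/5`);
    }
    await new Promise((resolve) => setTimeout(resolve, 30_000 * attempt));
  }
  throw new Error(`model ${model} is still not available after retries`);
}
```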

MaxiLein added 15 commits July 29, 2025 15:37
- Update dev script in package.json to watch for .env changes
- Add dotenv dependency for environment variable management
- Modify config to include ollamaBearerToken from environment variables
- Ensure asynchronous model loading in ensureAllHuggingfaceModelsAreAvailable
- Include ollamaBearerToken in Ollama instance headers

@MaxiLein MaxiLein marked this pull request as ready for review August 19, 2025 11:56
…r service

- Introduced custom error classes for better error context and handling.
- Updated middleware to handle database errors and validation errors gracefully.
- Improved logging for worker management, model initialization, and semantic splitting tasks.
- Added verbose logging options to provide detailed runtime information.
- Refactored resource retrieval functions to throw specific errors for better debugging.
- Enhanced the reasoning and semantic splitting tasks with detailed error logging.
- Implemented error handling in worker management to capture and log worker failures.
- Updated the server initialisation process to handle model availability checks with error handling.

…atching tasks

- Updated default batch size for Ollama from 5 to 20.
- Introduced new configuration options for embedding and matching workers.
- Improved error handling and logging in worker processes.
- Refactored worker manager to support static worker pools for embedding and matching tasks.
- Added health check mechanism for worker responsiveness.
- Implemented job processing logic to handle multiple tasks efficiently.
- Enhanced logging for better traceability of worker actions and statuses.
- Replaced console logging with a centralized logger in model, ollama, worker, and embedder modules.
- Introduced structured logging with log levels (DEBUG, INFO, WARN, ERROR) and log types (server, request, worker, etc.).
- Enhanced worker context management to propagate request IDs and log worker activities.
- Removed verbose flag usage and replaced it with appropriate logging levels.
- Improved error handling and logging in worker pools and job processing.
- Cleaned up deprecated log structures and ensured consistent logging practices throughout the codebase.

@MaxiLein MaxiLein marked this pull request as ready for review January 20, 2026 10:51
@github-actions

CLOUDRUN ACTIONS

✅ Successfully created Preview Deployment.

https://pr-627---ms-server-staging-c4f6qdpj7q-ew.a.run.app
