This project uses a Hybrid Search approach where image captions are generated using gemma-3-4b-it, and a search is conducted using both:
- Vector Search: Combining the vector embeddings of both the image and its generated caption.
- Keyword Search: Leveraging the captions of the images for text-based search.
The Hybrid Search merges both search types into one query to improve accuracy and retrieval quality. The retrieved objects are then passed to a reranker model, which re-scores them against the context of the query so that the most relevant results rank highest.
- gemma-3-4b-it for Caption Generation: Captions are generated for images using the gemma-3-4b-it model.
- Vector Search: Utilizes embeddings of both the images and their captions to perform semantic search.
- Keyword Search: Searches are also performed using keywords extracted from image captions.
- Hybrid Search: A combination of vector and keyword searches to return the most relevant results.
- Reranker: A model that refines the order of search results, ensuring that the most relevant documents or items are ranked higher. It goes beyond the initial retrieval step, considering additional factors such as semantic similarity, context, and other relevant features.
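For orientation, here is a minimal hybrid query sketch using the Weaviate Python client (v4). The collection name `SageImage` and the local connection are illustrative assumptions, not the service's actual configuration:

```python
# Minimal hybrid-search sketch with the Weaviate Python client (v4).
# "SageImage" and the local connection are illustrative assumptions.
import weaviate

client = weaviate.connect_to_local()
try:
    images = client.collections.get("SageImage")
    # alpha balances vector search (1.0) against keyword/BM25 search (0.0).
    response = images.query.hybrid(
        query="smoke plume rising over a ridge",
        alpha=0.5,
        limit=10,
    )
    for obj in response.objects:
        print(obj.uuid, obj.properties)
finally:
    client.close()
```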
For this service to work, you need to have the following credentials:
- SAGE_USER: Your SAGE username
- SAGE_TOKEN: Your SAGE token
- HF_TOKEN: Your Hugging Face token
Your Sage credentials need access to images on Sage; any images that you don't have access to will be skipped.
Your Hugging Face token needs access to the models that are used in this service.
This repository includes a GitHub Action that builds and pushes Docker images for all Hybrid Image Search microservices to NRP's public image registry. The workflow runs automatically on pushes to the main branch and on pull requests, detecting changed services and publishing their updated images to the configured container registry.
envs:
cp .env.example .env
Make sure to fill in the secrets (top three env vars)
Run:
docker compose up -d --build
Clean up:
docker compose down
All together:
docker compose down && docker compose up -d --build
Clean up (volumes):
docker compose down --volumes
Notes:
- Triton might not be able to load one of the models (CLIP or gemma3), or it may hit OSErrors while loading the model weights. As a workaround, download the models to your local directory and then copy them into the container:
source .env # assumes that HF_TOKEN is set
cd triton
python3 -m venv env
source env/bin/activate
pip install -r requirements.txt
huggingface-cli download --local-dir DFN5B-CLIP-ViT-H-14-378 --revision "$CLIP_MODEL_VERSION" apple/DFN5B-CLIP-ViT-H-14-378
huggingface-cli download --local-dir gemma-3-4b-it --revision "$GEMMA_MODEL_VERSION" google/gemma-3-4b-it
docker cp DFN5B-CLIP-ViT-H-14-378 sage-nrp-image-search-triton-1:/models/
docker cp gemma-3-4b-it sage-nrp-image-search-triton-1:/models/
Developed and tested with these versions of k8s and kustomize:
Client Version: v1.29.1
Kustomize Version: v5.0.4
Create k8s secrets for Sage credentials by editing the sage-user-secret.yaml file.
Create k8s secrets for Hugging Face credentials by editing the huggingface-secret.yaml file.
Deploy all services:
kubectl apply -k nrp-dev   # or nrp-prod
Delete all services:
kubectl delete -k nrp-dev   # or nrp-prod
Debugging - output to yaml:
kubectl kustomize nrp-dev -o sage-image-search-dev.yaml
kubectl kustomize nrp-prod -o sage-image-search-prod.yaml
- Caption Generation with gemma-3-4b-it: The gemma-3-4b-it model generates captions for images, allowing for both semantic and keyword-based search.
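As a rough illustration of the captioning step (outside the Triton serving path), a hedged sketch using the Hugging Face transformers image-text-to-text pipeline. A recent transformers release with Gemma 3 support and an HF_TOKEN with access to the gated model are assumed; the image URL and prompt are placeholders:

```python
# Hedged sketch of caption generation with gemma-3-4b-it via transformers.
# The production service serves the model through Triton instead.
from transformers import pipeline

captioner = pipeline("image-text-to-text", model="google/gemma-3-4b-it")

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/sample.jpg"},  # placeholder image
            {"type": "text", "text": "Describe this image in one detailed sentence."},
        ],
    }
]

output = captioner(text=messages, max_new_tokens=64)
print(output[0]["generated_text"][-1]["content"])
```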
- Vector Search: The embeddings of the images and their captions are stored in Weaviate. When a query is made, the relevant vectors are retrieved using similarity search (e.g., cosine similarity).
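For reference, the similarity measure behind the vector search, shown on toy vectors (the numbers are made up):

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

query_vec = np.array([0.1, 0.7, 0.2])   # toy query embedding
image_vec = np.array([0.2, 0.6, 0.1])   # toy image/caption embedding
print(cosine_similarity(query_vec, image_vec))
```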
- Keyword Search: The captions are indexed and can be searched with keywords, enabling traditional text-based search (e.g., the BM25 algorithm).
- Hybrid Search: A hybrid search combines the results from the vector search and the keyword search, improving relevance by considering both semantic similarity and exact text matches.
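To make the fusion step concrete, a toy sketch that min-max normalizes BM25 and vector scores and blends them with a weight alpha, similar in spirit to Weaviate's hybrid fusion (which happens inside Weaviate). The captions, vector scores, and alpha are made up:

```python
# Toy fusion of keyword (BM25) and vector scores.
import numpy as np
from rank_bm25 import BM25Okapi  # pip install rank-bm25

captions = [
    "a red pickup truck parked near a fire hydrant",
    "smoke rising from a hillside at sunset",
    "a crowded intersection with buses and bicycles",
]
query = "smoke on a hillside"

bm25 = BM25Okapi([c.split() for c in captions])
keyword_scores = np.array(bm25.get_scores(query.split()))

# Stand-ins for real embedding similarities of the same three objects.
vector_scores = np.array([0.21, 0.88, 0.34])

def minmax(x: np.ndarray) -> np.ndarray:
    return (x - x.min()) / (x.max() - x.min() + 1e-9)

alpha = 0.5  # weight on the vector side
hybrid_scores = alpha * minmax(vector_scores) + (1 - alpha) * minmax(keyword_scores)

for score, caption in sorted(zip(hybrid_scores, captions), reverse=True):
    print(f"{score:.3f}  {caption}")
```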
- Reranking: After retrieving the results, a reranker model evaluates them against the original query, taking context into account so that the most relevant and accurate results are returned.
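A minimal reranking sketch with a cross-encoder from sentence-transformers. The hub id below corresponds to the ms-marco-MiniLM-L6-v2 reranker mentioned later in this README (the deployed model id may differ); the query and candidate captions are made up:

```python
# Cross-encoder reranking sketch: score each (query, caption) pair directly.
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

query = "smoke on a hillside"
candidates = [
    "smoke rising from a hillside at sunset",
    "a crowded intersection with buses and bicycles",
    "a red pickup truck parked near a fire hydrant",
]

scores = reranker.predict([(query, c) for c in candidates])
for score, caption in sorted(zip(scores, candidates), reverse=True):
    print(f"{score:.3f}  {caption}")
```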
- Weaviate Documentation
- Triton Documentation
- Benchmark the existing deployment using the new framework
- try prompt repetition to see if it can improve caption generation performance
- https://arxiv.org/pdf/2512.14982
- Paper Insights:
- Repeating the full prompt (<QUERY><QUERY>) improves accuracy in many non-reasoning settings.
- Gains were consistent across multiple major models.
- It does not increase output length or generation latency (only input length).
- Benefits shrink when explicit reasoning (“think step by step”) is enabled.
- Repetition x3 showed even better results than x2.
- Repeating a long, structured prompt (like our scientific captioning prompt) is more likely to see gains than a short, simple instruction.
- Repetition may improve:
- Format compliance
- Keyword count accuracy
- Constraint adherence
- It will double input tokens, so cost matters at scale.
- Remember to add the paper to the references section if you decide to implement this.
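A minimal sketch of what prompt repetition could look like when building the captioning request; the prompt text is a placeholder, not the service's actual captioning prompt:

```python
# Prompt-repetition sketch: send the full captioning prompt two or three times
# in a row. CAPTION_PROMPT is a placeholder, not the service's actual prompt.
CAPTION_PROMPT = (
    "Describe the image for scientific search: list the visible objects, "
    "weather conditions, and any text that appears in the image."
)

def repeat_prompt(prompt: str, times: int = 2) -> str:
    """Return the full prompt repeated back-to-back (x2 or x3 per the paper)."""
    return "\n\n".join([prompt] * times)

doubled = repeat_prompt(CAPTION_PROMPT, times=2)
tripled = repeat_prompt(CAPTION_PROMPT, times=3)
print(len(doubled), len(tripled))  # only the input grows; output length is unchanged
```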
- look into using text encoders only, to see whether caption-to-query comparisons alone are enough or even improve retrieval with embeddings. Essentially, the image would no longer be embedded in the same vector space as the captions.
- embeddinggemma model: https://huggingface.co/google/embeddinggemma-300m
- E5-mistral-7b-instruct: https://huggingface.co/intfloat/e5-mistral-7b-instruct
- this is hosted by NRP so it will be easy to use.
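A caption-only retrieval sketch, under the assumption that google/embeddinggemma-300m loads through a recent sentence-transformers release (and that your HF_TOKEN has access); e5-mistral-7b-instruct could be swapped in the same way. The captions and query are made up:

```python
# Caption-only retrieval: embed captions and the query with a text encoder and
# rank by cosine similarity (no image embeddings involved).
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("google/embeddinggemma-300m")

captions = [
    "smoke rising from a hillside at sunset",
    "a crowded intersection with buses and bicycles",
]
query = "wildfire smoke"

caption_embs = model.encode(captions, normalize_embeddings=True)
query_emb = model.encode(query, normalize_embeddings=True)

scores = util.cos_sim(query_emb, caption_embs)[0]
for score, caption in sorted(zip(scores.tolist(), captions), reverse=True):
    print(f"{score:.3f}  {caption}")
```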
- add structured output to the caption generation model to better format the output
- maybe this can be used: https://github.com/guidance-ai/guidance
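Independent of the guidance library, one lightweight option is to request JSON from the captioner and validate it against a schema; the fields below are hypothetical, not the service's actual output format:

```python
# Hypothetical structured-caption schema: ask the model for JSON, then validate.
from pydantic import BaseModel, ValidationError

class StructuredCaption(BaseModel):
    caption: str
    keywords: list[str]
    contains_text: bool

raw_output = '{"caption": "smoke over a ridge", "keywords": ["smoke", "ridge"], "contains_text": false}'

try:
    parsed = StructuredCaption.model_validate_json(raw_output)
    print(parsed.keywords)
except ValidationError as err:
    # Re-prompt or log when the model drifts off-format.
    print("caption did not match the schema:", err)
```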
- Benchmark Milvus@NRP
- switch to reranking with CLIP (DFN5B-CLIP-ViT-H-14-378)
- before making the switch permanent run the benchmarking suite to see if there are any regressions
- firebench results show that it is better than the current reranker model (ms-marco-MiniLM-L6-v2)
- look into MMR (maximal marginal relevance) to see if it improves reranking, or implement it as a "toggle" applied only to certain queries.
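A self-contained sketch of MMR over embedding vectors (toy data); with lam=1.0 it reduces to plain relevance ranking, which is what would make the proposed toggle straightforward:

```python
# Maximal Marginal Relevance (MMR): trade off relevance to the query against
# redundancy among already-selected results.
import numpy as np

def mmr(query_vec, doc_vecs, k=5, lam=0.7):
    """Return indices of k documents chosen by MMR (lam=1.0 is pure relevance)."""
    def cos(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

    relevance = [cos(query_vec, d) for d in doc_vecs]
    selected, remaining = [], list(range(len(doc_vecs)))
    while remaining and len(selected) < k:
        def marginal(i):
            redundancy = max((cos(doc_vecs[i], doc_vecs[j]) for j in selected), default=0.0)
            return lam * relevance[i] - (1 - lam) * redundancy
        best = max(remaining, key=marginal)
        selected.append(best)
        remaining.remove(best)
    return selected

rng = np.random.default_rng(0)
doc_vecs = rng.normal(size=(10, 8))  # toy embeddings
query_vec = rng.normal(size=8)
print(mmr(query_vec, doc_vecs, k=3))
```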
- Integrate ShieldGemma 2 to implement policies and mark images yes/no according to whether they violate a policy
- add a heartbeat metric for Sage Object Storage (nrdstor)
- specifically here in the code: https://github.com/waggle-sensor/sage-nrp-image-search/blob/main/weavloader/processing.py#L159
- add a metric to count the images that have been indexed into the vectordb
- this answers the question "What is the total number of images that have been indexed?"
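A sketch of the two proposed metrics using prometheus_client (which fits the Prometheus + Grafana tooling mentioned under System-Level Performance); the metric names and the port are assumptions, not existing weavloader instrumentation:

```python
# Sketch of the proposed heartbeat and indexed-count metrics.
import time
from prometheus_client import Counter, Gauge, start_http_server

images_indexed_total = Counter(
    "images_indexed_total", "Total number of images indexed into the vector DB"
)
objectstore_last_heartbeat = Gauge(
    "sage_objectstore_last_heartbeat_seconds",
    "Unix time of the last successful Sage Object Storage (nrdstor) access",
)

start_http_server(8000)  # expose /metrics for Prometheus to scrape

# Inside the loader loop:
images_indexed_total.inc()                   # after each successful upsert
objectstore_last_heartbeat.set(time.time())  # after each successful storage call
```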
- Use other benchmarks to test image retrieval in other domains (e.g., Urban) & System-Level Performance
- see imsearch_benchmarks for the existing benchmarks
- Sage focused
- get a sample of images and create queries based on the metadata. For example, "animals in W09E"
- these can also just be images from Sage, so the benchmark truly tests the system's image retrieval capabilities on real data.
- Urban-Focused
- CityFlow-NL (Natural Language Vehicle Retrieval): A benchmark introduced via the AI City Challenge for retrieving traffic camera images of vehicles based on descriptions. Built on the CityFlow surveillance dataset, it provides 5,000+ unique natural language descriptions for 666 target vehicles captured across 3,028 multi-camera tracks in a city. Descriptions include vehicle attributes (color, type), motion (e.g. “turning right”), and surrounding context (other vehicles, road type).
- Relevance: Focused on urban street scenes: traffic surveillance footage from a city, featuring cars, trucks, intersections, etc.
- Evaluation: Uses ranking metrics similar to person search: the challenge reports mAP (mean average precision) over the top 100 retrieved results, as well as Recall@1/5/10 hit rates for each query. For instance, the baseline in one study achieved ~29.6% Recall@1 and ~64.7% Recall@10, illustrating the task difficulty.
- Access: Introduced in the AI City Challenge 2021 (Track 5). Available through the challenge organizers (download via the AI City Challenge website; data request required) or the authors' GitHub repository, which provides code and data links for CityFlow-NL.
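Since several of these benchmarks report Recall@K, here is a small helper for computing it (the IDs are toy values):

```python
# Recall@K: the fraction of queries whose relevant item appears in the top-K
# retrieved results.
def recall_at_k(ranked_ids_per_query, relevant_id_per_query, k):
    hits = sum(
        1
        for ranked, relevant in zip(ranked_ids_per_query, relevant_id_per_query)
        if relevant in ranked[:k]
    )
    return hits / len(relevant_id_per_query)

ranked = [["img7", "img2", "img9"], ["img4", "img1", "img5"]]  # top-3 IDs per query
relevant = ["img2", "img8"]                                    # ground truth per query
print(recall_at_k(ranked, relevant, k=3))  # 0.5
```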
- text extraction benchmarks
- for example, how well the image search can return images based on text found in the image
- to do this, gather many images containing text and use imsearch_benchmaker to create the benchmark.
- Compositional & Expert-Level Retrieval Benchmarks
- Cola (Compositional Localized Attributes): A compositional text-to-image retrieval benchmark (NeurIPS 2023) designed to test fine-grained understanding of object-attribute combinations. Cola contains ~1,236 queries composed of 168 objects and 197 attributes (e.g. “red car next to blue car”, “person in yellow shirt riding a bike”) with target images drawn from about 30K images. Each query has challenging confounders (distractor images that have the right objects but the wrong attribute pairing).
- Relevance: Not specific to urban scenes, but many queries involve everyday objects (cars, people, etc. in various configurations), making it useful for evaluating relational understanding in images.
- Evaluation: Measures whether the system retrieves the correct image that satisfies the composed query. Metrics include Recall@1 (accuracy); human performance is ~83% on this benchmark. The goal is to push models to avoid retrieving images with only partial matches (only one attribute-object pair correct).
- Access: The authors provide a project page and data download (Boston University); see the Cola project page for the dataset and instructions.
- Geographical Focused
- https://www.flickr.com/groups/geographical_landforms/pool/
- Description and purpose: A collection of images of geographical landforms, including mountains, rivers, oceans, and other natural features.
- Atmospheric Science Focused (Focusing on weather)
- I don't have a dataset for this yet
- Catastrophe Focused
- https://arxiv.org/abs/2201.04236
- Description and purpose: A dataset of images of catastrophes, including earthquakes, floods, fires, etc.
- System-Level Performance Benchmarks
- Latency
- Time taken per query (cold start vs. warm cache)
- Breakdown: captioning time, vector embedding, fusion, reranking, search
- Throughput
- Number of queries processed per second/minute
- Use Locust, JMeter, or k6 for load testing
- Scalability
- Horizontal (multiple Weaviate shards, vector databases, reranker replicas)
- Measure with increased concurrent queries, dataset size growth
- Resource Usage
- CPU, RAM, disk (capture the image size), and GPU usage per component (captioner, embedder, Weaviate, reranker)
- Use tools like Prometheus + Grafana, htop, nvidia-smi
- Cold Start Time
- How long to become operational from scratch?
- Important for containerized deployments
- examples here: https://chatgpt.com/c/684b1286-1144-8003-8a20-85a1045375c3
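A minimal latency-benchmark sketch that times repeated queries and reports percentiles; run_query is a hypothetical stand-in for a call to the search endpoint:

```python
# Minimal per-query latency benchmark with percentile reporting.
import statistics
import time

def run_query(text: str) -> None:
    """Placeholder for an actual call to the image-search API."""
    time.sleep(0.05)  # simulate work

queries = ["smoke on a hillside", "flooded street", "snow on solar panels"]
latencies = []
for _ in range(20):
    for q in queries:
        start = time.perf_counter()
        run_query(q)
        latencies.append(time.perf_counter() - start)

pct = statistics.quantiles(latencies, n=100)  # 99 percentile cut points
print(f"p50={pct[49]*1000:.1f}ms  p95={pct[94]*1000:.1f}ms  p99={pct[98]*1000:.1f}ms")
```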
- Indexing and Update Benchmarks
- Indexing Time
- How long to ingest N images and generate embeddings/captions?
- Parallelization efficiency
- use Weaviate Benchmarks CLI
- Incremental Update Latency
- Time between new image upload and being searchable
- examples here: https://chatgpt.com/c/684b1286-1144-8003-8a20-85a1045375c3
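A sketch for measuring incremental-update latency (time from upload to searchable); upload_image and search are hypothetical stand-ins for the real Sage upload and image-search calls:

```python
# Upload an image, then poll the search API until it becomes searchable.
import time

def upload_image(path: str) -> str:
    """Placeholder: push an image to Sage Object Storage and return its ID."""
    return "img-123"

def search(query: str) -> list[str]:
    """Placeholder: return result IDs from the image-search service."""
    return []

def time_to_searchable(path: str, query: str, timeout_s: float = 600.0) -> float:
    image_id = upload_image(path)
    start = time.perf_counter()
    while time.perf_counter() - start < timeout_s:
        if image_id in search(query):
            return time.perf_counter() - start
        time.sleep(5)
    raise TimeoutError(f"{image_id} was not searchable within {timeout_s}s")
```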
- turn on batching for Triton and utilize it in weavloader