
Benchmarking update #1

Merged
FranciscoLozCoding merged 123 commits into main from benchmarking
Feb 3, 2026

Conversation


@FranciscoLozCoding FranciscoLozCoding commented Dec 10, 2025

Benchmarking Framework and Infrastructure Abstraction

Overview

This PR introduces an abstract benchmarking framework for vector databases and models, with reusable components and infrastructure abstractions for easy deployment on the National Research Platform (NRP). The framework code is provided by the imsearch_eval Python package, while this repository contains benchmark implementations and deployment infrastructure.

Framework Architecture

The benchmarking framework is provided by the imsearch_eval package.
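To make the abstraction concrete, here is a minimal sketch of the adapter pattern such a framework typically uses: benchmarks code against a common interface, and each vector database plugs in behind it. All class and method names here (`VectorDBAdapter`, `InMemoryAdapter`, `insert`, `search`) are hypothetical illustrations, not the actual imsearch_eval API.

```python
from abc import ABC, abstractmethod

# Illustrative sketch only: names are hypothetical, not the
# real imsearch_eval API.

class VectorDBAdapter(ABC):
    """Common interface a benchmark uses, regardless of backend."""

    @abstractmethod
    def insert(self, ids, vectors):
        ...

    @abstractmethod
    def search(self, query, top_k):
        ...

class InMemoryAdapter(VectorDBAdapter):
    """Toy backend: brute-force L2 search over a Python dict."""

    def __init__(self):
        self.store = {}

    def insert(self, ids, vectors):
        self.store.update(zip(ids, vectors))

    def search(self, query, top_k):
        # Rank stored ids by squared Euclidean distance to the query.
        def dist(vec):
            return sum((a - b) ** 2 for a, b in zip(query, vec))
        ranked = sorted(self.store, key=lambda i: dist(self.store[i]))
        return ranked[:top_k]

db = InMemoryAdapter()
db.insert(["a", "b"], [[0.0, 1.0], [1.0, 0.0]])
print(db.search([0.9, 0.1], top_k=1))  # ['b']
```

Swapping Milvus for Pinecone then means swapping the adapter, while the benchmark code stays unchanged.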

Infrastructure Abstraction

Kubernetes Deployment

  • Kustomize-based structure with base configurations and benchmark-specific overlays
  • Vector DB and inference server agnostic: Base configurations work with any stack
  • Each benchmark overlay adds environment variables for its specific services
  • Template system for creating new benchmark deployments

Dockerfile System

  • Template-based Dockerfiles for consistent container builds
  • Separate containers for data loading and evaluation
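A template Dockerfile for one of these containers might look like the sketch below. The base image, file names, and install step are assumptions for illustration; only the package name comes from this PR:

```dockerfile
# Hypothetical evaluation-container template; paths and base
# image are illustrative, not copied from the repository.
FROM python:3.11-slim
WORKDIR /app
RUN pip install --no-cache-dir imsearch_eval
COPY evaluate.py .
CMD ["python", "evaluate.py"]
```

A sibling template with a different entrypoint script would cover the data-loading container.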

Makefile System

  • Base Makefile with common commands
  • Benchmark-specific Makefiles that extend the base
  • NRP-compatible commands and configurations
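The extension mechanism could be as simple as an `include` of the shared base, sketched below. The file names, variable, and target are hypothetical, not the repository's actual Makefiles:

```makefile
# Hypothetical benchmark Makefile extending the shared base.
# File paths, variable, and target names are illustrative.
include ../../Makefile.base

BENCHMARK := INQUIRE

deploy:  ## Apply this benchmark's Kustomize overlay on NRP
	kubectl apply -k kubernetes/$(BENCHMARK)
```

Common targets (build, push, clean) live once in the base; each benchmark adds or overrides only what differs.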

Template Directory

  • Complete starter kit for new benchmarks
  • Includes Makefile, Dockerfiles, Kubernetes configs, and Python templates
  • Documentation and quick start guides

Repository Structure

benchmarking/
├── benchmarks/             # Benchmark implementations
│   ├── template/           # Template for creating new benchmarks
│   └── INQUIRE/            # INQUIRE benchmark implementation
│
└── kubernetes/             # Kubernetes deployment configurations
    ├── base/               # Base configurations (vector DB/inference server agnostic)
    └── INQUIRE/            # INQUIRE-specific overlay

Benefits

  1. Faster benchmark creation using templates
  2. Consistent evaluation across benchmarks via shared package
  3. Easy integration of new vector databases and models
  4. Maintainable shared code in separate package repository
  5. NRP-ready deployment infrastructure
  6. Framework stability independent of app/ changes
  7. Flexible infrastructure supporting any vector DB/inference server combination
  8. Easy distribution: Benchmarks can be used in any environment by installing the package

Future Work

This framework enables:

  • Benchmarking Milvus, Pinecone, Qdrant, and other vector databases
  • Adding new datasets with minimal code
  • Comparing different models and vector databases
  • Easy contribution of new adapters to the imsearch_eval package

FranciscoLozCoding and others added 30 commits December 10, 2025 10:41

  • … legacy framework and adapter files. Updated documentation and templates to reflect new structure and dependencies.
  • …unnecessary None checks on processed batches.
@FranciscoLozCoding FranciscoLozCoding merged commit 069eb6e into main Feb 3, 2026
3 checks passed
@FranciscoLozCoding FranciscoLozCoding deleted the benchmarking branch February 3, 2026 17:34
@FranciscoLozCoding FranciscoLozCoding added the `benchmarking` label Feb 12, 2026