🎥 Semantic Video Retrieval System

A production-ready, multimodal video retrieval system that matches videos to natural language queries using powerful vision-language models (BLIP) and Vector Databases (ChromaDB).

This version includes a clean UI, speed optimizations, and robust error handling.

✨ Key Features

Natural Language Search: "Find the clip where a dog is running on grass."
Optimized Performance: Adjustable frame sampling (doesn't process every single frame, making it 5-10x faster).
Video Summary Generation: Automatically captions video content.
Clean UI: Tabbed interface for Search, Upload, and Library management.
Caching: Models load once, preventing slow reloads.

🧠 Tech Stack

Component	Technology
Language	Python
Vision Model	BLIP (Salesforce)
Vector DB	ChromaDB
Audio Processing	ffmpeg, pydub
Frame Extraction	OpenCV
UI / Deployment	Streamlit

🛠️ Installation

Clone the repository (or download the files).

Create a virtual environment (recommended):

python -m venv venv

# Windows
venv\Scripts\activate

# Mac/Linux
source venv/bin/activate

Install dependencies

pip install -r requirements.txt

#start the demo

streamlit run run.py

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.gitignore		.gitignore
README.md		README.md
database.py		database.py
packages.txt		packages.txt
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎥 Semantic Video Retrieval System

✨ Key Features

🧠 Tech Stack

🛠️ Installation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

jayanthkonanki/Semantic-Video-Retrieval-System

Folders and files

Latest commit

History

Repository files navigation

🎥 Semantic Video Retrieval System

✨ Key Features

🧠 Tech Stack

🛠️ Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages