ColBERT CRUD Application

About the Project

[Detailed description of the project yet to come]

This project implements a CRUD (Create, Read, Update, Delete) application using ColBERT (Contextualized Late Interaction over BERT) for efficient document retrieval and search. The application includes a data portal interface for easy interaction with the ColBERT system.

Key Features

Document indexing and retrieval using ColBERT
Web interface for searching documents
CRUD operations for document management
Training and evaluation of ColBERT models

Prerequisites

System Requirements

Unix/Linux environment (recommended)
- Windows users: Consider using WSL or Conda environment
Python 3.8+
CUDA-capable GPU (recommended for optimal performance)

Environment Setup

Clone the repository:

git clone git@github.com:ibohaji/Colbert-Crud-App-ess.git
cd Colbert-Crud-App-ess

Create and activate a virtual environment:

# Linux/Unix
python -m venv myenv
source myenv/bin/activate

# Windows (if not using WSL)
# Consider using Conda: https://docs.conda.io/projects/conda/en/latest/user-guide/install/

Install dependencies:

pip install -r requirements.txt

Running the Application

Start the Data Portal

python -m dataportal.app_colbert

The portal will be available at http://localhost:5000

Adding Documents to the Portal

Prepare your documents in JSON format:

{
    "doc_id": {
        "title": "Document Title",
        "text": "Document content..."
    }
}

Use the API endpoint:

curl -X POST http://localhost:5000/index \
     -H "Content-Type: application/json" \
     -d @your_documents.json

Acknowledgments & Credits

This project is built using ColBERTv2, an efficient and effective neural search engine:

Original Paper: "ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction"
Authors: Omar Khattab, Christopher Potts, and Matei Zaharia
Official Repository: stanford-futuredata/ColBERT
ColBERT-AI Library: colbert-ai

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
colbert_v2		colbert_v2
data		data
dataportal		dataportal
inference		inference
notebooks		notebooks
scripts		scripts
.gitignore		.gitignore
88		88
README.md		README.md
requirements.txt		requirements.txt
run_inference.py		run_inference.py
supsamples_run.py		supsamples_run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ColBERT CRUD Application

About the Project

Key Features

Prerequisites

System Requirements

Environment Setup

Running the Application

Start the Data Portal

Adding Documents to the Portal

Acknowledgments & Credits

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ColBERT CRUD Application

About the Project

Key Features

Prerequisites

System Requirements

Environment Setup

Running the Application

Start the Data Portal

Adding Documents to the Portal

Acknowledgments & Credits

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages