This project performs semantic search using a custom-trained mapper and FAISS/ScaNN. https://colab.research.google.com/drive/1iYPJ2S3M5cEdvQSaa9UtFJ_846ZLO2Y-#scrollTo=B9ja7XCi8HUx put the dataset in dataset.py not preprocess.py
- FAISS/ScaNN vector search
- SentenceTransformer embeddings
- Custom trainable mapper network
- HotpotQA + MS MARCO datasets
pip install -r requirements.txt
python -m spacy download en_core_web_sm