Skip to content

AreebAhmad-02/Embedding-Models-Finetuning

About

The repository includes code for generating synthetic datasets using Zypher 7B from Hugging Face, with QA embeddings and anchor-positive similarity scores. It also handles preprocessing of scanned PDFs with OCR via Google Vision API. The code supports model fine-tuning with various loss functions and evaluation using Sentence Transformers evaluator

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors