Search engine project for the Information Retrieval course from CentraleSupelec.
Python 3.5+ is needed.
For the libraries, see requirements.txt:
pip install -r requirements.txtYou also need to put the data for both collections in a folder data (more precisely: data/CACM and data/pa1-data).
Please run Rapport.ipynb or check the notebook.
We built our model using SciPy sparse matrices.