Clone the repository
git clone https://github.com/
Create a virtual environment
conda create -n cpullama python=3.8 -y
conda activate cpullama
pip install -r requirements.txt
python app.py
Download the quantized model from the link provided in the model folder and keep it in the model directory:
llama-2-7b-chat.ggmlv3.q4_0.bin
https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGML/tree/main
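A quick way to confirm the model file is where the app expects it is a small check like the one below. This is only a sketch under assumptions: the directory is taken to be named `model` and located in the repository root, the helper names `find_model` and `download_model` are hypothetical, and the optional download path assumes the `huggingface_hub` package is installed separately (it is not stated to be part of `requirements.txt`).

```python
from pathlib import Path
from typing import Optional

# File name and repository are taken from the instructions above.
MODEL_FILENAME = "llama-2-7b-chat.ggmlv3.q4_0.bin"
MODEL_REPO = "TheBloke/Llama-2-7B-Chat-GGML"


def find_model(model_dir: str = "model") -> Optional[Path]:
    """Return the path to the model file if it is present, else None.

    The default directory name "model" is an assumption.
    """
    path = Path(model_dir) / MODEL_FILENAME
    return path if path.is_file() else None


def download_model(model_dir: str = "model") -> Path:
    """Fetch the model file into model_dir using huggingface_hub.

    Assumes huggingface_hub is installed; it is imported lazily so the
    check above still works without it.
    """
    from huggingface_hub import hf_hub_download

    Path(model_dir).mkdir(parents=True, exist_ok=True)
    local_path = hf_hub_download(
        repo_id=MODEL_REPO,
        filename=MODEL_FILENAME,
        local_dir=model_dir,
    )
    return Path(local_path)
```

Calling `find_model()` from the repository root returns the file path when the model is in place and `None` otherwise. In the latter case, either download the file manually from the link above or, if `huggingface_hub` is available, call `download_model()`.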