PDF Buddy-RAG-Haystack is a Retrieval-Augmented Generation (RAG) application that allows users to upload PDF documents, process their content, and interact with them using a chatbot interface. Built using Haystack, Gradio, and Hugging Face API, this project enables users to retrieve relevant document content and generate AI-powered responses to queries.
To run this project locally, ensure you have Python installed and then install the required dependencies:
pip install gradio haystack-ai pypdfRun the script to launch the Gradio web interface:
python app.pyThen, open the interface in your browser, upload a PDF, and start interacting with the chatbot.
- Document Processing Pipeline: Converts, cleans, and embeds PDF text for retrieval.
- Query Pipeline: Retrieves relevant document snippets and generates AI-powered answers.
- Gradio Chatbot: Provides an easy-to-use web-based interface for user interaction.
- Upload a PDF: The document is processed and stored.
- Ask a Question: The system retrieves relevant document content.
- Generate an Answer: The AI model formulates a response based on the retrieved content.
- Haystack for document processing and retrieval.
- Hugging Face API for response generation.
- Gradio for the user interface.
- SentenceTransformers for text embeddings.