infoSphere RAG: AI-Powered Document Processing System 🤖

infoSphere RAG is an AI-powered document processing system that allows users to upload and analyze various document types. It leverages a combination of Python libraries and Google's Generative AI to extract meaningful content, answer questions, and generate summaries from document data.

🌟 Key Features

PDF & Image Processing: Upload and process both PDF and image files, including scanned documents.
AI-Powered Query Processing: Use Google Generative AI in conjunction with LangChain to ask questions and get intelligent answers based on your documents.
Vector Search: Efficiently retrieve relevant information from documents using a FAISS vector store.
Flexible Text Extraction: The system uses PyMuPDF for native text extraction from PDFs and falls back to pytesseract (OCR) for scanned PDFs and images.
Interactive Web Interface: A user-friendly Gradio web interface makes it easy to upload files and interact with the AI assistant.
Environment Variable Support: Manage API keys and other configurations securely using a .env file.
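
The native-then-OCR fallback described under Flexible Text Extraction could be sketched roughly as follows. This is an illustration, not the project's actual code: the function name `extract_page_text`, the `min_chars` threshold, and the idea of passing the OCR step in as a callable are all assumptions made so the example runs without PyMuPDF or Tesseract installed.

```python
from typing import Callable, Tuple


def extract_page_text(
    native_text: str,
    run_ocr: Callable[[], str],
    min_chars: int = 20,  # hypothetical threshold, not taken from the project
) -> Tuple[str, str]:
    """Return (text, method), preferring native text and falling back to OCR.

    In a real pipeline, native_text would presumably come from PyMuPDF
    (for example page.get_text()) and run_ocr would wrap a call such as
    pytesseract.image_to_string(image). Both are assumptions here.
    """
    cleaned = native_text.strip()
    if len(cleaned) >= min_chars:
        # Enough embedded text: treat the page as a native (non-scanned) page.
        return cleaned, "native"
    # Little or no embedded text: assume a scanned page and run OCR instead.
    return run_ocr().strip(), "ocr"
```

Passing the OCR step as a callable means the expensive OCR call only happens when the native text is too short to be useful.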

🚀 How It Works

infoSphere RAG breaks uploaded documents into smaller, meaningful chunks. Each chunk is converted into a numerical representation called an embedding, using either Google's text-embedding-004 model or a local Sentence Transformer model. The embeddings are then stored in a FAISS vector store.
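
As a minimal sketch of the chunking step, the function below splits text into overlapping character windows. The name `chunk_text`, the default sizes, and the character-based strategy are assumptions for illustration; the project may split on sentences or tokens instead.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into character windows that overlap by `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            # The current window already reaches the end of the text.
            break
    return chunks
```

The overlap keeps a sentence that straddles a boundary visible in both neighboring chunks, which tends to help retrieval at the cost of some duplicated storage.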

When you ask a question, the system converts your query into an embedding and searches the vector database for the most relevant document chunks. These chunks are then provided as context to a Google Generative AI model, which generates a concise, accurate answer based only on the provided information.
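
The query flow above can be illustrated with a small, dependency-free sketch. It is not the project's implementation: word counts stand in for real embeddings, a sorted list stands in for the FAISS search, and the names `embed`, `cosine`, `retrieve`, and `build_prompt` are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    # Toy stand-in for an embedding model: lowercase word counts.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    # Rank every chunk against the query; a vector store avoids this full scan.
    query_vec = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(query_vec, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, context_chunks: list[str]) -> str:
    # Number the chunks so an answer can cite them.
    context = "\n\n".join(
        f"[{i}] {chunk}" for i, chunk in enumerate(context_chunks, start=1)
    )
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )
```

The string returned by `build_prompt` is what would be sent to the generative model; restricting the instructions to the supplied context is what keeps answers grounded in the uploaded documents.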

💻 Setup Instructions

This guide walks you through setting up infoSphere RAG on your local machine.

1. Prerequisites

Python 3.10 or later
VS Code (or another code editor of your choice)
A Google API Key for Generative AI access.

2. Environment Setup

Clone the Repository:

git clone [repository_url]
cd documental

Create a Virtual Environment:
```
python -m venv venv
```
Activate the Virtual Environment:
- On Windows:
```
venv\Scripts\activate
```
- On macOS/Linux:
```
source venv/bin/activate
```
Install Dependencies:
```
pip install -r requirements.txt
```
Configure API Keys: Create a file named .env in the root directory and add your Google API key.
```
GOOGLE_API_KEY="YOUR_API_KEY_HERE"
```
Note: EMBEDDING_BACKEND can be set to "gemini" or "minilm". Gemini embeddings require GOOGLE_API_KEY to be configured. The default setting is "gemini".
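
A hedged sketch of how these settings might be validated at startup is shown below. It assumes EMBEDDING_BACKEND is read as an environment variable and that the .env file has already been loaded into the environment (for example with python-dotenv's `load_dotenv()`); the function name `resolve_embedding_backend` is hypothetical.

```python
import os
from typing import Mapping, Optional

VALID_BACKENDS = ("gemini", "minilm")


def resolve_embedding_backend(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the configured embedding backend, defaulting to "gemini"."""
    env = os.environ if env is None else env
    backend = env.get("EMBEDDING_BACKEND", "gemini").strip().lower()

    if backend not in VALID_BACKENDS:
        raise ValueError(
            f"EMBEDDING_BACKEND must be one of {VALID_BACKENDS}, got {backend!r}"
        )
    if backend == "gemini" and not env.get("GOOGLE_API_KEY"):
        # Gemini embeddings call the Google API, so a key is required.
        raise RuntimeError("GOOGLE_API_KEY is required when using gemini embeddings")
    return backend
```

Failing early with a clear message is usually easier to debug than an authentication error raised later, during the first embedding request.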

3. Running the Project

The project consists of a backend Flask API (app.py) and a frontend Gradio UI (gradio_app.py).

Start the Backend API: Open a terminal and run the following command to start the Flask backend.
```
python app.py
```
Start the Gradio UI: Open a second terminal and run the following command to start the Gradio web interface.
```
python gradio_app.py
```
Access the Application: Open your web browser and navigate to the local address provided by Gradio, typically http://127.0.0.1:7860.

🖼️ User Interface

Home Screen

Upon launching, the Gradio UI provides a straightforward interface to check the backend status and upload documents.

Upload Documents & Images

Use the "Upload Files" tab to drag and drop your PDF and image files. The system will process them and confirm the number of documents and chunks added to the vector store.

Chat with Your Documents

Once files are indexed, switch to the "Chat" tab. You can ask questions and receive answers with citations directly from your uploaded documents.

📚 Libraries and Technologies Used

| Library | Purpose |
| --- | --- |
| Flask | Backend API framework |
| Gradio | Web interface for the application |
| LangChain | AI & Large Language Model (LLM) integration |
| FAISS | Vector database for storing document embeddings |
| PyMuPDF / pytesseract | Used for extracting text from PDFs and images |
| Google Generative AI | The core AI model for generating answers |
| Sentence Transformers | Used for generating embeddings, including a local CPU-friendly model |

🛠️ Common Issues

Missing Modules: If you encounter a ModuleNotFoundError, ensure all dependencies are installed by running pip install -r requirements.txt.
Port Already in Use: If the server fails to start because a port is already in use, you can change the port number in the gradio_app.py or app.py files.
Dependency Conflicts: If pip reports version conflicts, try reinstalling the affected packages in a fresh virtual environment, or adjust the versions listed in requirements.txt.
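
For the port issue above, a quick way to check whether a port is already taken before editing app.py or gradio_app.py is to try binding to it. The helper below is a small standard-library sketch; the name `is_port_free` is hypothetical and not part of the project.

```python
import socket


def is_port_free(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if nothing is currently bound to host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            # Binding fails with OSError if another process holds the port.
            sock.bind((host, port))
        except OSError:
            return False
    return True
```

For example, `is_port_free(7860)` checks the address Gradio typically uses. If it returns False, either stop the process holding the port or choose a different port number.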

If you have questions or feedback, please open an issue in the project repository.
