Invoice Extraction App

A full-stack document extraction application that uses AI to extract structured data from invoice images.

🚀 Features

AI-Powered Extraction: Uses GPT-4o Vision with Gemini fallback for 95%+ accuracy
Supported Formats: PNG, JPG, WEBP, and PDF invoice files
Real-time Progress: SSE streaming shows actual backend processing steps
Edit Before Save: Inline editing with auto-recalculation of totals
SalesOrder Display: View SalesOrderHeader with expandable SalesOrderDetail rows
Excel Database: Saves extracted orders to Extracted_Orders.xlsx
Multiple Invoice Support: Works with various invoice formats and templates
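
The auto-recalculation behind "Edit Before Save" can be pictured with a short sketch. It is only an illustration: the function `recalculate_totals` and the keys `quantity`, `unit_price`, `line_total`, and `tax_rate` are hypothetical names, not taken from the project's code, and the app's real logic may differ.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")


def recalculate_totals(lines, tax_rate="0"):
    """Recompute line totals, subtotal, tax, and grand total after an edit.

    `lines` is a list of dicts with the hypothetical keys `quantity`
    and `unit_price`; `tax_rate` is a fraction such as "0.10".
    """
    subtotal = Decimal("0")
    updated = []
    for line in lines:
        # Go through str() so floats such as 9.99 do not carry binary noise.
        qty = Decimal(str(line["quantity"]))
        price = Decimal(str(line["unit_price"]))
        line_total = (qty * price).quantize(CENT, rounding=ROUND_HALF_UP)
        updated.append({**line, "line_total": line_total})
        subtotal += line_total
    tax = (subtotal * Decimal(str(tax_rate))).quantize(CENT, rounding=ROUND_HALF_UP)
    return {"lines": updated, "subtotal": subtotal, "tax": tax, "total": subtotal + tax}


result = recalculate_totals(
    [{"quantity": 2, "unit_price": "9.99"}, {"quantity": 1, "unit_price": 5}],
    tax_rate="0.10",
)
```

`Decimal` is used here instead of floats so that repeated edits do not accumulate rounding drift in currency values.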

📋 Prerequisites

Node.js 18+ (for frontend)
Python 3.10+ (for backend)
OpenAI API Key (required)
Google Gemini API Key (optional, for fallback)

🛠️ Installation

1. Clone the Repository

# Clone the repository (Lalman888/document-extraction-app), then enter the project directory
cd test-project

2. Backend Setup

cd backend

# Create virtual environment
python -m venv venv

# Activate virtual environment
source venv/bin/activate  # macOS/Linux
# OR
.\venv\Scripts\activate   # Windows

# Install dependencies
pip install -r requirements.txt

# Create .env file
cp .env.example .env
# Edit .env and add your API keys:
# OPENAI_API_KEY=sk-...
# GEMINI_API_KEY=... (optional)

3. Frontend Setup

cd frontend  # from the project root (use "cd ../frontend" if you are still in backend/)

# Install dependencies
npm install

🏃 Running the Application

Start Backend (Terminal 1)

cd backend
source venv/bin/activate  # Windows: .\venv\Scripts\activate
python run.py

Backend runs at: http://localhost:5001

Start Frontend (Terminal 2)

cd frontend
npm run dev

Frontend runs at: http://localhost:3000

📁 Project Structure

test-project/
├── backend/
│   ├── app/
│   │   ├── __init__.py      # Flask app factory
│   │   ├── routes.py        # API endpoints
│   │   ├── database.py      # Excel operations
│   │   ├── extraction.py    # LLM integration
│   │   ├── errors.py        # Error handling
│   │   └── utils.py         # Helpers
│   ├── run.py               # Entry point
│   └── requirements.txt
├── frontend/
│   ├── src/app/
│   │   ├── page.tsx         # Home page
│   │   ├── upload/page.tsx  # Upload & extraction
│   │   ├── invoices/page.tsx # View orders
│   │   └── adr/page.tsx     # Architecture docs
│   └── package.json
└── data/
    ├── Case Study Data.xlsx  # Reference data (read-only)
    └── Extracted_Orders.xlsx # Extracted orders (auto-created)

🔌 API Endpoints

| Endpoint | Method | Description |
|---|---|---|
| `/api/health` | GET | Health check |
| `/api/database/stats` | GET | Database statistics |
| `/api/database/orders` | GET | List extracted orders |
| `/api/invoices/upload-stream` | POST | Extract with SSE progress |
| `/api/invoices/save-edited` | POST | Save edited invoice |
| `/api/llm/status` | GET | LLM provider status |
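
Because `/api/invoices/upload-stream` reports progress over SSE, a client has to split the response into events. The sketch below parses the standard SSE line format (`event:`, `data:`, blank line as terminator). The JSON payload shape with `step` and `progress` keys is an assumption for illustration; this README does not document what the backend actually sends.

```python
import json


def parse_sse(lines):
    """Yield (event, data) tuples from an iterable of SSE text lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event; events with no data are dropped.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        else:
            field, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # a single leading space is not part of the value
            if field == "event":
                event = value
            elif field == "data":
                data.append(value)


# Hypothetical stream: the payload keys are assumptions, not the app's real schema.
sample = [
    'data: {"step": "upload", "progress": 10}\n',
    "\n",
    ": keep-alive\n",
    "event: done\n",
    'data: {"step": "complete", "progress": 100}\n',
    "\n",
]
events = [(name, json.loads(payload)) for name, payload in parse_sse(sample)]
```

In practice the lines would come from a streaming HTTP response rather than a list; the parser only needs an iterable of text lines.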

📊 Data Flow

Invoice Image → Upload API → GPT-4o Vision → JSON → Validation → Edit UI → Excel
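
The Validation step in this flow sits between the model's JSON output and the edit UI. A minimal sketch of such a check follows, assuming hypothetical field names (`invoice_number`, `order_date`, `line_items`, `quantity`, `unit_price`); the project's actual schema and rules are not described in this README.

```python
def _is_number(value):
    """True for int/float values, excluding booleans."""
    return isinstance(value, (int, float)) and not isinstance(value, bool)


def validate_invoice(payload):
    """Return a list of problems found in an extracted invoice dict (empty if none)."""
    problems = []
    # Required top-level fields (hypothetical names).
    for field in ("invoice_number", "order_date", "line_items"):
        if not payload.get(field):
            problems.append(f"missing field: {field}")
    # Per-line checks on the numeric fields.
    for i, item in enumerate(payload.get("line_items") or []):
        qty, price = item.get("quantity"), item.get("unit_price")
        if not _is_number(qty) or qty <= 0:
            problems.append(f"line {i}: quantity must be a positive number")
        if not _is_number(price) or price < 0:
            problems.append(f"line {i}: unit_price must be a non-negative number")
    return problems


ok = {
    "invoice_number": "INV-1",
    "order_date": "2024-01-31",
    "line_items": [{"quantity": 2, "unit_price": 9.5}],
}
bad = {"invoice_number": "", "line_items": [{"quantity": 0, "unit_price": "x"}]}
```

Returning a list of problems instead of raising on the first one lets the edit UI show every issue at once.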

🔧 Configuration

Environment Variables (.env)

# Required
OPENAI_API_KEY=sk-your-openai-key

# Optional (for fallback)
GEMINI_API_KEY=your-gemini-key

# Server
FLASK_DEBUG=1
FLASK_HOST=0.0.0.0
FLASK_PORT=5001
CORS_ORIGINS=http://localhost:3000
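
For orientation, here is one way a backend could read these variables. This is a sketch, not the project's code: the function `load_config` is hypothetical, the defaults simply mirror the example values above, and treating `CORS_ORIGINS` as a comma-separated list is an assumption.

```python
import os


def load_config(env=None):
    """Build a settings dict from environment variables.

    Defaults mirror the example .env values; whether the real backend
    applies the same defaults is not confirmed by this README.
    """
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY", "")
    if not api_key:
        raise RuntimeError("OPENAI_API_KEY is required")
    origins = env.get("CORS_ORIGINS", "http://localhost:3000")
    return {
        "openai_api_key": api_key,
        "gemini_api_key": env.get("GEMINI_API_KEY") or None,  # optional fallback
        "debug": env.get("FLASK_DEBUG", "0") == "1",
        "host": env.get("FLASK_HOST", "0.0.0.0"),
        "port": int(env.get("FLASK_PORT", "5001")),
        # Assumption: several origins may be given, separated by commas.
        "cors_origins": [o.strip() for o in origins.split(",") if o.strip()],
    }


cfg = load_config({"OPENAI_API_KEY": "sk-test", "FLASK_PORT": "8000"})
```

Failing fast on a missing `OPENAI_API_KEY` surfaces the problem at startup instead of on the first upload.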

Frontend Environment Variables (.env.local)

# API Base URL (optional, defaults to localhost:5001)
NEXT_PUBLIC_API_URL=http://localhost:5001

🧪 Testing

Test Backend Health

curl http://localhost:5001/api/health

Test Extraction

1. Open http://localhost:3000/upload
2. Upload an invoice image
3. Review the extracted data in the modal
4. Optionally edit the data and save it to the database

📄 Sample Invoices

Sample invoices are located in data/:

Sales Invoice.png - Original sample
invoice_template_b.png - Professional blue design
invoice_template_c.png - Black/white tax invoice

📚 Architecture Decision Records

View technical decisions at: http://localhost:3000/adr

Key decisions:

Next.js + Flask architecture
Excel as database (no external DB required)
GPT-4o Vision with Gemini fallback
SSE for real-time progress updates
shadcn/ui component library
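
The "GPT-4o Vision with Gemini fallback" decision follows a common try-in-order pattern. The sketch below shows that pattern with stub functions standing in for the real API calls; `extract_with_fallback` and the stubs are hypothetical names, and no OpenAI or Gemini SDK calls are made.

```python
def extract_with_fallback(image_bytes, providers):
    """Try each (name, callable) provider in order; return the first success."""
    errors = []
    for name, extract in providers:
        try:
            return {"provider": name, "data": extract(image_bytes)}
        except Exception as exc:  # real code would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


# Stubs standing in for the primary and fallback model calls.
def primary_stub(_image):
    raise TimeoutError("timed out")


def fallback_stub(_image):
    return {"invoice_number": "INV-1"}


result = extract_with_fallback(
    b"fake-image-bytes",
    [("openai", primary_stub), ("gemini", fallback_stub)],
)
```

Recording which provider answered makes it easier to compare extraction quality between the two later on.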

The ADR page also includes a Scaling Strategies section covering:

Higher volume handling (queue-based processing)
Additional document types (POs, receipts, contracts)
Production database migration (PostgreSQL)
Cloud deployment (Kubernetes)
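
As a rough picture of queue-based processing, the sketch below fans jobs out to a small pool of worker threads using only the standard library. It is an in-process illustration under stated assumptions: `process_queue` is a hypothetical helper, and a production setup would more likely use an external queue or task runner, which this README does not specify.

```python
import queue
import threading


def process_queue(jobs, handler, workers=2):
    """Run handler(job) for every job using a fixed pool of worker threads.

    Jobs must be hashable (for example file names); the result maps each
    job to its return value, or to the exception it raised.
    """
    q = queue.Queue()
    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            job = q.get()
            if job is None:  # sentinel: no more work for this worker
                return
            try:
                outcome = handler(job)
            except Exception as exc:
                outcome = exc
            with lock:
                results[job] = outcome

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for t in threads:
        t.start()
    for job in jobs:
        q.put(job)
    for _ in threads:
        q.put(None)  # one sentinel per worker
    for t in threads:
        t.join()
    return results


# The handler here is a placeholder for the real extraction call.
done = process_queue(["a.png", "b.png", "c.png"], lambda name: name.upper())
```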

📝 License

MIT
