🧠 Prema Review Intelligence

AI-powered E-commerce Review Analysis · FastAPI + Streamlit + SQLite

Prema Review Intelligence is a lightweight but production-ready prototype for automated analysis of large e-commerce review datasets.
It ingests raw CSV/JSON exports, stores them in SQLite, computes sentiment & thematic insights, exposes a FastAPI backend, and ships with a Streamlit dashboard for interactive exploration.

This project is part of the Prema Vision AI Automations portfolio.

🔍 What It Does

📥 Ingest raw review files (CSV/JSON) with dataset metadata
🧹 Validate files (size, type, row limits) for safe ingestion
💾 Store reviews in SQLite using SQLModel
🧠 Analyze sentiment, themes, and summary statistics
⚡ Cache heavy analyses to avoid recomputation
🔌 Expose a clean API with FastAPI
📊 Interactive dashboard built with Streamlit
🧪 Testable architecture with sample data and simple unit tests

🏗 Architecture Overview

app/
  analysis/        # Stats + theme extraction + LLM abstraction
  api/             # FastAPI routes, dependencies, middleware
  core/            # Settings, logging, environment config
  db/              # SQLModel models and DB engine
  ingestion/       # CSV/JSON ingestion pipeline
  schemas/         # Pydantic DTOs for API transport
  services/        # Orchestrators for datasets & analyses

dashboard/
  app.py           # Streamlit UI powered by the API

data/samples/      # Example datasets for demos
scripts/           # Helper scripts (e.g., ingest sample data)
tests/             # Lightweight unit tests

🚀 Getting Started

1. Install dependencies

poetry install

2. Seed the database with sample data

poetry run python scripts/ingest_sample.py

3. Run the API

poetry run uvicorn app.main:app --reload

4. (Optional) Run the Streamlit dashboard

poetry run streamlit run dashboard/app.py

Default dashboard API target:
http://localhost:8000
Override via: REVIEW_API_BASE_URL

🔌 API Overview

Health

GET /health

Datasets

POST /datasets — upload CSV/JSON and ingest
GET /datasets — list datasets
GET /datasets/{id}/summary?force={bool} — stats, sentiment, themes
GET /datasets/{id}/themes — structured theme extraction

Reviews

GET /reviews — filterable review list
- params: dataset, rating range, paging

Interactive docs:
http://localhost:8000/docs

🔐 Security Features

This project includes a practical security foundation:

✅ File type/size/content validation
✅ DoS protection via row & size limits
✅ CORS configuration via environment
✅ Rate limiting for LLM calls
✅ API key validation
✅ Input sanitization (Pydantic)
✅ Controlled DB access via SQLModel & engine pooling

For full guidance and deployment notes, see:
SECURITY.md

⚙️ Configuration

Defined via .env (see .env.example):

DATABASE_URL — default sqlite:///./data/app.db
MAX_THEMES — number of returned themes
LLM_PROVIDER / OPENAI_API_KEY — optional LLM integration
CORS_ALLOWED_ORIGINS
MAX_UPLOAD_SIZE — default 10MB
MAX_INGESTION_ROWS — DoS protection (default: 50,000)
LLM_RATE_LIMIT_RPM — default: 60

🧪 Development Tooling

poetry run black .
poetry run isort .
poetry run ruff check .
poetry run pytest

End-to-End Testing

Comprehensive e2e tests using Playwright are available in tests/e2e/:

# Install Playwright browsers (first time only)
poetry run playwright install chromium

# Run all e2e tests
poetry run pytest tests/e2e/ -v

# Run specific test categories
poetry run pytest tests/e2e/test_api_endpoints.py -v    # API tests
poetry run pytest tests/e2e/test_dashboard_ui.py -v    # UI tests
poetry run pytest tests/e2e/test_error_handling.py -v  # Error handling

See tests/e2e/README.md for detailed documentation.

🛣 Next Steps (Roadmap)

Swap MockLLMClient for OpenAI/Anthropic
Improve clustering & embeddings
Add product-level insights (pricing, defect themes)
Connect ingestion to live sources (Amazon, Shopify)
Add multi-dataset comparison & trend analysis

🤝 Contributing

Issues and PRs welcome.
Part of the Prema Vision internal AI automation suite.

MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
app		app
dashboard		dashboard
data/samples		data/samples
scripts		scripts
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
APP_VALIDATION.md		APP_VALIDATION.md
README.md		README.md
SECURITY.md		SECURITY.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
validate_app.py		validate_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 Prema Review Intelligence

AI-powered E-commerce Review Analysis · FastAPI + Streamlit + SQLite

🔍 What It Does

🏗 Architecture Overview

🚀 Getting Started

1. Install dependencies

2. Seed the database with sample data

3. Run the API

4. (Optional) Run the Streamlit dashboard

🔌 API Overview

Health

Datasets

Reviews

🔐 Security Features

⚙️ Configuration

🧪 Development Tooling

End-to-End Testing

🛣 Next Steps (Roadmap)

🤝 Contributing

About

Uh oh!

Releases

Packages

Languages

premavision/prema-review-intelligence

Folders and files

Latest commit

History

Repository files navigation

🧠 Prema Review Intelligence

AI-powered E-commerce Review Analysis · FastAPI + Streamlit + SQLite

🔍 What It Does

🏗 Architecture Overview

🚀 Getting Started

1. Install dependencies

2. Seed the database with sample data

3. Run the API

4. (Optional) Run the Streamlit dashboard

🔌 API Overview

Health

Datasets

Reviews

🔐 Security Features

⚙️ Configuration

🧪 Development Tooling

End-to-End Testing

🛣 Next Steps (Roadmap)

🤝 Contributing

About

Topics

Resources

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages