Heritage Results Archive

An unofficial, user-friendly interface for viewing semester results at the Heritage Institute of Technology.

The main page for selecting a batch.

Clean, searchable, and filterable results for a selected batch.

About The Project

Heritage Results Archive is a web application that scrapes, aggregates, and presents publicly available semester results for students of the Heritage Institute of Technology. The goal is to provide a fast, clean, and user-friendly interface to view and search through grade data without the clutter of the official portal.

Data is sourced using Selenium to handle the JavaScript-rendered official website and is stored locally in CSV files. The front-end is a lightweight Flask application that serves the data through a simple web UI and an internal API.

Motivation

I began this project out of personal curiosity and as a challenge to practice and showcase my skills in:

Web Scraping: Using Selenium to automate browser interactions and extract data from a dynamic website.
Backend Development: Building a web application and API with the lightweight Flask framework.
Data Handling: Processing and structuring scraped data into a clean, usable format (CSV).
Full-Stack Integration: Connecting a Python backend to a simple, effective front-end.

Key Features

🌐 Simple Web Interface: A clean, responsive UI for browsing results by batch.
🔍 Search & Filter: Instantly search for students or filter results.
📊 Ranked Results: View students ranked by their SGPA and YGPA.
🐍 Python Backend: A robust Flask server powers the application.
🦊 Selenium Scraping: Automated data ingestion from the public results portal.
📁 CSV Data Storage: Simple, transparent, and portable data backend.

Tech Stack

Aspect	Choice
Language	Python 3.12 (>=3.8 compatible)
Web	Flask
Scraping	Selenium (Firefox)
Storage	CSV
Config	Simple Python modules

Running Locally

Prerequisites

Python 3.12+ (expected to work on ≥3.8)
Firefox browser installed
geckodriver available on your system's PATH
- macOS: brew install geckodriver
- Linux: Download from Mozilla or use your distribution's package manager.
- Verify with: geckodriver --version

Installation & Setup

Clone the repository:

git clone https://github.com/shirsakm/heritage-db.git
cd heritage-db

Create and activate a virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
python app.py
```

The application will be available at http://127.0.0.1:5000.

Repository Structure

.
├─ app.py             # Flask entrypoint
├─ config.py          # Configuration / constants
├─ requirements.txt   # Python dependencies
├─ src/               # Selenium scraping scripts / helpers
├─ data/              # Batch folders (e.g. 2022/, 2023/)
│  └─ <batch>/final.csv # Canonical dataset per batch
├─ models/            # Data model helpers / abstractions
├─ services/          # Service-layer logic (API/data access)
├─ utils/             # General utility functions
├─ templates/         # Jinja2 templates for HTML pages
└─ static/            # CSS / JS / other assets

Data & API Notes

Data Storage

The canonical data for each academic batch is stored in a final.csv file within its respective directory (e.g., data/2024/final.csv). For any data analysis or external use, reading these CSV files directly is the recommended approach.

Internal API

The project includes a minimal internal API (e.g., routes under /api/...) that the front-end uses to fetch data dynamically.

Please note: This API is unstable and not versioned. It is intended solely for internal consumption. Response formats and endpoints may change without notice. Relying on it for third-party applications is not recommended.

Roadmap (Aspirational)

Lightweight batch regeneration tooling
Basic aggregate statistics (averages, distributions)
Trend analysis per student
Optional JSON export (read-only)
Analytics dashboards (charts / grade curves)

Contribution Policy

This is a personal project, so I am not accepting pull requests at this time. However, if you find an issue or have a suggestion, please feel free to open an issue for discussion.

Privacy / Ethics

This tool only uses publicly accessible academic result data. No private credentials are used or stored.
If you have any concerns about the data, please open an issue for redactions or takedown requests.
The project's approach will be re-evaluated if the source portal's policies change.

License

This project is licensed under the MIT License. See the LICENSE file for details.

MIT License © 2025 Shirsak Majumder

Maintainer

Shirsak Majumder (@shirsakm)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Heritage Results Archive

About The Project

Motivation

Key Features

Tech Stack

Running Locally

Prerequisites

Installation & Setup

Repository Structure

Data & API Notes

Data Storage

Internal API

Roadmap (Aspirational)

Contribution Policy

Privacy / Ethics

License

Maintainer

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
data		data
models		models
services		services
src		src
static		static
templates		templates
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
app.py		app.py
config.py		config.py
debug_csv.py		debug_csv.py
favicon.svg		favicon.svg
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Heritage Results Archive

About The Project

Motivation

Key Features

Tech Stack

Running Locally

Prerequisites

Installation & Setup

Repository Structure

Data & API Notes

Data Storage

Internal API

Roadmap (Aspirational)

Contribution Policy

Privacy / Ethics

License

Maintainer

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages