| Full name | Student ID |
|---|---|
| Nguyễn Tiến Khôi | 22022658 |
| Nguyễn Minh Hiếu | 22022609 |
| Tống Duy Tân | 22022538 |
| Quản Xuân Sơn | 22022519 |
| Nguyễn Quốc Tuấn | 22022553 |

Make sure you have the following installed on your system:
- Python 3.8 or higher
- Java 8
- Apache Kafka
- HBase
- Hadoop

Additional requirements will be installed using the provided `requirements.txt` file.
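
The prerequisites above can be sanity-checked before going further. The following is a minimal sketch, not part of the project: `check_prerequisites` is a hypothetical helper that only verifies the Python version and that `java` and `git` are on `PATH`. It does not confirm the Java version or that Kafka, HBase, and Hadoop are installed.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 8), tools=("java", "git")):
    """Return a list of human-readable problems (hypothetical helper, not in the repo)."""
    problems = []
    # Compare the running interpreter against the minimum version from the list above.
    if sys.version_info < min_python:
        problems.append("Python %d.%d or higher is required" % min_python)
    # shutil.which returns None when an executable cannot be found on PATH.
    for tool in tools:
        if shutil.which(tool) is None:
            problems.append(f"'{tool}' was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("WARNING:", problem)
```

An empty result only means these basic checks passed; the Kafka, HBase, and Hadoop installations still need to be verified by hand.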

Clone the repository:

    git clone https://github.com/WelsneilT/Big-Data-Project.git

Run the following command to install all required Python packages:

    pip install -r requirements.txt

The batch file initializes the Kafka, Hadoop, and HBase services required for the project:

    cd Big-Data-Project
    ./commands.bat

From the repository root, navigate to the directory containing the streaming pipeline script and execute it:

    cd Main/Lambda/Stream_layer/
    python stream_pipeline.py

From the repository root, navigate to the directory containing the Flask application and run the app:

    cd Main/Lambda/real_time_web_app(Flask)/
    python app.py

Once the Flask server is running, open your browser and visit:
http://localhost:5000
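
If the page does not load, it can help to confirm from a script whether anything is answering at that address. The sketch below is an illustration only: `is_reachable` is a hypothetical helper using the standard library, and it assumes the app listens on the default URL shown above.

```python
import urllib.error
import urllib.request


def is_reachable(url="http://localhost:5000", timeout=3.0):
    """Return True if an HTTP server answers at `url` (hypothetical helper)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server responded with a 4xx/5xx status, so it is running.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, or timeout: nothing usable is listening.
        return False


if __name__ == "__main__":
    print("Flask app reachable:", is_reachable())
```

A `False` result usually means `app.py` is not running or is bound to a different port.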
- Ensure that Kafka and other dependencies are properly configured before running the `commands.bat` file.
- You must run it with administrator permissions.
- You must edit the paths in the `commands.bat` file to match the installation directories of Kafka and HBase on your machine.
- You can check whether all services are running with `jps` after running `commands.bat`.
- All services must be up and running to avoid connection issues.
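
The `jps` check mentioned in the notes can also be scripted. This is a minimal sketch under stated assumptions: `missing_services` is a hypothetical helper, and the names in `EXPECTED` are the usual JVM main-class names for Kafka, ZooKeeper, HDFS, and the HBase master. The set of daemons that `commands.bat` actually starts may differ, so adjust the list accordingly.

```python
import subprocess

# Assumed daemon names as printed by jps; edit to match what commands.bat starts.
EXPECTED = ("Kafka", "QuorumPeerMain", "NameNode", "DataNode", "HMaster")


def missing_services(jps_output, expected=EXPECTED):
    """Return the expected service names that do not appear in jps output."""
    running = set()
    for line in jps_output.splitlines():
        parts = line.split()
        # Each jps line has the form "<pid> <main class>".
        if len(parts) >= 2:
            running.add(parts[1])
    return [name for name in expected if name not in running]


if __name__ == "__main__":
    try:
        result = subprocess.run(["jps"], capture_output=True, text=True, check=True)
    except (OSError, subprocess.CalledProcessError) as exc:
        print("Could not run jps:", exc)
    else:
        missing = missing_services(result.stdout)
        print("Missing services:", ", ".join(missing) if missing else "none")
```

Keeping the parsing in a separate function makes it possible to test against saved `jps` output without any services running.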

