At-Risk Student CLustering
Identifying At-Risk Students Using Clustering Techniques
- Jayadev Tripathi
- Arbin Shrestha
- Ujwal Raj Bhattarai
This project explores how clustering algorithms, specifically k-means, can be used to group students based on their performance metrics. The aim is to identify at-risk students by categorizing them into groups such as "high-performing," "average," and "at-risk." This classification helps educators to provide timely interventions and support where needed. The project uses a dataset with various student performance metrics and builds a model to perform clustering. An interactive dashboard is included to visualize these clusters and their correlations.
- Python: Programming language used for the implementation.
- Streamlit: Framework used to develop the interactive web interface for data visualization.
- scikit-learn: Library used for implementing the k-means clustering algorithm.
- pickle: Used for saving and loading the clustering model.
- Pandas: For data manipulation and handling.
- NumPy: For numerical operations.
- Plotly: For interactive data visualizations.
- Create Virtual Environment
python -m venv 'env'
- Activate Virtual Environment
venv\Scripts\activate
- Install Dependencies
pip install -r requirements.txt
After setting up the environment and installing the dependencies, you can run the project as described in the project documentation.
To run the project, you can use the following command:
streamlit run interface.py


