|
7 | 7 | \usepackage{graphicx} |
8 | 8 | \usepackage{textcomp} |
9 | 9 | \usepackage{xcolor} |
| 10 | +\usepackage{float} |
| 11 | +%\restylefloat{table} |
10 | 12 | \def\BibTeX{{\rm B\kern-.05em{\sc i\kern-.025em b}\kern-.08em |
11 | 13 | T\kern-.1667em\lower.7ex\hbox{E}\kern-.125emX}} |
12 | 14 | \begin{document} |
|
28 | 30 |
|
29 | 31 | We propose a solution to mitigate this dropout as well as helping academia to find \textit{excellence students} with compatible profile for a certain path (diploma and domain). Taking the problem at its root could lead to a \textit{two birds with one stone} resolution to the problem. |
30 | 32 |
|
31 | | -This research endeavours to address a critical issue within the education system, offering a more holistic and personalized approach to student placement. By reimagining the criteria for admission, we aspire to create a more harmonious and productive educational landscape for both students and academic institutions. |
| 33 | +This paper focuses on critical issues within the education system and tries to differ a more holistic and personalized approach to student placement. By using data mining, analytic and machine learning, we hope to create a more harmonious and productive education landscape for both students and academic alike. |
32 | 34 | \end{abstract} |
33 | 35 | \vspace{8pt} |
34 | 36 | \begin{IEEEkeywords} |
|
37 | 39 | \vspace{16pt} |
38 | 40 |
|
39 | 41 | \section{Introduction} |
40 | | -In today's rapidly evolving educational landscape, optimizing the admission process in higher education institutions has never been more critical. With the aftermath of the COVID-19 pandemic and the surge in students seeking higher education, universities face the daunting task of managing a vast number of admissions while ensuring they admit candidates best suited for their programs. It is estimated that for the year 2021-2022, 2.97 millions students have registered for higher education.\cite{sous-direction_des_systemes_dinformation_et_des_etudes_statistiques_sies_les_2022} |
| 42 | +Today education landscape in higher education is rapidly evolving due to higher levels of education and possibly COVID-19, the surge in higher education enrolment presents new challenges : how to manage increasing number of admissions and how to ensure academic success. |
| 43 | +An estimated 2.97 million students registering for higher education in France for the year 2021-2022, marking a 2.5\% increase from the previous year \cite{sous-direction_des_systemes_dinformation_et_des_etudes_statistiques_sies_les_2022}. Globally, UNESCO reported a rise from 28.5 million students in 1970 to an estimated 235 million in 2023 \cite{unesco_higher_2023}. |
41 | 44 |
|
42 | | -The Parcoursup system, introduced as a centralized platform for higher education admissions in France, aimed to streamline this process. While it brought some level of uniformity and transparency, criticisms have arisen regarding its limitations in truly identifying candidates' aptitude and ensuring a match between a student's potential and the chosen program. |
| 45 | +However, this increase in enrolment has not necessarily translated into higher success rates. As illustrated in Table \ref{tab:result_bachelor}, the cumulative success rate over 4 years for students enrolled in Bachelor's programs in France remains a concern, with only 39.8\% of the 2010 cohort succeeding by 2014. |
43 | 46 |
|
44 | | -\begin{table} |
| 47 | +\begin{table}[H] |
45 | 48 | \centering |
46 | 49 | \caption{Results in Bachelor's degree for the 2013 and 2014 sessions for students enrolled for the first time in the first year of Bachelor's in 2010-2011.\cite{kabla-langlois_fporsoc16b_ec2_enseignementpdf_2016}} |
47 | 50 | \begin{tabular}{|c|c|} |
48 | 51 | \hline |
49 | | - \textbf{Headcount 2010} & \textbf{Réussite cumulée en 4 ans (en \%)} \\ |
50 | | - \hline |
51 | | - 169 652 & 39.8 \\ |
52 | | - \hline |
| 52 | + \textbf{Headcount 2010} & \textbf{Cumulative success over 4 years (in \%)}\\ |
| 53 | + \hline |
| 54 | + 169 652 & 39.8 \\ |
| 55 | + \hline |
53 | 56 | \end{tabular} |
54 | | - |
55 | | - \label{tab:my_label} |
| 57 | + \label{tab:result_bachelor} |
56 | 58 | \end{table} |
57 | 59 |
|
58 | | -Enter the realm of machine learning and data analytics, technologies that promise a more nuanced and accurate approach to admissions. By analyzing a myriad of factors beyond traditional metrics, this approach seeks to identify candidates whose profiles align perfectly with specific programs, thereby increasing the likelihood of academic success. |
| 60 | +In France, the Parcour sup system was introduced to manage the influx of candidates and match them with suitable programs. Despite its intentions for uniformity and transparency, it has faced criticism for not adequately addressing the mismatch between student potential and program suitability, which is a contributing factor to the low success rates \cite{couto_parcoursup_2021}. |
| 61 | + |
| 62 | +Enter the realm of machine learning and data analytics, technologies that promise a more nuanced and accurate approach to admissions. By analysing a myriad of factors beyond traditional metrics, this approach seeks to identify candidates whose profiles align perfectly with specific programs, thereby increasing the likelihood of academic success and addressing the mismatch issue. |
| 63 | + |
| 64 | +This research endeavours to explore and validate the potential of machine learning and data analytics in revolutionizing the admission process. The motivation is twofold: to enhance the success rate of students by ensuring they are placed in programs where they are most likely to excel, and to reduce dropout rates by minimizing mismatches between students and programs. |
| 65 | + |
| 66 | +This research will explore and proposes of different approach, starting with a detailed methodology, followed by a case study made witin the University of Pau et des Pays de l'Adour. |
| 67 | +We will then conclude, taking into account our findings and the result of our experiment |
| 68 | + |
| 69 | +\section{State of the art} |
| 70 | +\label{sec:soa} |
| 71 | +The body of literature concerning student dropout is comprehensive, encompassing a range of perspectives from diverse academic disciplines. The issue has been approached through psychological and economic lenses, utilizing both qualitative and quantitative research methods. This section is structured to reflect the various domains and methodologies present in prior studies. |
| 72 | + |
| 73 | +We've cut our literature analysis in two major subsections. The first one, \ref{subsec:soa_analyticalapproeach}, will review the different ways of analytical analysis on student dropout. These different approaches went from simple adaptation models to economic models and psycho-pedagogic models to predict and describe student dropout. Then, in \ref{subsec:soa_predictiveapproach} we are going to study different mathematical algorithms and models and use machine learning models to predict student dropout. |
59 | 74 |
|
60 | | -This research endeavors to explore and validate the potential of machine learning and data analytics in revolutionizing the admission process. The motivation is twofold: to enhance the success rate of students by ensuring they are placed in programs where they are most likely to excel and to reduce dropout rates by minimizing mismatches between students and programs. |
| 75 | +\subsection{Analytical approach} |
| 76 | +\label{subsec:soa_analyticalapproeach} |
| 77 | +This first part will delve inside the different perspectives of research outside the predictive fields using machine learning. We found these different point of view interesting to study for our future predictive model. These research have different conclusion on why students dropout from their studies, from an economical standpoint to a psychological one. |
| 78 | +(Opazo et al. 2021)\cite{opazo_analysis_2021} have found an interesting link with the study from (Spady 1970) which found a correlation between the student dropout model and the social nature of suicide (\textit{social integration}), (Durkheim, 1951). This theory implies that the likelihood of suicide increase when there is an absence of : a)\textit{low normative congruence} and b)\textit{low friendship support}. Other literature from their state-of-the-art list these following as recurrent factors from multiple research\cite{opazo_analysis_2021}\cite{spady_dropouts_1970,tinto_dropout_1975,caspersen_teachers_2015,lidia_problema_2006,bejarano_caso_2017,sinchi_acceso_2018,cavero_voluntad_2011,velasco_alisis_nodate}: |
| 79 | +\begin{itemize} |
| 80 | +\item family |
| 81 | +\item previous educational background |
| 82 | +\item academic potential |
| 83 | +\item normative congruence |
| 84 | +\item friendship support |
| 85 | +\item intellectual development |
| 86 | +\item educational performance |
| 87 | +\item social integration |
| 88 | +\item satisfaction |
| 89 | +\item institutional commitment |
| 90 | +\item student adaptation |
| 91 | +\end{itemize} |
| 92 | +We can decisively take into account these factors for our study has they have been proven to be recurrent factors throughout the literature on predicting students dropout. |
61 | 93 |
|
62 | | -This paper will delve into the intricacies of the proposed approach, starting with a detailed methodology, followed by case studies from the University of Pau and Pays de l’Adour. The results section will present findings, and the discussion will explore implications, potential challenges, and future directions. |
| 94 | +\subsection{Predictive Approaches} |
| 95 | +\label{subsec:soa_predictiveapproach} |
| 96 | +Several researches focused on predictive approaches such as Naive Bayesian Algorithm, Association rules mining, ANN based algorithm, Logistic Regression, CART, C4.5, J48, Simple Logistic, JRip, Random Forest, Logistic regression analysis, ICRM2.\cite{mduma_survey_2019}. |
| 97 | +In this paper, we are going to analyse the following models \cite{mduma_survey_2019}\cite{quinlan_induction_1986,yadav_mining_2012,heredia_student_2015,ramirez_prediction_2018,cox_regression_1958,perez_modelo_2018,pandey_data_2011,hegde_higher_2018,cover_nearest_1967,mardolkar_forecasting_2020,zhang_neural_2000,rudin_stop_2019,siri_predicting_2015,m_alban_she_is_with_the_faculty_of_engineering_and_applied_sciences_of_the_technical_university_cotopaxi_neural_2019,cortes_support-vector_1995,boser_training_1992,cardona_predicting_2019,naicker_linear_2020,lee_machine_2019,behr_early_2020,friedman_stochastic_2002,eckert_analysis_2015,tenpipat_student_2020,liang_machine_2016,liang_big_2016,fischer_angulo_modelo_2012,miranda_analysis_2017,viloria_integration_2019,kemper_predicting_2020}: |
| 98 | +\begin{enumerate} |
| 99 | +\item Decision Trees |
| 100 | +\item Logistic Regression |
| 101 | +\item Naive Bayes |
| 102 | +\item K-Nearest Neighbors (KNN) |
| 103 | +\item Neural Networks |
| 104 | +\item Support Vector Machine |
| 105 | +\item Random Forest |
| 106 | +\end{enumerate} |
| 107 | +These models have been chosen as they have been the most recurring ones within the literature. We will analyse each models to list the pros and cons of each one and to determine which one(s) we should use to build our predictive system. |
63 | 108 |
|
64 | | -In the broader context of higher education, this research holds significant promise in addressing some of the pressing challenges faced by universities today. By reading this paper, stakeholders in the education sector will gain insights into a novel approach that not only promises better student outcomes but also paves the way for a more efficient and effective admission process. |
| 109 | +\subsubsection{Decision Trees} |
| 110 | +\subsubsection{Logistic Regression} |
| 111 | +\subsubsection{Naive Bayes} |
| 112 | +\subsubsection{K-Nearest Neighbors (KNN)} |
| 113 | +\subsubsection{Neural Networks} |
| 114 | +\subsubsection{Support Vector Machine} |
| 115 | +\subsubsection{Random Forest} |
| 116 | +\subsubsection{Conclusion} |
65 | 117 |
|
66 | 118 |
|
67 | 119 | \section*{Acknowledgment} |
|
0 commit comments