Skip to content

Evaluating student demographics, engagement and performance metrics for predicting dropout likelihood using supervised learning - Neural Networks & Decision Trees.

Notifications You must be signed in to change notification settings

PaPJoIN/supervised-learning-class-dropout

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

10 Commits
ย 
ย 
ย 
ย 

Repository files navigation

Applying Supervised Learning to predict student dropout

Evaluating student demographics, engagement and performance metrics for predicting dropout likelihood using supervised learning techniques - Neural Networks & Decision Trees.

๐Ÿ”ธ Data

The data used in this project was divided into 3 stages, each representing the students' progression through their academic journey, starting with enrollment details, followed by incorporating engagement metrics and finally introducing subject gradings.

๐Ÿ”ธ Methodology

  • Performed data exploration and the necessary preprocessing steps, including encoding and scaling.
  • Identified the key correlations between variables.
  • Defined, compiled and trained the Neural Network models.
  • Performed hyperparamater optimisation using Keras Tuner to further refine accuracy.
  • Instantiated and trained the Extreme Gradient Boosting models.
  • Applied Grid & Ramdom-Search to further tune the XGBoost models.
  • Calculated and compared the models' perfomances across a range of metrics [Accuracy, AUC, Recall & Precision].

๐Ÿ”ธ Tools

  • Python, NumPy, Pandas
  • Matplotlib, Seaborn, Scikit-learn
  • Keras-Tensorflow & Keras Tuner
  • XGboost & Sklearn's GridSearch Cross Validation

๐Ÿ”ธ Results & Findings

  • The conducted data exploration and analysis provided valuable insights into the factors influencing student dropout. The predictive performance of both XGBoost and Neural Network models progressively improved across the three stages of data inclusion, demonstrating the benefit of incorporating more detailed and performance-based student information (Figures below).
  • The findings further revealed areas for improvement. For this, a number of solutions were proposed for imporving in the recruitment & monitoring of student to ensure maximised rates of course completion, including:
    • Refined Candidate Vetting and tracking Risk profiles per intake: Conduct further refinement and incorporate predictive indicators identified in application data into the admissions process to better evaluate dropout risk and provide guided support.

    • Enhanced Student Engagement Monitoring: Integrate systems for continuous monitoring of student absences and engagement metrics and develop automated alerts to flag students exhibiting decreasing attendance.

    • Strengthened Academic Support triggers: Design and implement targeted programmes for addressing academic learning challenges, such as academic leads being alerted when students reach dropout-risk thresholds (e.g., >30 unauthorised absences, 1+ failed modules).

image image

About

Evaluating student demographics, engagement and performance metrics for predicting dropout likelihood using supervised learning - Neural Networks & Decision Trees.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published