Prediction of Student Dropout in Malaysian’s Private Higher Education Institute using Data Mining Application

Authors

  • Nurhana Roslan School of Quantitative Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah, Malaysia
  • Jastini Mohd Jamil School of Quantitative Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah, Malaysia
  • Izwan Nizal Mohd Shaharanee School of Quantitative Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah, Malaysia
  • Sultan Juma Sultan Alawi Ministry of Education, South Alsharqia, Sultane of Oman

DOI:

https://doi.org/10.37934/araset.45.2.168176

Keywords:

Educational data mining, Student dropout prediction, Data mining, Student attrition

Abstract

Student dropout issue is a major concern among the academics and management of the university. The higher rate of student dropout impacted the university reputation such as reducing student enrolment, affecting the revenue of the university, financial losses for the country, and increase the existence of a social problem among the students. In this study, 2 popular classifiers were utilized to predict the student dropout namely decision tree and logistic regression model respectively. Several sets of experimental setting were employed which include three set of data partitioning - along with different types of decision tree and regression model. As for the logistic regression model, different data imputation and transformation method was tested to ensure that the model built is valid. A total of 7706 student data extracted from one of the private universities in Malaysia database (between year 2018-2019) to assess the capability of the classifier. The classifier performance is evaluated using machine learning performance measure of accuracy and misclassification rate. The result indicates that, decision tree - chi-square (2 branches) achieved slightly better classification performance of 89.49% on 80/20 data partitioning. The chosen model also identified the most important variable for accurate prediction of student dropout. Application of this model has the potential to accurately predict at risk student and to reduce student dropout rates.

Downloads

Download data is not yet available.

Author Biographies

Nurhana Roslan, School of Quantitative Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah, Malaysia

hanaroslan1996@gmail.com

Jastini Mohd Jamil, School of Quantitative Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah, Malaysia

jastini@uum.edu.my

Izwan Nizal Mohd Shaharanee, School of Quantitative Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah, Malaysia

nizal@uum.edu.my

Sultan Juma Sultan Alawi, Ministry of Education, South Alsharqia, Sultane of Oman

hanaroslan1996@gmail.com

Published

2024-04-14

Issue

Section

Articles