Machine Learning in Reverse Migration Classification

Authors

  • Nur Huzeima Mohd Hussain Department of Built Environment and Technology, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia
  • Azreen Anuar Centre of Graduate Studies, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia
  • Suraya Masrom Computing Science Studies, College of Computing, Informatics and Media, Universiti Teknologi MARA, Perak Branch, Tapah Campus, Malaysia
  • Thuraiya Mohd Department of Built Environment and Technology, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia
  • Nur Azfahani Ahmad Department of Built Environment and Technology, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia
  • Hugh Byrd Lincoln School of Architecture, University of Lincoln, Lincoln, United Kingdom

DOI:

https://doi.org/10.37934/araset.38.2.4555

Keywords:

Machine learning, Auto model, Reverse migration, classification

Abstract

Reverse migration has become a more pressing issue in recent times, owing to a range of factors like economic downturns, political instability, natural disasters, and the COVID-19 pandemic. The pandemic, in particular, has highlighted the vulnerability of migrant workers in urban areas, leading many to return to their rural homes. As a result, reverse migration necessitates focused attention and planning by governments, policymakers, and communities to ensure favourable outcomes for all parties involved. This paper aims to provide a fundamental research framework from research that utilized a machine learning approach to classify reverse migration based on evidence from Selangor, Malaysia. The research methodology involves selecting features for reverse migration classification models and identifying optimal hyperparameters and experimental settings through auto model preliminary analysis. Furthermore, based on the findings of the auto model, the methodology was enhanced with a manual setting of machine learning. Three machine learning algorithms, namely Decision Tree, Random Forest, and Gradient Boosted Trees were used. The results of the auto model and the manual process that used different split ratios were compared. All the machine learning algorithms performed with a high accuracy of over 90% and were efficient in completing prediction tasks in under a minute across various settings. The best machine learning model with an accuracy of 97.6% is Gradient Boosted Trees with a split ratio of 60:40. The paper presents findings that could prove useful for governments, legal planners, investors, and the community in strategizing and surviving through an artificial intelligence prediction approach.

Downloads

Download data is not yet available.

Author Biographies

Nur Huzeima Mohd Hussain, Department of Built Environment and Technology, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia

nurhu154@uitm.edu.my

Azreen Anuar, Centre of Graduate Studies, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia

2020864206@student.uitm.edu.my

Suraya Masrom, Computing Science Studies, College of Computing, Informatics and Media, Universiti Teknologi MARA, Perak Branch, Tapah Campus, Malaysia

suray078@uitm.edu.my

Thuraiya Mohd, Department of Built Environment and Technology, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia

thura2321@uitm.edu.my

Nur Azfahani Ahmad, Department of Built Environment and Technology, Universiti Teknologi MARA, Perak Branch, Seri Iskandar Campus, Malaysia

nuraz020@uitm.edu.my

Hugh Byrd, Lincoln School of Architecture, University of Lincoln, Lincoln, United Kingdom

hbyrd@lincoln.ac.uk

Published

2024-01-31

Issue

Section

Articles