Semi-Supervised Learning: Assisted Cardiovascular Disease Forecasting using Self-Learning Approaches

Authors

  • Ekramul Haque Tusher Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia
  • Mohd Arfian Ismail Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia
  • Ferose Khan Department of Computer Science and Engineering, International Islamic University Chittagong, Kumira 4318, Bangladesh
  • Anis Farihan Mat Raffei Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia
  • Jalal Uddin Md Akbar Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia

DOI:

https://doi.org/10.37934/araset.56.1.136150

Keywords:

Cardiovascular disease, Naïve Bayes, Semi-supervised learning, Random Forest, Support vector machine

Abstract

Cardiovascular diseases (CVDs) are characteristics that affect both the heart and the blood vessels. This disease is the main factor contributing to the greatest number of deaths globally. In the present global context, it is very difficult to detect cardiovascular diseases by early-stage symptoms. If this isn't diagnosed early, it could lead to death. In order to improve the accuracy of the CVD prediction system, a wide variety of supervised and unsupervised learning approaches from the fields of machine learning were used. Only labelled data is used in supervised learning systems to create a classification model but acquiring sufficient amounts of labelled data takes time and typically requires the cooperation of field experts. However, unlabelled samples are readily available in a variety of real-world situations. More effectively than any other machine learning approach, semi-supervised learning (SSL) addresses this problem by integrating quantities of labelled and unlabelled data to improve the classification model. In this work, we propose semi-supervised learning approaches based on self-learning with Support Vector Machine (SVM), Naïve Bayes (NB) and Random Forest (RB). According to the comparison's findings, SVM has a high classification accuracy rate of 94.68%, a recall rate of 94.41%, a sensitivity rate of 94.49%, a F1 score rate of 92.99%, a precision rate of 91.59% a low, a balanced accuracy rate 94%, a G-mean rate of 94.45 and low Error-rate 5.32%. The model may be used to forecast cardiovascular disorders in the medical profession.

Downloads

Download data is not yet available.

Author Biographies

Ekramul Haque Tusher, Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia

ekramulhaquetusher@gmail.com

Mohd Arfian Ismail, Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia

arfian@ump.edu.my

Ferose Khan, Department of Computer Science and Engineering, International Islamic University Chittagong, Kumira 4318, Bangladesh

ferosekhan322@gmail.com

Anis Farihan Mat Raffei, Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia

anisfarihan@ump.edu.my

Jalal Uddin Md Akbar, Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, 26600 Pekan, Pahang, Malaysia

jalaluddinmdakbar00@gmail.com

Downloads

Published

2024-10-08

Issue

Section

Articles

Most read articles by the same author(s)