Visualising Current Research Trends in Class Imbalance using Clustering Approach: A Bibliometrics Analysis

Authors

  • Nurul Syahida Abu Bakar STEM Foundation Center, Universiti Malaysia Terengganu (UMT), 21300 Kuala Nerus, Terengganu, Malaysia
  • Wan Fairos Wan Yaacob Mathematical Sciences Studies, College of Computing, Informatics and Media, Universiti Teknologi MARA Cawangan Kelantan, Kampus Kota Bharu, 15050 Kota Bharu, Kelantan, Malaysia
  • Yap Bee Wah UNITAR Graduate School, UNITAR International University, 47301 Petaling Jaya, Selangor, Malaysia
  • Wan Marhaini Wan Omar Faculty of Business and Management, Universiti Teknologi MARA (UiTM), Cawangan Kelantan, Kampus Kota Bharu, 15050 Kota Bharu, Kelantan Malaysia
  • Utriweni Mukhaiyar Statistics Research Division, Faculty of Mathematics and Natural Sciences, Bandung Institute of Technology, Indonesia

DOI:

https://doi.org/10.37934/araset.38.2.95111

Keywords:

Imbalance problem, clustering approach, network analysis, science mapping, big data

Abstract

In recent years, extensive research has been carried out on big data class imbalance problems using the clustering approach. The bibliometric analysis employs statistical techniques to map and assess trends in a specific research domain based on keywords, author affiliations, and citations. Bibliographic analysis assists us in comprehending unstructured big data. This study aims to present a comprehensive literature review on class imbalance problems using the clustering approach and identify gaps in the research domain using bibliometric analytical techniques. The Scopus and Web of Science databases were used to extract 663 articles on class imbalance data using a clustering approach published between 2010 and 2021. We used the VOS (Visualisation of Similarities) viewer to visualise the bibliometric analytical outcomes. Co-citation and co-word analysis were used to visualise the publication trend and identify areas of current research interest. The study's key findings evidenced a growing interest in the research domain. Herrera, f., and Chawla N. V. are dominant authors in this field, and China is leading the publication in the clustering approach for the big data imbalance problem. The top three affiliations are from China: Tsinghua University, the Chinese Academy of Sciences, and Beihang University. Conducting an in-depth bibliometric analysis using other databases such as Science Direct, IEEE, and Emerald is recommended. This study may assist researchers in understanding the nature of the big data imbalance problem using a clustering approach and providing insights for future research derived from these worldwide databases.

Downloads

Download data is not yet available.

Author Biographies

Nurul Syahida Abu Bakar, STEM Foundation Center, Universiti Malaysia Terengganu (UMT), 21300 Kuala Nerus, Terengganu, Malaysia

nsab@umt.edu.my

Wan Fairos Wan Yaacob, Mathematical Sciences Studies, College of Computing, Informatics and Media, Universiti Teknologi MARA Cawangan Kelantan, Kampus Kota Bharu, 15050 Kota Bharu, Kelantan, Malaysia

wnfairos@uitm.edu.my

Yap Bee Wah, UNITAR Graduate School, UNITAR International University, 47301 Petaling Jaya, Selangor, Malaysia

bee.wah@unitar.my

Wan Marhaini Wan Omar, Faculty of Business and Management, Universiti Teknologi MARA (UiTM), Cawangan Kelantan, Kampus Kota Bharu, 15050 Kota Bharu, Kelantan Malaysia

whaini299@uitm.edu.my

Utriweni Mukhaiyar, Statistics Research Division, Faculty of Mathematics and Natural Sciences, Bandung Institute of Technology, Indonesia

utriweni@math.itb.ac.id

Published

2024-01-30

Issue

Section

Articles