Sentiment Analysis on Acceptance of COVID-19 Vaccine for Children based on Support Vector Machine

Authors

  • Muhammad Adam Sani Mohd Sofian College of Computing, Informatics and Mathematics, Universiti Teknologi MARA Cawangan Terengganu, Kampus Kuala Terengganu, 21080 Kuala Terengganu, Malaysia
  • Norlina Mohd Sabri
  • Ummu Fatihah Mohd Bahrin College of Computing, Informatics and Mathematics, Universiti Teknologi MARA Cawangan Terengganu, Kampus Kuala Terengganu, 21080 Kuala Terengganu, Malaysia
  • Hrishvanthika N. Department of Information Technology, Bharathiar University, Coimbatore 641046 , Tamil Nadu, India
  • Norulhidayah Isa College of Computing, Informatics and Mathematics, Universiti Teknologi MARA Cawangan Terengganu, Kampus Kuala Terengganu, 21080 Kuala Terengganu, Malaysia

DOI:

https://doi.org/10.37934/araset.58.2.252270

Keywords:

COVID-19 children vaccine, acceptance, sentiment analysis, support vector machine, Twitter data

Abstract

Sentiment Analysis is a Natural Language Processing (NLP) branch that focuses on the analysis of the public opinions based on specific topics. The studies on sentiment analysis have been increasing since the COVID-19 pandemic in 2020. After the herd immunity has been reached around the world, the attention has shifted toward the children's COVID-19 vaccination program. It is beneficial to mine people’s opinions regarding this issue since kids are often perceived as vulnerable and need the parents’ consent. The machine learning based sentiment analysis has proven to be more efficient in the sentiment classification. This study aims to explore the capability of the Support Vector Machine (SVM) algorithm in the sentiment classification of the COVID-19 vaccination for children based on Twitter data. SVM has been one of the powerful algorithms, but never tested in this classification problem. The dataset for this project was scraped using Twitter API Tweepy based on the keywords such as “COVID vaccine children” and “5 to 11 vaccine”. The SVM model is based on the Linear Kernel and has been tested with the hold out method. The model has performed better in the balanced dataset with the implementation of oversampling by the SMOTE technique. A GUI prototype has also been developed using TKinter for the SVM classifier. The results have been divided into the data exploratory and the algorithm’s performance analyses. In this study, it is found that there are actually more people are supporting the COVID-19 vaccination for children. Meanwhile, based on the performance analysis, SVM has been able to classify the positive and negative tweets with an acceptable accuracy of 82%. The future work includes the scrapping of data from other social media platforms for larger demographics and also to compare SVM performance with other algorithms.

Downloads

Download data is not yet available.

Author Biographies

Muhammad Adam Sani Mohd Sofian, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA Cawangan Terengganu, Kampus Kuala Terengganu, 21080 Kuala Terengganu, Malaysia

adamsani2000@gmail.com

Ummu Fatihah Mohd Bahrin, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA Cawangan Terengganu, Kampus Kuala Terengganu, 21080 Kuala Terengganu, Malaysia

ummufatihah@uitm.edu.my

Hrishvanthika N., Department of Information Technology, Bharathiar University, Coimbatore 641046 , Tamil Nadu, India

hrishvanthika2002@gmail.com

Norulhidayah Isa, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA Cawangan Terengganu, Kampus Kuala Terengganu, 21080 Kuala Terengganu, Malaysia

norul955@uitm.edu.my

Downloads

Published

2024-10-07

Issue

Section

Articles