Use of Natural Language Processing for the Detection of Hate Speech on Social Media

Authors

  • Mehedi Hasan Shohan Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
  • Kazi Rifat Ahmed Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
  • Nur Farhan Kahar Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
  • Nusrat Jahan Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
  • Md. Maruf Hassan Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
  • R. Badlishah Ahmad Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
  • Naimah Yaakob Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
  • Bi Lynn Ong Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
  • Nadira Islam Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

DOI:

https://doi.org/10.37934/araset.51.2.8696

Keywords:

Hateful speech, natural language processing, twitter, offensive language, deep natural network

Abstract

Our society’s communication patterns have fundamentally changed as a consequence of the emergence of social media platforms. One effect of these changes is a rise in unpleasant behaviours like making rude and derogatory comments online. Speaking harshly or disrespectfully to someone in person may be difficult. However, online abuse and posting of improper material are considered to be acceptable. Hate speech has the potential to hurt a person or a group of people. Inappropriate material must be identified, in order to be filtered or banned from the web. CNN is a type of deep machine-learning model that has been suggested for such identification, because it performs better than conventional techniques in resolving text categorization problems. Our goal investigates how hate speech may be detected using NLP. In addition, a recent technique has been used in this field to a dataset. This classifier is assigned in each tweet to one of the three Twitter dataset categories of hatred, foul language, or neither. This model’s performance has been assessed with accuracy. The Naïve Bayes, the Decision Tree, KNN, Linear Regression, and the Random Forest are five algorithms that have been used. Of these, Linear Regression provided the greatest accuracy of 94%. It should be noted that when looking at each class separately, many hateful tweets have been mislabelled. It is advisable to look at the outcomes and faults in much detail, in order to comprehend the misclassification. Our analysis shows a better outcome in detecting hateful speech in social media.

Downloads

Download data is not yet available.

Author Biographies

Mehedi Hasan Shohan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

shohan35-2165@diu.edu.bd

Kazi Rifat Ahmed, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

rifat.swe@diu.edu.bd

Nur Farhan Kahar, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

nurfarhan@unimap.edu.my

Nusrat Jahan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

nusrat.swe@diu.edu.bd

Md. Maruf Hassan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

ancssf@gmail.com

R. Badlishah Ahmad, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

badli@unimap.edu.my

Naimah Yaakob, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

naimahyaakob@unimap.edu.my

Bi Lynn Ong , Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

drlynn@unimap.edu.my

Nadira Islam, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

nadira.swe@diu.edu.bd

Downloads

Published

2024-09-19

How to Cite

Shohan, M. H., Ahmed, K. R., Kahar, N. F., Jahan, N., Hassan, M. M. ., Ahmad, R. B., Yaakob, N., Ong , B. L., & Islam, N. (2024). Use of Natural Language Processing for the Detection of Hate Speech on Social Media. Journal of Advanced Research in Applied Sciences and Engineering Technology, 51(2), 86–96. https://doi.org/10.37934/araset.51.2.8696

Issue

Section

Articles