Use of Natural Language Processing for the Detection of Hate Speech on Social Media

Mehedi Hasan Shohan; Kazi Rifat Ahmed; Nur Farhan Kahar; Nusrat Jahan; Md. Maruf  Hassan; R. Badlishah Ahmad; Naimah Yaakob; Bi Lynn Ong; Nadira Islam

doi:10.37934/araset.51.2.8696

Authors

Mehedi Hasan Shohan Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
Kazi Rifat Ahmed Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
Nur Farhan Kahar Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
Nusrat Jahan Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
Md. Maruf Hassan Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
R. Badlishah Ahmad Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
Naimah Yaakob Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
Bi Lynn Ong Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia
Nadira Islam Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

DOI:

https://doi.org/10.37934/araset.51.2.8696

Keywords:

Hateful speech, natural language processing, twitter, offensive language, deep natural network

Abstract

Our society’s communication patterns have fundamentally changed as a consequence of the emergence of social media platforms. One effect of these changes is a rise in unpleasant behaviours like making rude and derogatory comments online. Speaking harshly or disrespectfully to someone in person may be difficult. However, online abuse and posting of improper material are considered to be acceptable. Hate speech has the potential to hurt a person or a group of people. Inappropriate material must be identified, in order to be filtered or banned from the web. CNN is a type of deep machine-learning model that has been suggested for such identification, because it performs better than conventional techniques in resolving text categorization problems. Our goal investigates how hate speech may be detected using NLP. In addition, a recent technique has been used in this field to a dataset. This classifier is assigned in each tweet to one of the three Twitter dataset categories of hatred, foul language, or neither. This model’s performance has been assessed with accuracy. The Naïve Bayes, the Decision Tree, KNN, Linear Regression, and the Random Forest are five algorithms that have been used. Of these, Linear Regression provided the greatest accuracy of 94%. It should be noted that when looking at each class separately, many hateful tweets have been mislabelled. It is advisable to look at the outcomes and faults in much detail, in order to comprehend the misclassification. Our analysis shows a better outcome in detecting hateful speech in social media.

Downloads

Download data is not yet available.

Author Biographies

Mehedi Hasan Shohan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

shohan35-2165@diu.edu.bd

Kazi Rifat Ahmed, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

rifat.swe@diu.edu.bd

Nur Farhan Kahar, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

nurfarhan@unimap.edu.my

Nusrat Jahan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

nusrat.swe@diu.edu.bd

Md. Maruf Hassan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

ancssf@gmail.com

R. Badlishah Ahmad, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

badli@unimap.edu.my

Naimah Yaakob, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

naimahyaakob@unimap.edu.my

Bi Lynn Ong , Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

drlynn@unimap.edu.my

Nadira Islam, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

nadira.swe@diu.edu.bd

Use of Natural Language Processing for the Detection of Hate Speech on Social Media

Authors

DOI:

Keywords:

Abstract

Downloads

Author Biographies

Mehedi Hasan Shohan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Kazi Rifat Ahmed, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Nur Farhan Kahar, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Nusrat Jahan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Md. Maruf Hassan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

R. Badlishah Ahmad, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Naimah Yaakob, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Bi Lynn Ong , Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Nadira Islam, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Downloads

Published

Issue

Section

Most read articles by the same author(s)

araset

THE PUBLISHER

PREP

SUBMISSION

Keywords

JOURNAL METRICS AND INDEXING

DISTRIBUTION OF AUTHORS

Information

Use of Natural Language Processing for the Detection of Hate Speech on Social Media

Authors

DOI:

Keywords:

Abstract

Downloads

Author Biographies

Mehedi Hasan Shohan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Kazi Rifat Ahmed, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Nur Farhan Kahar, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Nusrat Jahan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Md. Maruf Hassan, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

R. Badlishah Ahmad, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Naimah Yaakob, Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Bi Lynn Ong , Faculty of Electronic Engineering & Technology, Universiti Malaysia Perlis, 02600 Arau, Perlis, Malaysia

Nadira Islam, Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh

Downloads

Published

Issue

Section

Most read articles by the same author(s)

araset

THE PUBLISHER

PREP

SUBMISSION

Keywords

JOURNAL METRICS AND INDEXING

DISTRIBUTION OF AUTHORS

RELATED PUBLICATION

Information