HatEmoTweet: Low-level Emotion Classifications And Spatiotemporal Trends Of Hate And Offensive COVID-19 Tweets
Abstract
Social media platforms (like Twitter) positively and negatively impact users in diverse societies; one of Twitter's negative effects is the usage of hate and offensive language. Hate speech fosters prejudice; it also harms the vulnerable. There are always emotions associated with hateful and offensive actions. This work addressed hate and offensive tweet detection, low-level emotional classifications using 28 labels to train transformers models in three ways (model 1— BERTG28 , model 2— BERTG27 , and model 3— RoBERTaG27) before predicting the hateful and offensive tweets emotions. Model 1 was trained on low-level labels, and models 2 and 3 were trained on 27 labels excluding the neutral label. This study performed topic modeling to extract the discussed theme, spatiotemporal trends to determine where and when these tweets occurred, and event summarization for identified hate and offensive tweets. GoEmotions and Ekman were used for direct and indirect assessment, respectively, to evaluate the model's precision, recall, and F1-score. In terms of precision evaluation, the model 1 outperformed Google Research on GoEmotions. Furthermore, this study's model 2 and model 3 outperformed the Google research on both the GoEmotions and Ekman's evaluation in terms of precision and F1-score. Generally, model 2 was the best model in the analysis for both recall and F1-score while model 3 performed better for precision. Due to the training on samples without the neutral label, model 2 obtained 27% and model 3 achieved 29% label prediction out of the 30% neutral samples that was predicted in model 1 for hate and offensive tweets. This is a significant improvement to optimize classified emotions that are not truly neutral by eliminating the false neutral class.
Recommended Citation
A. Adesokan et al., "HatEmoTweet: Low-level Emotion Classifications And Spatiotemporal Trends Of Hate And Offensive COVID-19 Tweets," Social Network Analysis and Mining, vol. 13, no. 1, article no. 136, Springer, Dec 2023.
The definitive version is available at https://doi.org/10.1007/s13278-023-01132-6
Department(s)
Computer Science
Keywords and Phrases
COVID-19 pandemic; Emotion classification; Event summarization; Spatiotemporal; Topic modeling
International Standard Serial Number (ISSN)
1869-5469; 1869-5450
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2023 Springer, All rights reserved.
Publication Date
01 Dec 2023
Comments
National Science Foundation, Grant CNS-2219614