Bilingual Cyber-aggression detection on social media using LSTM autoencoder

Kirti Kumari, Jyoti Prakash Singh*, Yogesh Kumar Dwivedi, Nripendra Pratap Rana

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

29 Scopus citations

Abstract

Cyber-aggression is an offensive behaviour attacking people based on race, ethnicity, religion, gender, sexual orientation and other traits. It has become a major issue plaguing the online social media. In this research, we have developed a deep learning-based model to identify different levels of aggression (direct, indirect and no aggression) in a social media post in a bilingual scenario. The model is an autoencoder built using the LSTM network and trained with non-aggressive comments only. Any aggressive comment (direct or indirect) will be regarded as an anomaly to the system and will be marked as Overtly (direct) or Covertly (indirect) aggressive comment depending on the reconstruction loss by the autoencoder. The validation results on the dataset from two popular social media sites: Facebook and Twitter with bilingual (English and Hindi) data outperformed the current state-of-the-art models with improvements of more than 11% on the test sets of the English dataset and more than 6% on the test sets of the Hindi dataset.

Original languageEnglish
Pages (from-to)8999-9012
Number of pages14
JournalSoft Computing
Volume25
Issue number14
DOIs
StatePublished - Jul 2021
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Keywords

  • Autoencoder
  • Cyber-aggression
  • Cyberbullying
  • Long Short-Term Memory
  • Online social networks

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Software
  • Geometry and Topology

Fingerprint

Dive into the research topics of 'Bilingual Cyber-aggression detection on social media using LSTM autoencoder'. Together they form a unique fingerprint.

Cite this