Emoji-Based Sentiment Analysis of Arabic Microblogs Using Machine Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Nowadays, there is an explosive growth of social data analytics to measure human reactions to various events. However, the application of natural language processing for text analysis and mining of microblogs is a daunting task with an excessive complexity. In this paper, we explore the idea of adopting new non-verbal features for sentiment analysis of microblogs. We considered 969 emojis and prepared a dataset of 2091 instances written in multidialectal Arabic and each contains at least one emoji. Several machine learning algorithms are evaluated on the suggested features. The experimental results demonstrated that emoji-based features alone can be a very effective means for detecting sentiment polarity with high performance. For instance, when using a multinomial naive Bayes classifier, an F 1 score of 80.30% and AUC of 87.30% were achieved using the 250 most relevant emojis.

Original languageEnglish
Title of host publication21st Saudi Computer Society National Computer Conference, NCC 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781538641095
DOIs
StatePublished - 27 Dec 2018

Publication series

Name21st Saudi Computer Society National Computer Conference, NCC 2018

Bibliographical note

Publisher Copyright:
© 2018 IEEE.

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems
  • Health Informatics

Fingerprint

Dive into the research topics of 'Emoji-Based Sentiment Analysis of Arabic Microblogs Using Machine Learning'. Together they form a unique fingerprint.

Cite this