AT-ODTSA: a Dataset of Arabic Tweets for Open Domain Targeted Sentiment Analysis

  • Shaaban Sahmoud
  • , Shadi Abudalfa*
  • , Wisam Elmasry
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

In the field of sentiment analysis, most of research has conducted experiments on datasets collected from Twitter for manipulating a specific language. Little number of datasets has been collected for detecting sentiments expressed in Arabic tweets. Moreover, very limited number of such datasets is suitable for conducting recent research directions such as target dependent sentiment analysis and open-domain targeted sentiment analysis. Thereby, there is a dire need for reliable datasets that are specifically acquired for open-domain targeted sentiment analysis with Arabic language. Therefore, in this paper, we introduce AT-ODTSA, a dataset of Arabic Tweets for Open-Domain Targeted Sentiment Analysis, which includes Arabic tweets along with labels that specify targets (topics) and sentiments (opinions) expressed in the collected tweets. To the best of our knowledge, our work presents the first dataset that manually annotated for applying Arabic open-domain targeted sentiment analysis. We also present a detailed statistical analysis of the dataset. The AT-ODTSA dataset is suitable for train numerous machine learning models such as a deep learning-based model.

Original languageEnglish
Pages (from-to)1299-1307
Number of pages9
JournalInternational Journal of Computing and Digital Systems
Volume11
Issue number1
DOIs
StatePublished - 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022 University of Bahrain. All rights reserved.

Keywords

  • Arabic Tweets
  • Open-Domain Targeted Sentiment Analysis
  • Sentiment Analysis
  • Target Dependent

ASJC Scopus subject areas

  • Information Systems
  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Graphics and Computer-Aided Design
  • Artificial Intelligence
  • Management of Technology and Innovation

Fingerprint

Dive into the research topics of 'AT-ODTSA: a Dataset of Arabic Tweets for Open Domain Targeted Sentiment Analysis'. Together they form a unique fingerprint.

Cite this