Abstract
This study presents an approach to detect Arabic dialects from opinion videos. It conducts a four-way multidialect classification task at utterance level from spoken language (audio), written language (text) and their combination. It also analyzes the effect of speaker on detecting dialects through presenting speaker-dependent and speaker-independent approaches. Word embedding based features are used to represent text modality whereas a combination of time-domain and frequency- domain acoustic features are used to represent audio modality. In case of speaker-independent, textual modality achieves significantly better results than audio modality while combining both modalities results in improving the results yielding F1 score of 63.61%. For speaker-dependent approach, similar performance is achieved for individual audio and text modalities while the best performance is achieved when combining both modalities with F1 score of 85.52%.
| Original language | English |
|---|---|
| Title of host publication | 2019 IEEE Symposium Series on Computational Intelligence, SSCI 2019 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 2470-2475 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781728124858 |
| DOIs | |
| State | Published - Dec 2019 |
Publication series
| Name | 2019 IEEE Symposium Series on Computational Intelligence, SSCI 2019 |
|---|
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- Dialect identification
- information fusion
- intelligent data analysis
- neural language processing
- opinion mining
- social media analysis
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Modeling and Simulation