Detection of questions in Arabic audio monologues using prosodic features

Omair Khan*, Wasfi G. Al-Khatib, Cheded Lahouari

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Prosody has been widely used in many speech-related applications including speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech applications. An important application we investigate is that of identifying question sentences in Arabic Monologue Lectures. Languages other than Arabic have received a lot of attention in this regard. We approach this problem by first segmenting the sentences from the continuous speech using intensity and duration features. Prosodic features are, then, extracted from each sentence. These features are used as input to decision trees to classify each sentence into either Question or Non Question sentence. Our results suggest that questions are cued by more than one type of prosodic features in natural Arabic speech. We used C4.5 decision trees for classification and achieved 75.7% accuracy. Feature specific analysis further reveals that energy and fundamental frequency features are mainly responsible for discriminating between questions and non-question sentences.

Original languageEnglish
Title of host publicationProceedings - 9th IEEE International Symposium on Multimedia, ISM 2007
Pages29-36
Number of pages8
DOIs
StatePublished - 2007

Publication series

NameProceedings - 9th IEEE International Symposium on Multimedia, ISM 2007

ASJC Scopus subject areas

  • General Computer Science
  • Computer Graphics and Computer-Aided Design

Fingerprint

Dive into the research topics of 'Detection of questions in Arabic audio monologues using prosodic features'. Together they form a unique fingerprint.

Cite this