TY - GEN
T1 - Identification of Question and Non-Question segments in Arabic Monologue based on prosodic features using type-2 fuzzy logic systems
AU - Olatunji, Sunday Olusanya
AU - Cheded, Lahouari
AU - Al-Khatib, Wasfi G.
PY - 2010
Y1 - 2010
AB - In this work, we propose the use of type-2 fuzzy logic systems (type-2 FLS) to identify question and non-question segments in Arabic monologues based on prosodic features. Prosody has been widely used in speech-related tasks, including speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech applications. The application investigated here is the identification of question sentences in Arabic monologue lectures. Languages other than Arabic have received considerable attention in this regard, which motivates this work's focus on Arabic. Sentences are first segmented from the continuous speech using energy and duration features, and prosodic features are then extracted from each sentence. These features are used as input to the two classifiers, which label each sentence as either a question or a non-question. The results are compared with those of the previously used support vector machine (SVM) and indicate that the proposed type-2 FLS model outperforms the SVM in all the experiments carried out, mainly due to its superior ability to handle uncertainties in the feature set.
KW - Arabic audio monologues
KW - Content analysis
KW - Prosodic analysis
KW - Question/Answer discrimination
KW - Type-2 fuzzy logic
UR - http://www.scopus.com/inward/record.url?scp=79952090123&partnerID=8YFLogxK
U2 - 10.1109/CIMSiM.2010.41
DO - 10.1109/CIMSiM.2010.41
M3 - Conference contribution
AN - SCOPUS:79952090123
SN - 9780769542621
T3 - Proceedings - 2nd International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2010
SP - 149
EP - 153
BT - Proceedings - 2nd International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2010
ER -