Data-driven Arabic phoneme recognition using varying number of HMM states

Khalid M.O. Nahar*, Wasfi G. Al-Khatib, Moustafa Elshafei, Husni Al-Muhtaseb, Mansour M. Alghamdi

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Continuous Arabic Speech Recognition, appears in many real life applications. Its speed, accuracy and improvement are highly dependent on the accuracy of the language phonemes set. The main goal of this research is to recognize and transcribe the Arabic phonemes based on a data-driven approach. We built a phoneme recognizer based on a data driven approach using HTK tool. Different numbers of Gaussian mixtures with different numbers of HMM states were used in modeling the Arabic phonemes in order to reach the best configuration. The corpus used consists of about 4000 files, representing 5 recorded hours of modern standard Arabic of TV-News. The maximum phoneme recognition accuracy reached was 56.79%. This result is very encouraging and shows the viability of our approach as compared to using a fixed number of HMM states.

Original languageEnglish
Title of host publication2013 1st International Conference on Communications, Signal Processing and Their Applications, ICCSPA 2013
DOIs
StatePublished - 2013

Publication series

Name2013 1st International Conference on Communications, Signal Processing and Their Applications, ICCSPA 2013

Keywords

  • Arabic Speech Recognition
  • KFUPM Arabic speech
  • Phoneme recognition
  • corpus HMM

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Signal Processing

Fingerprint

Dive into the research topics of 'Data-driven Arabic phoneme recognition using varying number of HMM states'. Together they form a unique fingerprint.

Cite this