Statistical analysis of Arabic phonemes used in Arabic speech recognition

Khalid M.O. Nahar*, Mustafa Elshafei, Wasfi G. Al-Khatib, Husni Al-Muhtaseb, Mansour M. Alghamdi

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

This study is specifically concerned with the statistical analysis of the Arabic phonemes due to its significant role in continuous Arabic Speech Recognition System (ASR). When building Arabic speech recognizer , the number of frames that a phoneme occupy, the phoneme boundary and the number of Hidden Markov Model necessary to represent the phoneme are greatly helpful in enhancing the recognition accuracy. In this paper we statically analyze KACST-5 hours corpus, which was used in Arabic speech recognition for both training and recognition. The results showed different set of tables and figures that are helpful for Arabic speech researchers. The paper comes up with a clustering graph for Arabic phonemes based on the median and a trigram table for all phonemes which represent the frequency of a phoneme to appear in trigram. The study was consistent and agreed with Arabic speech scientist's observations.

Original languageEnglish
Title of host publicationNeural Information Processing - 19th International Conference, ICONIP 2012, Proceedings
Pages533-542
Number of pages10
EditionPART 1
DOIs
StatePublished - 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume7663 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Keywords

  • Acoustic Model
  • Arabic Speech Recognition
  • HMM
  • Kacst Arabic speech corpus
  • MFCC files
  • Median
  • Mode
  • Phoneme
  • Standard Deviation
  • Variance

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Statistical analysis of Arabic phonemes used in Arabic speech recognition'. Together they form a unique fingerprint.

Cite this