Arabic broadcast news transcription system

Mansour Alghamdi*, Moustafa Elshafei, Husni Al-Muhtaseb

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

38 Scopus citations

Abstract

This paper describes the development of an Arabic broadcast news transcription system. The presented system is a speaker-independent large vocabulary natural Arabic speech recognition system, and it is intended to be a test bed for further research into the open ended problem of achieving natural language man-machine conversation. The system addresses a number of challenging issues pertaining to the Arabic language, e.g. generation of fully vocalized transcription, and rule-based spelling dictionary. The developed Arabic speech recognition system is based on the Carnegie Mellon university Sphinx tools. The Cambridge HTK tools were also utilized at various testing stages. The system was trained on 7.0 hours of a 7.5 hours of Arabic broadcast news corpus and tested on the remaining half an hour. The corpus was made to focus on economics and sport news. At this experimental stage, the Arabic news transcription system uses five-state HMM for triphone acoustic models, with 8 and 16 Gaussian mixture distributions. The state distributions were tied to about 1680 senons. The language model uses both bi-grams and tri-grams. The test set consisted of 400 utterances containing 3585 words. The Word Error Rate (WER) came initially to 10.14 percent. After extensive testing and tuning of the recognition parameters the WER was reduced to about 8.61% for non-vocalized text transcription.

Original languageEnglish
Pages (from-to)183-195
Number of pages13
JournalInternational Journal of Speech Technology
Volume10
Issue number4
DOIs
StatePublished - Dec 2007

Bibliographical note

Funding Information:
Acknowledgements This work was supported by a grant #AT-24-94 by King Abdulaziz City of Science and Technology. The authors would like also to thank King Fahd University of Petroleum and Minerals for its support in carrying out this project.

Keywords

  • Arabic natural language
  • Arabic speech corpus
  • Arabic speech recognition
  • HMM
  • News transcription
  • Phonetic dictionary
  • Sphinx training

ASJC Scopus subject areas

  • Software
  • Language and Linguistics
  • Human-Computer Interaction
  • Linguistics and Language
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Arabic broadcast news transcription system'. Together they form a unique fingerprint.

Cite this