Arabic handwriting recognition using structural and syntactic pattern attributes

  • Mohammad Tanvir Parvez*
  • , Sabri A. Mahmoud
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

99 Scopus citations

Abstract

In this paper, we present research results on off-line Arabic handwriting recognition using structural techniques. Statistical methods have been more common in the reported research on Arabic handwriting recognition. Structural methods have remained largely unexplored in this regard. However, both statistical and structural techniques can be effectively integrated in multi-classifier based systems. This paper presents, to our knowledge, the first integrated offline Arabic handwritten text recognition system based on structural techniques. In implementing the system, several novel algorithms and techniques for structural recognition of Arabic handwriting are introduced. An Arabic text line is segmented into words/sub-words and dots are extracted. An adaptive slant correction algorithm that is able to correct the different slant angles of the different components of a text line is presented. A novel segmentation algorithm, which is integrated into the recognition phase, is designed based on the nature of Arabic writing and utilizes a polygonal approximation algorithm. This is followed by Arabic character modeling by 'fuzzy' polygons and later recognized using a novel fuzzy polygon matching algorithm. Dynamic programming is used to select best hypotheses of a sequence of recognized characters for each word/sub-word. In addition, several other key ideas, namely prototype selection using set-medians, lexicon reduction using dot-descriptors etc. are utilized to design a robust handwriting recognition system. Results are reported on the benchmarking IfN/ENIT database of Tunisian city names which indicate the robustness and the effectiveness of our system. The recognition rates are comparable to multi-classifier implementations and better than single classifier systems.

Original languageEnglish
Pages (from-to)141-154
Number of pages14
JournalPattern Recognition
Volume46
Issue number1
DOIs
StatePublished - Jan 2013

Bibliographical note

Funding Information:
The authors would like to thank King Fahd University of Petroleum & Minerals (KFUPM) for supporting this research and providing the computing facilities. This work was supported by KACST NSTIP project 08-INF99-4 “Automatic Recognition of Handwritten Arabic Text (ARHAT)”. The authors also thank the anonymous reviewers for their comments that improved this work.

Keywords

  • Arabic OCR
  • Arabic handwriting recognition
  • Median computation
  • Nearest neighbors
  • Structural recognition

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Arabic handwriting recognition using structural and syntactic pattern attributes'. Together they form a unique fingerprint.

Cite this