A novel approach for skew estimation of document images in OCR system

  • M. Sarfraz*
  • , A. Zidouri
  • , S. A. Shahab
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

17 Scopus citations

Abstract

Optical Character Recognition (OCR) is an area which has always received special attention. OCR systems are typically built on the strategy of divide and conquer, rather than recognizing documents at one go. They utilize several stages during the course of recognition. There have been many stages in a typical OCR system, preprocessing stage in considered to be indispensable. An input image or information need to be normalized and converted into format acceptable by OCR system. OCR systems typically assume that documents were printed with a single direction of the text and that the acquisition process did not introduce a relevant skew. Practically this assumption is not very strong and printed document could be skewed at some angle with horizontal axis. In this paper, we have proposed a new technique for skew estimation of image document. In the proposed scheme, multiscale properties of an image are utilized together with Principal Component Analysis to estimate the orientation of principal axis of clustered data.

Original languageEnglish
Title of host publicationProceedings of the Conference on Computer Graphics, Imaging and Vision
Subtitle of host publicationNew Trends 2005
Pages175-180
Number of pages6
DOIs
StatePublished - 2005

Publication series

NameProceedings of the Conference on Computer Graphics, Imaging and Vision: New Trends 2005
Volume2005

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'A novel approach for skew estimation of document images in OCR system'. Together they form a unique fingerprint.

Cite this