Abstract
This paper discribes how stroke features in document images are extracted and used for the recognition of printed Arabic characters. It is of importance to provide a good base representation that facilitate analysis and processing of document images. The strokes are extracted by a method called Minimum Covering Runs (MCR)[1]. This method of representing binary images by a minimum number of horizontal and vertical runs is used as a preprocessing step. The strokes are labeled and ordered, a feature space for the 100 shapes of the 28 Arabic characters is build. The system is under developement but the recognition rate obtained at this stage, 95.5% is encouraging.
Original language | English |
---|---|
Title of host publication | Image Analysis and Processing - 8th International Conference, ICIAP 1995, Proceedings |
Editors | Carlo Braccini, Leila DeFloriani, Gianni Vernazza |
Publisher | Springer Verlag |
Pages | 557-562 |
Number of pages | 6 |
ISBN (Print) | 3540602984, 9783540602989 |
DOIs | |
State | Published - 1995 |
Externally published | Yes |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 974 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Bibliographical note
Publisher Copyright:© Springer-Verlag Berlin Heidelberg 1995.
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science