Abstract
This paper presents a comprehensive database of printed Arabic text for Arabic text recognition research. It consists of scanned images of different forms of Arabic printed text (book chapters, advertisements, magazines, newspapers, and reports), scanned at resolutions of 200, 300, and 600 dpi. A total of 6954 pages were scanned. The database is intended for the Arabic printed text recognition research community and may serve as a benchmark on which researchers evaluate their algorithms and compare their results with published work by other researchers using the same database. To the best of our knowledge, no comprehensive printed Arabic text database is freely and publicly available; this database may address that deficiency. It will be made freely available to interested researchers. In addition, this paper presents a GUI software environment that makes the created database easier to manipulate and provides a number of image-processing functions that can be used in automatic text recognition.
| Original language | English |
|---|---|
| Pages (from-to) | 587-597 |
| Number of pages | 11 |
| Journal | WSEAS Transactions on Information Science and Applications |
| Volume | 7 |
| Issue number | 4 |
| State | Published - Apr 2010 |
Keywords
- Arabic printed text database
- Arabic text recognition
- OCR
- OCR datasets
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
Fingerprint
Dive into the research topics of 'Benchmark database and GUI environment for printed arabic text recognition research'. Together they form a unique fingerprint.