Abstract
Recognition of cursive handwritten Arabic text is a difficult problem because of context-sensitive character shapes, the non-uniform spacing between words and within a word, diverse placements of dots, and diacritics, and very low inter-class variation among individual classes. In this paper, we review and investigate different deep learning architectures and modeling choices for Arabic handwriting recognition. Further, we address the problem that imbalanced data sets present to deep learning systems. In order to address this issue, we are presenting a novel adaptive data-augmentation algorithm to promote class diversity. This algorithm assigns a weight to each word in the database lexicon. This weight is calculated based on the average probability of each class in a word. Experimental results on the IFN/ENIT and AHDB databases have shown that our presented approach yields state-of-the-art results.
Original language | English |
---|---|
Article number | 9091836 |
Pages (from-to) | 89882-89898 |
Number of pages | 17 |
Journal | IEEE Access |
Volume | 8 |
DOIs | |
State | Published - 2020 |
Bibliographical note
Funding Information:This work was supported by the King Fahd University of Petroleum & Minerals, Dhahran, Saudi Arabia.
Publisher Copyright:
© 2013 IEEE.
Keywords
- Arabic handwriting recognition (AHR)
- Connectionist temporal classification (CTC)
- Convolutional neural networks (CNN)
- Deep learning neural network (DLNN)
- IFN/ENIT database
- Long short-term memory (LSTM)
- Recurrent neural network (RNN)
ASJC Scopus subject areas
- Computer Science (all)
- Materials Science (all)
- Engineering (all)
- Electrical and Electronic Engineering