Deep learning for classification and as tapped-feature generator in medieval word-image recognition

  • Sukalpa Chanda
  • , Emmanuel Okafor
  • , Sebastien Hamel
  • , Dominique Stutzmann
  • , Lambert Schomaker

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Historical manuscripts are the main source of information about past. In recent years, digitization of large quantities of historical handwritten documents is in vogue. This trend gives access to a plethora of information about our medieval past. Such digital archives can be more useful if automatic indexing and retrieval of document images can be provided to the end users of a digital library. An automatic transcription of the full digital archive using traditional Optical Character Recognition (OCR) is still not possible with sufficient accuracy. If full transcription is not available, the end users are interested in indexing and retrieving of particular document pages of their interest. Hence recognition of certain keywords from within the corpus will be sufficient to meet the end users needs. Recently, deep-learning based methods have shown competence in image classification problems. However, one bottleneck with deep-learning based techniques is that it requires a huge amount of training samples per class. Since the number of samples per word class is scarce for collections that are freshly scanned, this is a serious hindrance for direct usage of the deep-learning technique for the purpose of word image recognition in historical document images. This paper aims to investigate the problem of recognizing words from historical document images using a deep-learning based framework for feature extraction and classification while countering the problem of the low amount of image samples using off-line data augmentation techniques. Encouraging results (highest accuracy of 90.03%) were obtained while dealing with 365 different word classes.

Original languageEnglish
Title of host publicationProceedings - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages217-222
Number of pages6
ISBN (Electronic)9781538633465
DOIs
StatePublished - 22 Jun 2018
Externally publishedYes
Event13th IAPR International Workshop on Document Analysis Systems, DAS 2018 - Vienna, Austria
Duration: 24 Apr 201827 Apr 2018

Publication series

NameProceedings - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018

Conference

Conference13th IAPR International Workshop on Document Analysis Systems, DAS 2018
Country/TerritoryAustria
CityVienna
Period24/04/1827/04/18

Bibliographical note

Publisher Copyright:
© 2018 IEEE.

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Signal Processing

Fingerprint

Dive into the research topics of 'Deep learning for classification and as tapped-feature generator in medieval word-image recognition'. Together they form a unique fingerprint.

Cite this