Dimensionality Reduction in Handwritten Digit Recognition

  • Mayesha Bintha Mizan*
  • , Muhammad Sayyedul Awwab
  • , Anika Tabassum
  • , Kazi Shahriar
  • , Mufti Mahmud
  • , David J. Brown
  • , Muhammad Arifur Rahman
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

For visualization, the concept of dimension is normally enclosed to 2–3 degrees in individuals. A computing node can extend it significantly. However, any increase in the number of dimensions usually introduces an extra computational burden, and it becomes more challenging to extract the exact information. Therefore, dimensionality reduction methods are an increasingly important area of study to help identify methods to mitigate challenges associated with high-dimensional feature sets. Handwritten digit recognition is one of the most relevant fields of study due to the variety of issues faced such as the age of texts, the professional context and norms in which the text is written, and individual differences in writing styles. Research on handwritten digit recognition using various algorithms has been conducted in a variety of languages. In the Bangla character set, there are ten digits. Due to geometry, complicated forms, and similarities between the individual numerals, individual characters are difficult to identify. Also, there are limited open datasets available to researchers to conduct Bangla digit recognition upon. This work discusses dimensionality reduction techniques used in the Bangla Handwritten Digit dataset NumtaDB. Principal Component Analysis (PCA), Neighborhood Component Analysis (NCA), and Linear Discriminant Analysis (LDA) algorithms are examined as feature extraction techniques. CNN is a deep learning technique to classify the input automatically. Over the years, CNN has found a good grip over classifying images for computer visions and now it is being used in other domains too. The numeric digits are then classified using CNN utilizing the lower dimension vectors acquired. These models can recognize most of the digits successfully with a satisfactory level of performance for identifying different digits.

Original languageEnglish
Title of host publicationProceedings of Trends in Electronics and Health Informatics - TEHI 2022
EditorsMufti Mahmud, Claudia Mendoza-Barrera, M. Shamim Kaiser, Anirban Bandyopadhyay, Kanad Ray, Eduardo Lugo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages35-50
Number of pages16
ISBN (Print)9789819919154
DOIs
StatePublished - 2023
Externally publishedYes
Event2nd International Conference on Trends in Electronics and Health Informatics, TEHI 2022 - Puebla, Mexico
Duration: 7 Dec 20229 Dec 2022

Publication series

NameLecture Notes in Networks and Systems
Volume675 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference2nd International Conference on Trends in Electronics and Health Informatics, TEHI 2022
Country/TerritoryMexico
CityPuebla
Period7/12/229/12/22

Bibliographical note

Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Keywords

  • Bangla handwritten digit recognition
  • CNN
  • Dimensionality reduction

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Signal Processing
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Dimensionality Reduction in Handwritten Digit Recognition'. Together they form a unique fingerprint.

Cite this