Fuzziness Based Semi-supervised Deep Learning for Multimodal Image Classification

Abeda Asma, Dilshad Noor Mostafa, Koli Akter, Mufti Mahmud, Muhammed J.A. Patwary*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

Predicting a class or label of text-aided image has practical application in a range of domains including social media, machine learning and medical domain. Usually, supervised learning model is used to make such predictions where labeled data is mandatory, which is time consuming and required manual help. Classification of images are accomplished on visual features only by utilizing deep learning. Employing semi-supervised learning is a viable answer to these issues that needs a few label sample to classify huge unlabeled samples. The paper suggests a novel semi-supervised deep learning method based on fuzziness, called (FSSDL-MIC) for multimodal image classification to tackle the challenge of web image classification. For the first time in this scenario, we integrate Multilayer perceptron for textual features and MobileNetV2 for visual features to create a multimodal paradigm. Using data from PASCAL VOC’07, experiments have revealed that the proposed framework achieves significant improvement and outperforms modern techniques for multimodal image categorization. We also see a positive impact of low fuzzy sample when final model trained with visual features only.

Original languageEnglish
Title of host publicationMachine Intelligence and Emerging Technologies - First International Conference, MIET 2022, Proceedings
EditorsMd. Shahriare Satu, Mohammad Ali Moni, M. Shamim Kaiser, Mohammad Shamsul Arefin, Mohammad Shamsul Arefin
PublisherSpringer Science and Business Media Deutschland GmbH
Pages91-105
Number of pages15
ISBN (Print)9783031346217
DOIs
StatePublished - 2023
Externally publishedYes
Event1st International Conference on Machine Intelligence and Emerging Technologies, MIET 2022 - Noakhali, Bangladesh
Duration: 23 Sep 202225 Sep 2022

Publication series

NameLecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
Volume491 LNICST
ISSN (Print)1867-8211
ISSN (Electronic)1867-822X

Conference

Conference1st International Conference on Machine Intelligence and Emerging Technologies, MIET 2022
Country/TerritoryBangladesh
CityNoakhali
Period23/09/2225/09/22

Bibliographical note

Publisher Copyright:
© 2023, ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering.

Keywords

  • Deep Learning
  • Fuzziness
  • Multimodal learning
  • Semi-supervised Learning

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Fuzziness Based Semi-supervised Deep Learning for Multimodal Image Classification'. Together they form a unique fingerprint.

Cite this