Abstract
Predicting a class or label of text-aided image has practical application in a range of domains including social media, machine learning and medical domain. Usually, supervised learning model is used to make such predictions where labeled data is mandatory, which is time consuming and required manual help. Classification of images are accomplished on visual features only by utilizing deep learning. Employing semi-supervised learning is a viable answer to these issues that needs a few label sample to classify huge unlabeled samples. The paper suggests a novel semi-supervised deep learning method based on fuzziness, called (FSSDL-MIC) for multimodal image classification to tackle the challenge of web image classification. For the first time in this scenario, we integrate Multilayer perceptron for textual features and MobileNetV2 for visual features to create a multimodal paradigm. Using data from PASCAL VOC’07, experiments have revealed that the proposed framework achieves significant improvement and outperforms modern techniques for multimodal image categorization. We also see a positive impact of low fuzzy sample when final model trained with visual features only.
Original language | English |
---|---|
Title of host publication | Machine Intelligence and Emerging Technologies - First International Conference, MIET 2022, Proceedings |
Editors | Md. Shahriare Satu, Mohammad Ali Moni, M. Shamim Kaiser, Mohammad Shamsul Arefin, Mohammad Shamsul Arefin |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 91-105 |
Number of pages | 15 |
ISBN (Print) | 9783031346217 |
DOIs | |
State | Published - 2023 |
Externally published | Yes |
Event | 1st International Conference on Machine Intelligence and Emerging Technologies, MIET 2022 - Noakhali, Bangladesh Duration: 23 Sep 2022 → 25 Sep 2022 |
Publication series
Name | Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST |
---|---|
Volume | 491 LNICST |
ISSN (Print) | 1867-8211 |
ISSN (Electronic) | 1867-822X |
Conference
Conference | 1st International Conference on Machine Intelligence and Emerging Technologies, MIET 2022 |
---|---|
Country/Territory | Bangladesh |
City | Noakhali |
Period | 23/09/22 → 25/09/22 |
Bibliographical note
Publisher Copyright:© 2023, ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering.
Keywords
- Deep Learning
- Fuzziness
- Multimodal learning
- Semi-supervised Learning
ASJC Scopus subject areas
- Computer Networks and Communications