Multi-label Generalized Zero-Shot Learning Using Identifiable Variational Autoencoders

Muqaddas Gull*, Omar Arif

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Multi-label Zero-Shot Learning (ZSL) is an extension of traditional single-label ZSL, where the objective is to accurately classify images containing multiple unseen classes that are not available during training. Current techniques depends on attention mechanisms and Generative Adversarial Networks (GAN) to address multi-label ZSL and Generalized Zero-Shot Learning (GZSL) challenge. However, generating features for both multi-label ZSL and GZSL in the context of disentangled representation learning remains unexplored. In this paper, we propose an identifiable Variational Autoencoder (iVAE) based generative framework for multi-label ZSL and GZSL. The main idea of our proposed approach is to learn disentangled representations for generating semantically consistent multi-label features using an attribute-level feature fusion technique. We perform comprehensive experiments on two benchmark datasets, NUS-WIDE and MS COCO, for both multi-label ZSL and GZSL. Furthermore, disentangled representation learning for both multi-label ZSL and GZSL on standard datasets achieves commendable performance as compared to existing methods.

Original languageEnglish
Title of host publicationExtended Reality - International Conference, XR Salento 2023, Proceedings
EditorsLucio Tommaso De Paolis, Pasquale Arpaia, Marco Sacco
PublisherSpringer Science and Business Media Deutschland GmbH
Pages35-50
Number of pages16
ISBN (Print)9783031434037
DOIs
StatePublished - 2023
Externally publishedYes
EventProceedings of the International Conference on extended Reality, XR SALENTO 2023 - Lecce, Italy
Duration: 6 Sep 20239 Sep 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14219 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceProceedings of the International Conference on extended Reality, XR SALENTO 2023
Country/TerritoryItaly
CityLecce
Period6/09/239/09/23

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.

Keywords

  • Attribute-Level Feature Fusion
  • Disentangled Representation Learning
  • Generalized Zero-Shot Learning
  • Zero-Shot Learning

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Multi-label Generalized Zero-Shot Learning Using Identifiable Variational Autoencoders'. Together they form a unique fingerprint.

Cite this