Deep models for multi-view 3D object recognition: a review

Mona Alzahrani, Muhammad Usman*, Salma Kammoun Jarraya, Saeed Anwar, Tarek Helmy

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

This review paper focuses on the progress of deep learning-based methods for multi-view 3D object recognition. It covers the state-of-the-art techniques in this field, specifically those that utilize 3D multi-view data as input representation. The paper provides a comprehensive analysis of the pipeline for deep learning-based multi-view 3D object recognition, including the various techniques employed at each stage. It also presents the latest developments in CNN-based and transformer-based models for multi-view 3D object recognition. The review discusses existing models in detail, including the datasets, camera configurations, view selection strategies, pre-trained CNN architectures, fusion strategies, and recognition performance. Additionally, it examines various computer vision applications that use multi-view classification. Finally, it highlights future directions, factors impacting recognition performance, and trends for the development of multi-view 3D object recognition method.

Original languageEnglish
Article number323
JournalArtificial Intelligence Review
Volume57
Issue number12
DOIs
StatePublished - Dec 2024

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

Keywords

  • 3D object classification
  • 3D object recognition
  • 3D object retrieval
  • Multi-view conventional neural network
  • Multi-view object recognition

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Deep models for multi-view 3D object recognition: a review'. Together they form a unique fingerprint.

Cite this