Abstract
This review paper focuses on the progress of deep learning-based methods for multi-view 3D object recognition. It covers the state-of-the-art techniques in this field, specifically those that utilize 3D multi-view data as input representation. The paper provides a comprehensive analysis of the pipeline for deep learning-based multi-view 3D object recognition, including the various techniques employed at each stage. It also presents the latest developments in CNN-based and transformer-based models for multi-view 3D object recognition. The review discusses existing models in detail, including the datasets, camera configurations, view selection strategies, pre-trained CNN architectures, fusion strategies, and recognition performance. Additionally, it examines various computer vision applications that use multi-view classification. Finally, it highlights future directions, factors impacting recognition performance, and trends for the development of multi-view 3D object recognition method.
Original language | English |
---|---|
Article number | 323 |
Journal | Artificial Intelligence Review |
Volume | 57 |
Issue number | 12 |
DOIs | |
State | Published - Dec 2024 |
Bibliographical note
Publisher Copyright:© The Author(s) 2024.
Keywords
- 3D object classification
- 3D object recognition
- 3D object retrieval
- Multi-view conventional neural network
- Multi-view object recognition
ASJC Scopus subject areas
- Language and Linguistics
- Linguistics and Language
- Artificial Intelligence