3D Object Classification with Selective Multi-View Fusion and Shape Rendering

Mona Alzahrani, Muhammad Usman*, Randah Alharbi, Saeed Anwar, Ajmal Mian, Tarek Helmy

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

3D classification is complex and challenging because of high-dimensional data, the intricate nature of their spatial relationships, and viewpoint variations. We fill the gap in view-based 3D object classification by examining the factors that influence classification's effectiveness via determining their respective merits in feature extraction for 3D object recognition by comparing CNN-based and Transformer-based backbone networks side-by-side. Our research extends to evaluating various fusion strategies to determine the most effective method for integrating multiple views and ascertain the optimal number of views that balances classification and computation. We also probe into the effectiveness of different feature types from rendering techniques in accurately depicting 3D objects. This investigation is supported by an extensive experimental framework, incorporating a diverse set of 3D objects from the ModelNet40 dataset. Finally, based on the analysis, we present a Selective Multi-View deep model (SelectiveMV) that shows efficient performance and provides high accuracy given a few views.

Original languageEnglish
Title of host publicationProceedings - 2024 25th International Conference on Digital Image Computing
Subtitle of host publicationTechniques and Applications, DICTA 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages351-358
Number of pages8
ISBN (Electronic)9798350379037
DOIs
StatePublished - 2024
Event25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024 - Perth, Australia
Duration: 27 Nov 202429 Nov 2024

Publication series

NameProceedings - 2024 25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024

Conference

Conference25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024
Country/TerritoryAustralia
CityPerth
Period27/11/2429/11/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

ASJC Scopus subject areas

  • Signal Processing
  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of '3D Object Classification with Selective Multi-View Fusion and Shape Rendering'. Together they form a unique fingerprint.

Cite this