Abstract
Recent advancements in artificial intelligence algorithms for medical imaging show significant potential in automating the detection of lung infections from chest radiograph scans. However, current approaches often focus solely on either 2-D or 3-D scans, failing to leverage the combined advantages of both modalities. Moreover, conventional slice-based methods place a manual burden on radiologists for slice selection. To overcome these challenges, we propose the Recurrent 3-D Multi-level Vision Transformer (R3DM-ViT) model, capable of handling multimodal data to enhance diagnostic accuracy. Our quantitative evaluations demonstrate that R3DM-ViT surpasses existing methods, achieving an impressive accuracy of 96.67%, F1-score of 96.88%, mean average precision of 96.75%, and mean average recall of 97.02%. This research signifies a significant stride forward in the automated detection of lung infections through multimodal imaging.
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings |
| Publisher | IEEE Computer Society |
| Pages | 3205-3211 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798350349399 |
| DOIs | |
| State | Published - 2024 |
| Externally published | Yes |
| Event | 31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates Duration: 27 Oct 2024 → 30 Oct 2024 |
Publication series
| Name | Proceedings - International Conference on Image Processing, ICIP |
|---|---|
| ISSN (Print) | 1522-4880 |
Conference
| Conference | 31st IEEE International Conference on Image Processing, ICIP 2024 |
|---|---|
| Country/Territory | United Arab Emirates |
| City | Abu Dhabi |
| Period | 27/10/24 → 30/10/24 |
Bibliographical note
Publisher Copyright:© 2024 IEEE
Keywords
- CBMIR
- Computer-aided diagnosis
- Medical image retrieval
- R3DM-ViT
- lung infection
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition
- Signal Processing