From CNNs to Transformers: A Review of Evolving Deep Learning Architectures for Brain Tumor Classification

  • Muhammad Aamir
  • , Ziaur Rahman
  • , Nomica Choudhry
  • , Jameel Ahmed Bhutto
  • , Waheed Ahmed Abro
  • , Zemin Zhu*
  • *Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

2 Scopus citations

Abstract

Accurate brain tumor classification is critical for patient prognosis and treatment planning, yet manual interpretation of medical images like MRI is subject to variability. Deep learning has emerged as a powerful tool for this task. This review charts the evolution of deep learning architectures for brain tumor classification. We conducted a comprehensive literature review, focusing on the architectural progression from foundational Convolutional Neural Networks (CNNs) to modern attention-based Transformer models. Key datasets, evaluation metrics, and clinical challenges are synthesized. The review details the trajectory from early CNNs (e.g., AlexNet, VGG), which excelled at local feature extraction, to advanced variants like ResNet, U-Net, and DenseNet that improved performance and enabled segmentation-classification workflows. The paradigm then shifted to Vision Transformers (ViT, Swin Transformer) and hybrid models, which explicitly model long-range dependencies and global context, often achieving state-of-the-art results. Challenges such as domain shift, data scarcity, and the need for explainability (XAI) are persistent themes. While both CNNs and Transformers have demonstrated high accuracy, the current state-of-the-art often involves hybrid architectures that leverage the strengths of both. Future progress lies in developing generalizable, efficient, and trustworthy models through techniques like self-supervised and federated learning, multimodal data fusion, and the development of large-scale medical foundation models, ultimately aiming to empower clinicians and improve patient outcomes.

Original languageEnglish
Pages (from-to)184918-184936
Number of pages19
JournalIEEE Access
Volume13
DOIs
StatePublished - 2025
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2013 IEEE.

Keywords

  • Brain tumor classification
  • CNN
  • MRI
  • ViT
  • Vision Transformers
  • convolutional neural networks
  • deep learning
  • medical imaging
  • transformers

ASJC Scopus subject areas

  • General Computer Science
  • General Materials Science
  • General Engineering

Fingerprint

Dive into the research topics of 'From CNNs to Transformers: A Review of Evolving Deep Learning Architectures for Brain Tumor Classification'. Together they form a unique fingerprint.

Cite this