Assessing the adversarial robustness of multimodal medical AI systems: insights into vulnerabilities and modality interactions

  • Ekaterina Mozhegova
  • Asad Masood Khattak*
  • Adil Khan
  • Roman Garaev
  • Bader Rasheed
  • Muhammad Shahid Anwar

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

1 Scopus citation

Abstract

The emergence of both task-specific single-modality models and general-purpose multimodal large models presents new opportunities but also introduces challenges, particularly regarding adversarial attacks. In high-stakes domains like healthcare, these attacks can severely undermine model reliability and applicability in real-world scenarios, highlighting the critical need for research focused on adversarial robustness. This study investigates the behavior of multimodal models under various adversarial attack scenarios. We conducted experiments involving two modalities: images and text. Our findings indicate that multimodal models exhibit enhanced resilience against adversarial attacks compared to their single-modality counterparts. This supports our hypothesis that the integration of multiple modalities contributes positively to the robustness of deep learning systems. The results of this research advance understanding in the fields of multimodality and adversarial robustness and suggest new avenues for future studies focused on optimizing data flow within multimodal systems.

Original language: English
Article number: 1606238
Journal: Frontiers in Medicine
Volume: 12
State: Published - 2025

Bibliographical note

Publisher Copyright:
Copyright © 2025 Mozhegova, Khattak, Khan, Garaev, Rasheed and Anwar.

Keywords

  • X-ray
  • adversarial attack
  • classification
  • machine learning (ML)
  • multimodal data fusion

ASJC Scopus subject areas

  • General Medicine
