Skip to main navigation Skip to search Skip to main content

Evaluating Multi-Modal LLMs for Automatically Recognizing Semantic Elements in UML Use Case Diagram Images

  • Jameleddine Hassine*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Requirements engineering commonly employs UML Use Case Diagrams (UCD) to visually capture system interactions and functionality, facilitating clear communication between stakeholders. Recognizing and extracting semantic information from UCDs is essential for applications such as automated requirements extraction and system design validation, which improves software analysis accuracy, and streamlines model understanding for both developers and stakeholders. Recent advancements in large language models (LLMs) with visual processing capabilities enable interpreting intricate diagrammatic content. This paper evaluates multi-modal LLMs, specifically GPT-4o and GPT-4o-mini, in accurately identifying semantic elements within UCDs. We conducted experiments on a new dataset of UCDs and other diagrams collected from online sources. Experimental results show that both models struggled to accurately identify and interpret key UCD elements, often misclassifying or overlooking essential ones.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages861-866
Number of pages6
ISBN (Electronic)9798331535100
DOIs
StatePublished - 2025
Event32nd IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025 - Montreal, Canada
Duration: 4 Mar 20257 Mar 2025

Publication series

NameProceedings - 2025 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025

Conference

Conference32nd IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025
Country/TerritoryCanada
CityMontreal
Period4/03/257/03/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Keywords

  • Multi-Modal LLMs
  • Requirements engineering
  • UML Use Case Diagram
  • semantic information

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Evaluating Multi-Modal LLMs for Automatically Recognizing Semantic Elements in UML Use Case Diagram Images'. Together they form a unique fingerprint.

Cite this