Abstract
Requirements engineering commonly employs UML Use Case Diagrams (UCD) to visually capture system interactions and functionality, facilitating clear communication between stakeholders. Recognizing and extracting semantic information from UCDs is essential for applications such as automated requirements extraction and system design validation, which improves software analysis accuracy, and streamlines model understanding for both developers and stakeholders. Recent advancements in large language models (LLMs) with visual processing capabilities enable interpreting intricate diagrammatic content. This paper evaluates multi-modal LLMs, specifically GPT-4o and GPT-4o-mini, in accurately identifying semantic elements within UCDs. We conducted experiments on a new dataset of UCDs and other diagrams collected from online sources. Experimental results show that both models struggled to accurately identify and interpret key UCD elements, often misclassifying or overlooking essential ones.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 861-866 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798331535100 |
| DOIs | |
| State | Published - 2025 |
| Event | 32nd IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025 - Montreal, Canada Duration: 4 Mar 2025 → 7 Mar 2025 |
Publication series
| Name | Proceedings - 2025 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025 |
|---|
Conference
| Conference | 32nd IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025 |
|---|---|
| Country/Territory | Canada |
| City | Montreal |
| Period | 4/03/25 → 7/03/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.
Keywords
- Multi-Modal LLMs
- Requirements engineering
- UML Use Case Diagram
- semantic information
ASJC Scopus subject areas
- Hardware and Architecture
- Software
- Safety, Risk, Reliability and Quality
Fingerprint
Dive into the research topics of 'Evaluating Multi-Modal LLMs for Automatically Recognizing Semantic Elements in UML Use Case Diagram Images'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver