Automated Handling of Anaphoric Ambiguity in Requirements: A Multi-solution Study

  • Saad Ezzini
  • , Sallam Abualhaija
  • , Chetan Arora
  • , Mehrdad Sabetzadeh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

53 Scopus citations

Abstract

Ambiguity is a pervasive issue in natural-language requirements. A common source of ambiguity in requirements is when a pronoun is anaphoric. In requirements engineering, anaphoric ambiguity occurs when a pronoun can plausibly refer to different entities and thus be interpreted differently by different readers. In this paper, we develop an accurate and practical automated approach for handling anaphoric ambiguity in requirements, addressing both ambiguity detection and anaphora interpretation. In view of the multiple competing natural language processing (NLP) and machine learning (ML) technologies that one can utilize, we simultaneously pursue six alternative solutions, empirically assessing each using a col-lection of ˜1,350 industrial requirements. The alternative solution strategies that we consider are natural choices induced by the existing technologies; these choices frequently arise in other automation tasks involving natural-language requirements. A side-by-side em-pirical examination of these choices helps develop insights about the usefulness of different state-of-the-art NLP and ML technologies for addressing requirements engineering problems. For the ambigu-ity detection task, we observe that supervised ML outperforms both a large-scale language model, SpanBERT (a variant of BERT), as well as a solution assembled from off-the-shelf NLP coreference re-solvers. In contrast, for anaphora interpretation, SpanBERT yields the most accurate solution. In our evaluation, (1) the best solution for anaphoric ambiguity detection has an average precision of ˜60% and a recall of 100%, and (2) the best solution for anaphora interpretation (resolution) has an average success rate of ˜98%.

Original languageEnglish
Title of host publicationProceedings - 2022 ACM/IEEE 44th International Conference on Software Engineering, ICSE 2022
PublisherIEEE Computer Society
Pages187-199
Number of pages13
ISBN (Electronic)9781450392211
DOIs
StatePublished - 5 Jul 2022
Externally publishedYes
Event44th ACM/IEEE International Conference on Software Engineering, ICSE 2022 - Hybrid, Pittsburgh, United States
Duration: 22 May 202227 May 2022

Publication series

NameProceedings - International Conference on Software Engineering
Volume2022-May
ISSN (Print)0270-5257

Conference

Conference44th ACM/IEEE International Conference on Software Engineering, ICSE 2022
Country/TerritoryUnited States
CityHybrid, Pittsburgh
Period22/05/2227/05/22

Bibliographical note

Publisher Copyright:
© 2022 ACM.

Keywords

  • Ambiguity
  • BERT
  • Language Models
  • Machine Learning (ML)
  • Natural Language Processing (NLP)
  • Natural-language Requirements
  • Requirements Engineering

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Automated Handling of Anaphoric Ambiguity in Requirements: A Multi-solution Study'. Together they form a unique fingerprint.

Cite this