Skip to main navigation Skip to search Skip to main content

State-of-the-Art Machine Learning Models for Detecting and Mitigating Disparities in Healthcare

  • Yuan Shen*
  • , Mufti Mahmud
  • , Teena Rai
  • , David J. Brown
  • , Jun He
  • , Muhammad Arifur Rahman
  • , David R. Baldwin
  • , Jaspreet Kaur
  • , Emma O’Dowd
  • , Richard B. Hubbard
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Machine learning models have been applied to various healthcare tasks. Such models include both inherently interpretable models and black-box models. In most cases, these models are capable of achieving high accuracy. It is also known that the model should also be well calibrated. Recently, the issues of algorithmic bias in clinical predictive models have attracted attention. This is because such bias would result in disparities in health care, introducing disadvantages to some subgroups of the population. The aim is to detect such disparities and then remove them afterwards. In this perspective, those predictors used by the model need to be differentiated between sensitive variables and the rest. Those sensitive variables include age, race among the others. Among these disparities, the most comprehensible one is so-called data disparities. It is known that a target population usually includes a large number of subgroups. Many of such subgroups could be quite small. When the population data is used for training a predictive model, the resulting characteristics of those outcomes will be largely dominated by a few major subgroups. On the other hand, when we fit the models with individual subgroup data, it is expected that the data in some small subgroups are not sufficient for a proper model training, thus producing disparately predicted outcomes. Most of clinical predictive models don’t include domain-specific knowledge. Causal inference allows for incorporating experts’ knowledge into the relation within the set of predictive variables. The model is referred to as causal-effect model. This approach can help mitigate those disparate outcomes from those small subgroups thanks to inclusion of domain knowledge. The principled approach is to find different but related data set. Generally, it can be done within the frame of transfer learning. Apart from the re-training approaches, domain adaptation can be used to project a number of source domains jointly to a target domain. It is expected that the resulting target domain should have sufficient data even for those small subgroups. It has been debated whether or no protected variables/characteristics (such as race and gender) should be used for clinical predictive models.

Original languageEnglish
Title of host publicationHCI International 2025 – Late Breaking Papers - 27th International Conference on Human-Computer Interaction, HCII 2025, Proceedings
EditorsVincent G. Duffy, Qin Gao, Jia Zhou
PublisherSpringer Science and Business Media Deutschland GmbH
Pages375-385
Number of pages11
ISBN (Print)9783032130242
DOIs
StatePublished - 2026
EventLate breaking papers from the 27th International Conference on Human-Computer Interaction, HCI International 2025 - Gothenburg, Sweden
Duration: 22 Jun 202527 Jun 2025

Publication series

NameLecture Notes in Computer Science
Volume16340 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceLate breaking papers from the 27th International Conference on Human-Computer Interaction, HCI International 2025
Country/TerritorySweden
CityGothenburg
Period22/06/2527/06/25

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 10 - Reduced Inequalities
    SDG 10 Reduced Inequalities

Keywords

  • Algorithmic Bias
  • Causal Structure
  • Conditional Tree Inference
  • Deep Transfer Learning
  • Domain Adaptation
  • Health Disparities
  • Hospital Readmission

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'State-of-the-Art Machine Learning Models for Detecting and Mitigating Disparities in Healthcare'. Together they form a unique fingerprint.

Cite this