Abstract
Machine learning models have been applied to various healthcare tasks. Such models include both inherently interpretable models and black-box models. In most cases, these models are capable of achieving high accuracy. It is also known that the model should also be well calibrated. Recently, the issues of algorithmic bias in clinical predictive models have attracted attention. This is because such bias would result in disparities in health care, introducing disadvantages to some subgroups of the population. The aim is to detect such disparities and then remove them afterwards. In this perspective, those predictors used by the model need to be differentiated between sensitive variables and the rest. Those sensitive variables include age, race among the others. Among these disparities, the most comprehensible one is so-called data disparities. It is known that a target population usually includes a large number of subgroups. Many of such subgroups could be quite small. When the population data is used for training a predictive model, the resulting characteristics of those outcomes will be largely dominated by a few major subgroups. On the other hand, when we fit the models with individual subgroup data, it is expected that the data in some small subgroups are not sufficient for a proper model training, thus producing disparately predicted outcomes. Most of clinical predictive models don’t include domain-specific knowledge. Causal inference allows for incorporating experts’ knowledge into the relation within the set of predictive variables. The model is referred to as causal-effect model. This approach can help mitigate those disparate outcomes from those small subgroups thanks to inclusion of domain knowledge. The principled approach is to find different but related data set. Generally, it can be done within the frame of transfer learning. Apart from the re-training approaches, domain adaptation can be used to project a number of source domains jointly to a target domain. It is expected that the resulting target domain should have sufficient data even for those small subgroups. It has been debated whether or no protected variables/characteristics (such as race and gender) should be used for clinical predictive models.
| Original language | English |
|---|---|
| Title of host publication | HCI International 2025 – Late Breaking Papers - 27th International Conference on Human-Computer Interaction, HCII 2025, Proceedings |
| Editors | Vincent G. Duffy, Qin Gao, Jia Zhou |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 375-385 |
| Number of pages | 11 |
| ISBN (Print) | 9783032130242 |
| DOIs | |
| State | Published - 2026 |
| Event | Late breaking papers from the 27th International Conference on Human-Computer Interaction, HCI International 2025 - Gothenburg, Sweden Duration: 22 Jun 2025 → 27 Jun 2025 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 16340 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | Late breaking papers from the 27th International Conference on Human-Computer Interaction, HCI International 2025 |
|---|---|
| Country/Territory | Sweden |
| City | Gothenburg |
| Period | 22/06/25 → 27/06/25 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 10 Reduced Inequalities
Keywords
- Algorithmic Bias
- Causal Structure
- Conditional Tree Inference
- Deep Transfer Learning
- Domain Adaptation
- Health Disparities
- Hospital Readmission
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science
Fingerprint
Dive into the research topics of 'State-of-the-Art Machine Learning Models for Detecting and Mitigating Disparities in Healthcare'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver