When more is less: Incorporating additional datasets can hurt performance by introducing spurious correlations

R Compton, L Zhang, A Puli… - Machine Learning for …, 2023 - proceedings.mlr.press
In machine learning, incorporating more data is often seen as a reliable strategy for
improving model performance; this work challenges that notion by demonstrating that the …

Improve model generalization and robustness to dataset bias with bias-regularized learning and domain-guided augmentation

Y Zhang, H Wu, H Liu, L Tong, MD Wang - arXiv preprint arXiv:1910.06745, 2019 - arxiv.org
Deep Learning has thrived on the emergence of biomedical big data. However, medical
datasets acquired at different institutions have inherent bias caused by various confounding …

Algorithmic encoding of protected characteristics in chest X-ray disease detection models

B Glocker, C Jones, M Bernhardt, S Winzeck - EBioMedicine, 2023 - thelancet.com
Background It has been rightfully emphasized that the use of AI for clinical decision making
could amplify health disparities. An algorithm may encode protected characteristics, and …

Demonstrating the risk of imbalanced datasets in chest x-ray image-based diagnostics by prototypical relevance propagation

S Gautam, MMC Höhne, S Hansen… - 2022 IEEE 19th …, 2022 - ieeexplore.ieee.org
The recent trend of integrating multi-source Chest X-Ray datasets to improve automated
diagnostics raises concerns that models learn to exploit source-specific correlations to …

CheXclusion: Fairness gaps in deep chest X-ray classifiers

L Seyyed-Kalantari, G Liu, M McDermott… - … 2021: proceedings of …, 2020 - World Scientific
Machine learning systems have received much attention recently for their ability to achieve
expert-level performance on clinical tasks, particularly in medical imaging. Here, we …

Pseudo bias-balanced learning for debiased chest x-ray classification

L Luo, D Xu, H Chen, TT Wong, PA Heng - International conference on …, 2022 - Springer
Deep learning models were frequently reported to learn from shortcuts like dataset biases.
As deep learning is playing an increasingly important role in the modern healthcare system …

Improving the fairness of chest x-ray classifiers

H Zhang, N Dullerud, K Roth… - … on health, inference …, 2022 - proceedings.mlr.press
Deep learning models have reached or surpassed human-level performance in the field of
medical imaging, especially in disease diagnosis using chest x-rays. However, prior work …

CheXternal: Generalization of deep learning models for chest X-ray interpretation to photos of chest X-rays and external clinical settings

P Rajpurkar, A Joshi, A Pareek, AY Ng… - Proceedings of the …, 2021 - dl.acm.org
Recent advances in training deep learning models have demonstrated the potential to
provide accurate chest X-ray interpretation and increase access to radiology expertise …

Learning to unlearn: Building immunity to dataset bias in medical imaging studies

A Ashraf, S Khan, N Bhagwat, M Chakravarty… - arXiv preprint arXiv …, 2018 - arxiv.org
Medical imaging machine learning algorithms are usually evaluated on a single dataset.
Although training and testing are performed on different subsets of the dataset, models built …

Assessing and mitigating the effects of class imbalance in machine learning with application to X-ray imaging

W Qu, I Balki, M Mendez, J Valen, J Levman… - International journal of …, 2020 - Springer
Purpose Machine learning (ML) algorithms are well known to exhibit variations in prediction
accuracy when provided with imbalanced training sets typically seen in medical imaging …