Abstract
Deep learning models can perform well on complex medical imaging classification tasks even when they base their predictions on spurious correlations (i.e., confounders) that are prevalent in the training dataset, rather than on the causal image markers of interest, thereby limiting their ability to generalize across the population. Explainability based on counterfactual image generation can expose such confounders, but it does not provide a strategy to mitigate the bias. In this work, we introduce the first end-to-end training framework that integrates both (i) popular debiasing classifiers (e.g., distributionally robust optimization (DRO)) to avoid latching onto spurious correlations and (ii) counterfactual image generation to unveil generalizable imaging markers relevant to the task. Additionally, we propose a novel metric, the Spurious Correlation Latching Score (SCLS), to quantify the extent to which the classifier relies on the spurious correlation, as exposed by the counterfactual images. Through comprehensive experiments on two public datasets (with simulated and real visual artifacts), we demonstrate that the debiasing method (i) learns generalizable markers across the population, and (ii) successfully ignores spurious correlations and focuses on the underlying disease pathology.
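The abstract does not spell out the DRO objective it builds on. As a point of reference, a common group-DRO formulation (Sagawa et al.) replaces the average training loss with the loss of the worst-performing group, so a classifier cannot buy overall accuracy by exploiting a shortcut that only works for the majority group. A minimal sketch, assuming per-sample losses and group labels (e.g., presence/absence of a confounding visual artifact) are already available:

```python
import numpy as np

def group_dro_loss(losses: np.ndarray, groups: np.ndarray, n_groups: int) -> float:
    """Worst-group objective from group DRO: take the mean loss of the
    worst-performing group instead of the overall mean (ERM), so the
    optimizer cannot trade minority-group error for a spurious shortcut."""
    group_means = np.array(
        [losses[groups == g].mean() for g in range(n_groups)]
    )
    return float(group_means.max())

# Toy example: group 1 (say, images without the confounding artifact)
# has higher loss, so it alone drives the objective.
losses = np.array([0.1, 0.2, 0.9, 1.1])
groups = np.array([0, 0, 1, 1])
print(group_dro_loss(losses, groups, n_groups=2))  # ~1.0, vs. 0.575 for ERM
```

This is only an illustration of the underlying objective, not the paper's implementation; the framework described above couples such a debiasing loss with a counterfactual image generator trained end-to-end.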
Acknowledgements
The authors are grateful for funding provided by the Natural Sciences and Engineering Research Council of Canada, the Canadian Institute for Advanced Research (CIFAR) Artificial Intelligence Chairs program, the Mila - Quebec AI Institute technology transfer program, Microsoft Research, Calcul Quebec, and the Digital Research Alliance of Canada. S.A. Tsaftaris acknowledges the support of Canon Medical and the Royal Academy of Engineering and the Research Chairs and Senior Research Fellowships scheme (grant RCSRF1819 / 8 / 25), and the UK’s Engineering and Physical Sciences Research Council (EPSRC) support via grant EP/X017680/1.
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Kumar, A. et al. (2023). Debiasing Counterfactuals in the Presence of Spurious Correlations. In: Wesarg, S., et al. (eds.) Clinical Image-Based Procedures, Fairness of AI in Medical Imaging, and Ethical and Philosophical Issues in Medical Imaging. CLIP EPIMI FAIMI 2023. Lecture Notes in Computer Science, vol. 14242. Springer, Cham. https://doi.org/10.1007/978-3-031-45249-9_27
Print ISBN: 978-3-031-45248-2
Online ISBN: 978-3-031-45249-9