Medical Imaging AI Competitions Lack Fairness

📅 2025-12-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Medical imaging AI challenges suffer from two systemic fairness deficiencies: insufficient clinical representativeness and poor FAIR (Findable, Accessible, Interoperable, Reusable) compliance. To address this, we conducted the first large-scale systematic assessment of 241 international challenges, introducing a novel dual-dimensional fairness evaluation framework: (1) quantitative assessment of data representativeness—measuring geographic, modality, and disease diversity; and (2) audit of FAIR adherence—evaluating access policies, license legality, and documentation completeness. Leveraging systematic review, meta-analysis, and cross-modal/geographic statistical modeling, we uncovered pronounced geographic and modality biases; over 60% of datasets impose restrictive or ambiguous access terms; 48% lack compliant licenses, and 49% exhibit incomplete documentation. This study provides the first empirical evidence of widespread fairness deficits in medical imaging challenges, delivering an actionable assessment toolkit and concrete improvement pathways for challenge design, data governance, and clinical translation.

Technology Category

Application Category

📝 Abstract
Benchmarking competitions are central to the development of artificial intelligence (AI) in medical imaging, defining performance standards and shaping methodological progress. However, it remains unclear whether these benchmarks provide data that are sufficiently representative, accessible, and reusable to support clinically meaningful AI. In this work, we assess fairness along two complementary dimensions: (1) whether challenge datasets are representative of real-world clinical diversity, and (2) whether they are accessible and legally reusable in line with the FAIR principles. To address this question, we conducted a large-scale systematic study of 241 biomedical image analysis challenges comprising 458 tasks across 19 imaging modalities. Our findings show substantial biases in dataset composition, including geographic location, modality-, and problem type-related biases, indicating that current benchmarks do not adequately reflect real-world clinical diversity. Despite their widespread influence, challenge datasets were frequently constrained by restrictive or ambiguous access conditions, inconsistent or non-compliant licensing practices, and incomplete documentation, limiting reproducibility and long-term reuse. Together, these shortcomings expose foundational fairness limitations in our benchmarking ecosystem and highlight a disconnect between leaderboard success and clinical relevance.
Problem

Research questions and friction points this paper is trying to address.

Medical imaging AI competitions lack fairness in dataset representativeness.
They face accessibility and legal reusability issues under FAIR principles.
Biases and constraints limit clinical relevance and reproducibility of benchmarks.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Systematically assessing dataset representativeness and FAIR compliance
Identifying geographic, modality, and problem type biases in benchmarks
Highlighting restrictive access and licensing as reproducibility barriers
🔎 Similar Papers
No similar papers found.
A
Annika Reinke
German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany
E
Evangelia Christodoulou
German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany
S
Sthuthi Sadananda
Digital Oncology Program, National Center for Tumor Diseases (NCT) Heidelberg, Germany
A
A. Emre Kavur
German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany
K
Khrystyna Faryna
Department of Pathology, Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
D
Daan Schouten
Department of Pathology, Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
B
Bennett A. Landman
Vanderbilt University, Nashville, Tennessee, USA
Carole Sudre
Carole Sudre
University College London
Olivier Colliot
Olivier Colliot
Research Director at CNRS, ARAMIS Lab
machine learningimage analysismedical imagingneuroimagingbrain disorders
N
Nick Heller
Department of Urology, Cleveland Clinic, Cleveland, USA
Sophie Loizillon
Sophie Loizillon
Postdoctoral Researcher, ARAMIS Lab, Paris Brain Institute (ICM), Sorbonne Université
Medical ImagingMachine LearningQuality Control
M
Martin Maška
Centre for Biomedical Image Analysis, Faculty of Informatics, Masaryk University, Brno, Czech Republic
Maëlys Solal
Maëlys Solal
Sorbonne Université, Institut du Cerveau – Paris Brain Institute – ICM, CNRS, Inria, Inserm, AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
A
Arya Yazdan-Panah
Sorbonne Université, Institut du Cerveau – Paris Brain Institute – ICM, CNRS, Inria, Inserm, AP-HP, Hôpital de la Pitié-Salpétrière, Paris, France
V
Vilma Bozgo
Department of Pathology, Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
Ömer Sümer
Ömer Sümer
German Cancer Research Center (DKFZ)
Surgical Data ScienceMedical Image AnalysisComputer VisionMachine Learning
S
Siem de Jong
Department of Pathology, Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
Sophie Fischer
Sophie Fischer
Department of Computer Science, University of Oxford, Oxford, United Kingdom
Michal Kozubek
Michal Kozubek
Masarykova univerzita
analýza obrazu
T
Tim Rädsch
German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany
N
Nadim Hammoud
German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany
F
Fruzsina Molnár-Gábor
Faculty of Law / BioQuant, Heidelberg University, Heidelberg, Germany
Steven Hicks
Steven Hicks
Senior Research Scientist at SimulaMet
Machine LearningDeep LearningExplainable AI
M
Michael A. Riegler
Simula Research Laboratory, Oslo, Norway
A
Anindo Saha
Department of Pathology, Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands