Pretext Matters: An Empirical Study of SSL Methods in Medical Imaging

📅 2026-03-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates how to select optimal self-supervised learning (SSL) methods based on the structural and noise characteristics of medical images to enhance the learning of clinically relevant features. We systematically compare joint embedding architectures (JEAs) and joint embedding prediction architectures (JEPAs) on ultrasound and histopathology images, establishing—for the first time—a principled framework linking SSL objectives to modality-specific properties such as localized signals versus global structure. Expert evaluation by radiologists and pathologists reveals that JEAs are better suited for localized tasks like histopathology analysis, whereas JEPAs excel in tasks requiring global structural understanding, such as liver ultrasound interpretation. This modality-aware selection significantly improves the clinical utility of the learned representations.

Technology Category

Application Category

📝 Abstract
Though self-supervised learning (SSL) has demonstrated incredible ability to learn robust representations from unlabeled data, the choice of optimal SSL strategy can lead to vastly different performance outcomes in specialized domains. Joint embedding architectures (JEAs) and joint embedding predictive architectures (JEPAs) have shown robustness to noise and strong semantic feature learning compared to pixel reconstruction-based SSL methods, leading to widespread adoption in medical imaging. However, no prior work has systematically investigated which SSL objective is better aligned with the spatial organization of clinically relevant signal. In this work, we empirically investigate how the choice of SSL method impacts the learned representations in medical imaging. We select two representative imaging modalities characterized by unique noise profiles: ultrasound and histopathology. When informative signal is spatially localized, as in histopathology, JEAs are more effective due to their view-invariance objective. In contrast, when diagnostically relevant information is globally structured, such as the macroscopic anatomy present in liver ultrasounds, JEPAs are optimal. These differences are especially evident in the clinical relevance of the learned features, as independently validated by board-certified radiologists and pathologists. Together, our results provide a framework for matching SSL objectives to the structural and noise properties of medical imaging modalities.
Problem

Research questions and friction points this paper is trying to address.

self-supervised learning
medical imaging
representation learning
spatial organization
clinical relevance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-supervised learning
Joint embedding architectures
Joint embedding predictive architectures
Medical imaging
Pretext task
🔎 Similar Papers
No similar papers found.
V
Vedrana Ivezić
Biomedical AI Research Lab, University of California, Los Angeles, Los Angeles, CA, USA; Department of Radiology, University of California, Los Angeles, Los Angeles, CA, USA
M
Mara Pleasure
Biomedical AI Research Lab, University of California, Los Angeles, Los Angeles, CA, USA; Department of Radiology, University of California, Los Angeles, Los Angeles, CA, USA
Ashwath Radhachandran
Ashwath Radhachandran
PhD Student, UCLA
medical informaticsmaking machines learndoing science
S
Saarang Panchavati
Biomedical AI Research Lab, University of California, Los Angeles, Los Angeles, CA, USA
Shreeram Athreya
Shreeram Athreya
PhD Student, UCLA
Biomedical image processingComputer visionComputational ImagingMedical Devices
V
Vivek Sant
Division of Endocrine Surgery, University of Texas Southwestern Medical Center, Dallas, TX, USA
B
Benjamin Emert
Department of Pathology, University of California, Los Angeles, Los Angeles, CA, USA
G
Gregory Fishbein
Department of Pathology, University of California, Los Angeles, Los Angeles, CA, USA
C
Corey Arnold
Biomedical AI Research Lab, University of California, Los Angeles, Los Angeles, CA, USA; Department of Radiology, University of California, Los Angeles, Los Angeles, CA, USA; Department of Pathology, University of California, Los Angeles, Los Angeles, CA, USA; Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, USA
William Speier
William Speier
UCLA
Machine learningbrain-computer interfacesmedical image analysis