3D-LLDM: Label-Guided 3D Latent Diffusion Model for Improving High-Resolution Synthetic MR Imaging in Hepatic Structure Segmentation

📅 2026-03-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
High-quality annotated data are scarce in medical imaging, limiting the application of generative models for liver structure segmentation. This work proposes a label-guided 3D latent diffusion model based on ControlNet, introducing structural label guidance into 3D medical image generation for the first time. Leveraging hepatobiliary-phase Gd-EOB-DTPA-enhanced MR images and their corresponding anatomical masks, the model simultaneously synthesizes high-fidelity MR volumes and accurate segmentation labels. The method achieves a Fréchet Inception Distance (FID) of 28.31, representing improvements of 70.9% and 26.7% over conventional GANs and existing diffusion models, respectively. When employed for data augmentation, it boosts the Dice score for liver tumor segmentation by up to 11.153%, substantially enhancing downstream task performance.

Technology Category

Application Category

📝 Abstract
Deep learning and generative models are advancing rapidly, with synthetic data increasingly being integrated into training pipelines for downstream analysis tasks. However, in medical imaging, their adoption remains constrained by the scarcity of reliable annotated datasets. To address this limitation, we propose 3D-LLDM, a label-guided 3D latent diffusion model that generates high-quality synthetic magnetic resonance (MR) volumes with corresponding anatomical segmentation masks. Our approach uses hepatobiliary phase MR images enhanced with the Gd-EOB-DTPA contrast agent to derive structural masks for the liver, portal vein, hepatic vein, and hepatocellular carcinoma, which then guide volumetric synthesis through a ControlNet-based architecture. Trained on 720 real clinical hepatobiliary phase MR scans from Samsung Medical Center, 3D-LLDM achieves a Fréchet Inception Distance (FID) of 28.31, improving over GANs by 70.9% and over state-of-the-art diffusion baselines by 26.7%. When used for data augmentation, the synthetic volumes improve hepatocellular carcinoma segmentation by up to 11.153% Dice score across five CNN architectures.
Problem

Research questions and friction points this paper is trying to address.

synthetic medical imaging
data scarcity
hepatic segmentation
annotated datasets
MR image synthesis
Innovation

Methods, ideas, or system contributions that make the work stand out.

label-guided diffusion
3D latent diffusion model
synthetic MR imaging
ControlNet
medical image segmentation
K
Kyeonghun Kim
OUTTA, Republic of Korea
J
Jaehyeok Bae
Stanford University, USA
Y
Youngung Han
Seoul National University, Republic of Korea
J
Joo Young Bae
Seoul National University, Republic of Korea
S
Seoyoung Ju
OUTTA, Republic of Korea
J
Junsu Lim
OUTTA, Republic of Korea
G
Gyeongmin Kim
Chung-Ang University, Republic of Korea
N
Nam-Joon Kim
Seoul National University, Republic of Korea
W
Woo Kyoung Jeong
Samsung Medical Center, Republic of Korea
K
Ken Ying-Kai Liao
NVIDIA, Taiwan
W
Won Jae Lee
Samsung Medical Center, Republic of Korea
P
Pa Hong
Samsung Changwon Hospital, Republic of Korea
Hyuk-Jae Lee
Hyuk-Jae Lee
Seoul National University, Department of Electrical and Computer Engineering
인공지능메모리 아키텍처자율주행영상처리