🤖 AI Summary
Addressing the challenge of balancing 3D synthesis quality and memory efficiency in medical image generation—exacerbated by scarce annotated data—this paper proposes CRF-GAN, the first framework integrating Conditional Random Fields (CRFs) into a 3D generative adversarial network. CRF-GAN enhances anatomical consistency and clinical realism in synthetic chest CT volumes. Trained on the LUNA16 dataset, it reduces GPU memory consumption by 9.34% and accelerates training by 14.6%, while preserving high-fidelity reconstruction. Quantitative evaluation demonstrates superior performance over HA-GAN in Fréchet Inception Distance (FID: 0.047 vs. 0.061) and Maximum Mean Discrepancy (MMD: 0.084 vs. 0.086). Furthermore, a double-blind 2-alternative forced-choice (2AFC) study involving 12 radiologists confirms statistically significant preference for CRF-GAN-generated images (p = 1.93 × 10⁻⁵). This work establishes a new trade-off equilibrium between computational efficiency and clinical credibility in 3D medical image synthesis.
📝 Abstract
Introduction: Generative Adversarial Networks (GANs) are increasingly used to generate synthetic medical images, addressing the critical shortage of annotated data for training Artificial Intelligence (AI) systems. This study introduces a novel memory-efficient GAN architecture, incorporating Conditional Random Fields (CRFs) to generate high-resolution 3D medical images and evaluates its performance against the state-of-the-art hierarchical (HA)-GAN model. Materials and Methods: The CRF-GAN was trained using the open-source lung CT LUNA16 dataset. The architecture was compared to HA-GAN through a quantitative evaluation, using Frechet Inception Distance (FID) and Maximum Mean Discrepancy (MMD) metrics, and a qualitative evaluation, through a two-alternative forced choice (2AFC) test completed by a pool of 12 resident radiologists, in order to assess the realism of the generated images. Results: CRF-GAN outperformed HA-GAN with lower FID (0.047 vs. 0.061) and MMD (0.084 vs. 0.086) scores, indicating better image fidelity. The 2AFC test showed a significant preference for images generated by CRF-Gan over those generated by HA-GAN with a p-value of 1.93e-05. Additionally, CRF-GAN demonstrated 9.34% lower memory usage at 256 resolution and achieved up to 14.6% faster training speeds, offering substantial computational savings. Discussion: CRF-GAN model successfully generates high-resolution 3D medical images with non-inferior quality to conventional models, while being more memory-efficient and faster. Computational power and time saved can be used to improve the spatial resolution and anatomical accuracy of generated images, which is still a critical factor limiting their direct clinical applicability.