🤖 AI Summary
To address the scarcity of high-quality annotated data in cryo-electron microscopy (cryo-EM), this work proposes the first synthetic micrograph generation framework integrating biophysical modeling with a conditional cycle-consistent diffusion model. The method models structural diversity via multi-scale scaffold construction, incorporates mask-aware contrastive learning for spatially adaptive and physically realistic noise synthesis, and enforces cycle consistency to jointly preserve structural fidelity and imaging realism. Experiments demonstrate that the generated micrographs significantly outperform state-of-the-art methods in both particle picking and 3D reconstruction tasks, achieving substantial improvements in structural accuracy. The framework thus provides a high-fidelity, scalable source of synthetic training data for downstream cryo-EM analysis models.
📝 Abstract
Cryo-electron microscopy (cryo-EM) offers near-atomic resolution imaging of macromolecules, but developing robust models for downstream analysis is hindered by the scarcity of high-quality annotated data. While synthetic data generation has emerged as a potential solution, existing methods often fail to capture both the structural diversity of biological specimens and the complex, spatially varying noise inherent in cryo-EM imaging. To overcome these limitations, we propose CryoCCD, a synthesis framework that integrates biophysical modeling with generative techniques. Specifically, CryoCCD produces multi-scale cryo-EM micrographs that reflect realistic biophysical variability through compositional heterogeneity, cellular context, and physics-informed imaging. To generate realistic noise, we employ a conditional diffusion model, enhanced by cycle consistency to preserve structural fidelity and mask-aware contrastive learning to capture spatially adaptive noise patterns. Extensive experiments show that CryoCCD generates structurally accurate micrographs and enhances performance in downstream tasks, outperforming state-of-the-art baselines in both particle picking and reconstruction.