DeFusion: An Effective Decoupling Fusion Network for Multi-Modal Pregnancy Prediction

📅 2025-01-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current IVF pregnancy prediction models struggle to effectively integrate sequential embryo images with parental fertility tabular data, leading to suboptimal accuracy. To address this, we propose a multimodal disentangled fusion framework: (1) a disentanglement module explicitly separates modality-shared and modality-specific features; (2) spatiotemporal positional encoding captures embryonic developmental dynamics; (3) a Tabular Transformer extracts structured representations from clinical fertility metrics; and (4) cross-modal alignment enables deep feature integration. Evaluated on a new, large-scale dataset of 4,046 IVF cases from Southern Medical University, our method significantly outperforms state-of-the-art approaches. Furthermore, when transferred to an ophthalmic disease prediction task—without architectural modification—it maintains strong generalization performance, demonstrating the framework’s universality and robustness across medical domains.

Technology Category

Application Category

📝 Abstract
Temporal embryo images and parental fertility table indicators are both valuable for pregnancy prediction in extbf{in vitro fertilization embryo transfer} (IVF-ET). However, current machine learning models cannot make full use of the complementary information between the two modalities to improve pregnancy prediction performance. In this paper, we propose a Decoupling Fusion Network called DeFusion to effectively integrate the multi-modal information for IVF-ET pregnancy prediction. Specifically, we propose a decoupling fusion module that decouples the information from the different modalities into related and unrelated information, thereby achieving a more delicate fusion. And we fuse temporal embryo images with a spatial-temporal position encoding, and extract fertility table indicator information with a table transformer. To evaluate the effectiveness of our model, we use a new dataset including 4046 cases collected from Southern Medical University. The experiments show that our model outperforms state-of-the-art methods. Meanwhile, the performance on the eye disease prediction dataset reflects the model's good generalization. Our code and dataset are available at https://github.com/Ou-Young-1999/DFNet.
Problem

Research questions and friction points this paper is trying to address.

In Vitro Fertilization
Predictive Modeling
Data Fusion
Innovation

Methods, ideas, or system contributions that make the work stand out.

DeFusion
Temporal Embryo Imaging
IVF Success Prediction
🔎 Similar Papers
No similar papers found.
Xueqiang Ouyang
Xueqiang Ouyang
Information Department, The People’s Hospital of Baoan Shenzhen
Medical artificial intelligence
J
Jia Wei
School of Computer Science and Engineering, South China University of Technology, Guangzhou, China
W
Wenjie Huo
Department of Obstetrics and Gynecology, Nanfang Hospital, Southern Medical University, Guangzhou, China
X
Xiaocong Wang
Department of Obstetrics and Gynecology, Nanfang Hospital, Southern Medical University, Guangzhou, China
R
Rui Li
Golisano College of Computing and Information Sciences, Rochester Institute of Technology, Rochester, NY 14623, USA
Jianlong Zhou
Jianlong Zhou
University of Technology Sydney (UTS)
AI EthicsAI FairnessAI ExplainabilityHuman Centred AIHuman Computer Interaction