FSRF: Factorization-guided Semantic Recovery for Incomplete Multimodal Sentiment Analysis

📅 2025-10-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current multimodal sentiment analysis (MSA) methods suffer from poor generalization under modality missing—caused by occlusion, privacy constraints, or device failure. To address this, we propose a factorization-guided semantic recovery framework. Our method introduces: (1) a redundancy-free homogeneous–heterogeneous factorization module that disentangles cross-modal shared semantics from modality-specific representations while suppressing noise; and (2) a distribution-aligned self-distillation mechanism enabling bidirectional knowledge transfer and robust reconstruction of missing modalities. Integrating factorization networks, representation-constrained learning, and distribution alignment, the framework achieves state-of-the-art performance on CMU-MOSEI and CH-SIMS. Notably, it demonstrates significant gains under uncertain modality missing scenarios, validating both its effectiveness and strong generalization capability.

Technology Category

Application Category

📝 Abstract
In recent years, Multimodal Sentiment Analysis (MSA) has become a research hotspot that aims to utilize multimodal data for human sentiment understanding. Previous MSA studies have mainly focused on performing interaction and fusion on complete multimodal data, ignoring the problem of missing modalities in real-world applications due to occlusion, personal privacy constraints, and device malfunctions, resulting in low generalizability. To this end, we propose a Factorization-guided Semantic Recovery Framework (FSRF) to mitigate the modality missing problem in the MSA task. Specifically, we propose a de-redundant homo-heterogeneous factorization module that factorizes modality into modality-homogeneous, modality-heterogeneous, and noisy representations and design elaborate constraint paradigms for representation learning. Furthermore, we design a distribution-aligned self-distillation module that fully recovers the missing semantics by utilizing bidirectional knowledge transfer. Comprehensive experiments on two datasets indicate that FSRF has a significant performance advantage over previous methods with uncertain missing modalities.
Problem

Research questions and friction points this paper is trying to address.

Addressing modality missing in multimodal sentiment analysis
Recovering incomplete semantics through factorization and distillation
Improving model robustness under uncertain missing modalities
Innovation

Methods, ideas, or system contributions that make the work stand out.

Factorization separates modality into homogeneous and heterogeneous representations
Distribution-aligned self-distillation recovers missing semantics bidirectionally
Constraint paradigms guide representation learning for incomplete multimodal data
🔎 Similar Papers
No similar papers found.
Ziyang Liu
Ziyang Liu
Research Fellow, Harvard Medical School; PhD, Tsinghua University
AI4BioGraph EmbeddingLarge Language Model
P
Pengjunfei Chu
School of Advanced Manufacturing Engineering, Hefei University, Hefei, China
S
Shuming Dong
College of Global Talents, BITZH, Beijing Institute of Technology, Zhuhai, Zhuhai, China
C
Chen Zhang
School of Future Science and Engineering, Soochow University, Suzhou, China
Mingcheng Li
Mingcheng Li
Fudan University
J
Jin Wang
School of Future Science and Engineering, Soochow University, Suzhou, China