Generalizable Engagement Estimation in Conversation via Domain Prompting and Parallel Attention

📅 2025-08-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Dialogue engagement estimation faces challenges including poor cross-domain generalization, limited adaptability to cross-cultural and multilingual settings, and difficulty in modeling interactive dynamics—hindering robust deployment of human–computer interaction systems. To address these, we propose a domain-adaptive engagement modeling framework: (1) a domain prompt mechanism that employs learnable, domain-specific vectors to guide input representation learning; and (2) a parallel cross-attention module integrating forward and backward BiLSTMs to jointly model interlocutors’ reactive behaviors and anticipatory state alignment. Evaluated on multiple cross-cultural and multilingual benchmarks, our method achieves significant gains in generalization performance—yielding an absolute improvement of 0.45 in Concordance Correlation Coefficient (CCC) on the NoXi-J test set. Furthermore, it secured first place in the MultiMediate’25 Multidomain Engagement Estimation Challenge.

Technology Category

Application Category

📝 Abstract
Accurate engagement estimation is essential for adaptive human-computer interaction systems, yet robust deployment is hindered by poor generalizability across diverse domains and challenges in modeling complex interaction dynamics.To tackle these issues, we propose DAPA (Domain-Adaptive Parallel Attention), a novel framework for generalizable conversational engagement modeling. DAPA introduces a Domain Prompting mechanism by prepending learnable domain-specific vectors to the input, explicitly conditioning the model on the data's origin to facilitate domain-aware adaptation while preserving generalizable engagement representations. To capture interactional synchrony, the framework also incorporates a Parallel Cross-Attention module that explicitly aligns reactive (forward BiLSTM) and anticipatory (backward BiLSTM) states between participants.Extensive experiments demonstrate that DAPA establishes a new state-of-the-art performance on several cross-cultural and cross-linguistic benchmarks, notably achieving an absolute improvement of 0.45 in Concordance Correlation Coefficient (CCC) over a strong baseline on the NoXi-J test set. The superiority of our method was also confirmed by winning the first place in the Multi-Domain Engagement Estimation Challenge at MultiMediate'25.
Problem

Research questions and friction points this paper is trying to address.

Improving cross-domain generalization in engagement estimation
Modeling complex interaction dynamics in conversations
Addressing poor generalizability across diverse cultural contexts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Domain Prompting with learnable vectors
Parallel Cross-Attention for interaction synchrony
BiLSTM alignment of reactive and anticipatory states
🔎 Similar Papers
No similar papers found.
Y
Yangchen Yu
School of Computer Science and Information Engineering, Hefei University of Technology
Yin Chen
Yin Chen
Lecturer in Mathematics at University of Saskatchewan
Invariant theoryLie theoryCommutative algebraApplied algebraic geometry
J
Jia Li
School of Computer Science and Information Engineering, Hefei University of Technology
P
Peng Jia
School of Computer Science and Information Engineering, Hefei University of Technology
Y
Yu Zhang
School of Computer Science and Information Engineering, Hefei University of Technology
L
Li Dai
School of Computer Science and Information Engineering, Hefei University of Technology
Zhenzhen Hu
Zhenzhen Hu
Hefei University of Technology
Multimedia
M
Meng Wang
School of Computer Science and Information Engineering, Hefei University of Technology
Richang Hong
Richang Hong
Hefei University of Technology
MultimediaPattern Recognition