Beyond Task-Oriented and Chitchat Dialogues: Proactive and Transition-Aware Conversational Agents

📅 2025-11-11
🏛️ Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Real-world dialogues often require natural transitions between task-oriented (TOD) and open-domain chit-chat modes, yet existing systems lack both modeling and evaluation capabilities for dynamic mode switching. To address this, we introduce TACT—the first benchmark supporting bidirectional, user- or system-initiated mode switching—and propose two novel evaluation metrics: Switch (measuring switching accuracy) and Recovery (assessing resilience after erroneous mode transitions). This enables the first systematic, transition-aware modeling and evaluation of multi-directional dialogue mode migration. Leveraging TACT for training with Direct Preference Optimization (DPO), our model achieves 75.74% accuracy on joint mode-intent recognition. In human evaluations, it outperforms GPT-4o with a 70.1% win rate, significantly surpassing all baselines.

Technology Category

Application Category

📝 Abstract
Conversational agents have traditionally been developed for either task-oriented dialogue (TOD) or open-ended chitchat, with limited progress in unifying the two. Yet, real-world conversations naturally involve fluid transitions between these modes. To address this gap, we introduce TACT (TOD-And-Chitchat Transition), a dataset designed for transition-aware dialogue modeling that incorporates structurally diverse and integrated mode flows. TACT supports both user- and agent-driven mode switches, enabling robust modeling of complex conversational dynamics. To evaluate an agent's ability to initiate and recover from mode transitions, we propose two new metrics -- Switch and Recovery. Models trained on TACT outperform baselines in both intent detection and mode transition handling. Moreover, applying Direct Preference Optimization (DPO) to TACT-trained models yields additional gains, achieving 75.74% joint mode-intent accuracy and a 70.1% win rate against GPT-4o in human evaluation. These results demonstrate that pairing structurally diverse data with DPO enhances response quality and transition control, paving the way for more proactive and transition-aware conversational agents.
Problem

Research questions and friction points this paper is trying to address.

Unifying task-oriented and chitchat dialogues with fluid transitions
Modeling both user-driven and agent-driven conversational mode switches
Enhancing intent detection and transition handling in dialogue systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dataset TACT enables TOD-chitchat transition modeling
Introduces Switch and Recovery metrics for evaluation
Applies DPO to enhance response quality and transitions
🔎 Similar Papers
No similar papers found.
Y
Yejin Yoon
Hanyang University
Yuri Son
Yuri Son
Hanyang University
N
Namyeong So
Hanyang University
M
Minseo Kim
Hanyang University
M
Minsoo Cho
Hyundai Motor Company
C
Chanhee Park
Hyundai Motor Company
S
Seungshin Lee
Hyundai Motor Company
Taeuk Kim
Taeuk Kim
Assistant Professor, Hanyang University.
Natural Language ProcessingLarge Language ModelsMachine Learning