Beyond Task-Oriented and Chitchat Dialogues: Proactive and Transition-Aware Conversational Agents

📅 2025-11-11

🏛️ Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

📈 Citations: 0

✨ Influential: 0

career value

193K/year

🤖 AI Summary

Real-world dialogues often require natural transitions between task-oriented (TOD) and open-domain chit-chat modes, yet existing systems lack both modeling and evaluation capabilities for dynamic mode switching. To address this, we introduce TACT—the first benchmark supporting bidirectional, user- or system-initiated mode switching—and propose two novel evaluation metrics: Switch (measuring switching accuracy) and Recovery (assessing resilience after erroneous mode transitions). This enables the first systematic, transition-aware modeling and evaluation of multi-directional dialogue mode migration. Leveraging TACT for training with Direct Preference Optimization (DPO), our model achieves 75.74% accuracy on joint mode-intent recognition. In human evaluations, it outperforms GPT-4o with a 70.1% win rate, significantly surpassing all baselines.

Technology Category

Application Category

📝 Abstract

Conversational agents have traditionally been developed for either task-oriented dialogue (TOD) or open-ended chitchat, with limited progress in unifying the two. Yet, real-world conversations naturally involve fluid transitions between these modes. To address this gap, we introduce TACT (TOD-And-Chitchat Transition), a dataset designed for transition-aware dialogue modeling that incorporates structurally diverse and integrated mode flows. TACT supports both user- and agent-driven mode switches, enabling robust modeling of complex conversational dynamics. To evaluate an agent's ability to initiate and recover from mode transitions, we propose two new metrics -- Switch and Recovery. Models trained on TACT outperform baselines in both intent detection and mode transition handling. Moreover, applying Direct Preference Optimization (DPO) to TACT-trained models yields additional gains, achieving 75.74% joint mode-intent accuracy and a 70.1% win rate against GPT-4o in human evaluation. These results demonstrate that pairing structurally diverse data with DPO enhances response quality and transition control, paving the way for more proactive and transition-aware conversational agents.

Problem

Research questions and friction points this paper is trying to address.

Unifying task-oriented and chitchat dialogues with fluid transitions

Modeling both user-driven and agent-driven conversational mode switches

Enhancing intent detection and transition handling in dialogue systems

Innovation

Methods, ideas, or system contributions that make the work stand out.

Dataset TACT enables TOD-chitchat transition modeling

Introduces Switch and Recovery metrics for evaluation

Applies DPO to enhance response quality and transitions

🔎 Similar Papers

No similar papers found.