🤖 AI Summary
To address the poor generalization and limited practicality of task-oriented dialogue (TOD) systems in low-resource settings—where annotated data is scarce—this paper proposes a lightweight end-to-end framework. It introduces an explicit task-instruction injection mechanism that encodes domain knowledge and task logic into structured, human-readable instructions, and integrates these with instruction-tuned lightweight large language models (LLMs) to jointly perform semantic parsing and response generation under few-shot or even zero-shot conditions. The approach drastically reduces reliance on manually annotated data: on the MultiWOZ benchmark, it achieves state-of-the-art performance using less than 10% of the full training annotations while improving inference efficiency by 40%. Its core contribution lies in being the first to synergistically combine explicit instruction-driven modeling with domain-specialized lightweight LLMs for low-resource TOD—thereby balancing high performance, computational efficiency, and deployability.
📝 Abstract
Task-oriented dialogue (TOD) systems facilitate goal-driven interactions between users and machines. While recent advances in deep learning have improved the performance, TOD systems often struggle in low-resource scenarios with limited labeled data. To address this challenge, we propose Spec-TOD, a novel framework designed to train an end-to-end TOD system with limited data. Spec-TOD introduces two main innovations: (i) a novel specialized end-to-end TOD framework that incorporates explicit task instructions for instruction-tuned large language models (LLMs), and (ii) an efficient training strategy that leverages lightweight, specialized LLMs to achieve strong performance with minimal supervision. Experiments on the MultiWOZ dataset, a widely used TOD benchmark, demonstrate that Spec-TOD achieves competitive results while significantly reducing the need for labeled data. These findings highlight the potential of the proposed framework in advancing efficient and effective TOD systems in low-resource settings.