LLM-Guided Lifecycle-Aware Clustering of Multi-Turn Customer Support Conversations

📅 2026-01-07

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

170K/year

🤖 AI Summary

This work addresses the limitations of traditional clustering methods in multi-turn customer support dialogues, which often yield static, overly broad clusters that degrade over time, while full reclustering disrupts continuity in issue tracking. To overcome this, the authors propose a lifecycle-aware incremental clustering mechanism that first segments dialogues by service focus and then leverages a large language model (LLM) to dynamically split quality-degraded clusters without requiring global reclustering. The approach significantly enhances clustering quality, achieving over a 100% improvement in silhouette coefficient and a 65.6% reduction in Davies–Bouldin index. It effectively balances timeliness, coherence, and scalability, enabling real-time analysis of customer conversations.

Technology Category

Application Category

📝 Abstract

Clustering customer chat data is vital for cloud providers handling multi service queries. Traditional methods struggle with overlapping concerns and create broad, static clusters that degrade over time. Reclustering disrupts continuity, making issue tracking difficult. We propose an adaptive system that segments multi turn chats into service specific concerns and incrementally refines clusters as new issues arise. Cluster quality is tracked via DaviesBouldin Index and Silhouette Scores, with LLM based splitting applied only to degraded clusters. Our method improves Silhouette Scores by over 100\% and reduces DBI by 65.6\% compared to baselines, enabling scalable, real time analytics without full reclustering.

Problem

Research questions and friction points this paper is trying to address.

clustering

customer support conversations

multi-turn dialogues

lifecycle-aware

overlap concerns

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-guided clustering

lifecycle-aware

incremental clustering