Curate-Train-Refine: A Closed-Loop Agentic Framework for Zero Shot Classification

📅 2026-01-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the high inference cost and latency that hinder the deployment of large language models (LLMs) in zero-shot text classification. To overcome this bottleneck, the authors propose a closed-loop agent framework that leverages the LLM as an intelligent data curator. Within this framework, the LLM dynamically generates training examples, performs error analysis on a lightweight classifier, and synthesizes targeted samples to iteratively refine the training data. Evaluated on four mainstream benchmarks, the approach significantly outperforms standard zero-shot and few-shot baselines, achieving high accuracy while substantially improving inference efficiency. This enables practical, deployable zero-shot classification without sacrificing performance.

Technology Category

Application Category

📝 Abstract
Large language models (LLMs) and high-capacity encoders have advanced zero and few-shot classification, but their inference cost and latency limit practical deployment. We propose training lightweight text classifiers using dynamically generated supervision from an LLM. Our method employs an iterative, agentic loop in which the LLM curates training data, analyzes model successes and failures, and synthesizes targeted examples to address observed errors. This closed-loop generation and evaluation process progressively improves data quality and adapts it to the downstream classifier and task. Across four widely used benchmarks, our approach consistently outperforms standard zero and few-shot baselines. These results indicate that LLMs can serve effectively as data curators, enabling accurate and efficient classification without the operational cost of large-model deployment.
Problem

Research questions and friction points this paper is trying to address.

zero-shot classification
large language models
inference cost
latency
lightweight classifiers
Innovation

Methods, ideas, or system contributions that make the work stand out.

closed-loop agentic framework
zero-shot classification
LLM-as-curator
dynamic data synthesis
lightweight classifier
🔎 Similar Papers
No similar papers found.