Curate-Train-Refine: A Closed-Loop Agentic Framework for Zero Shot Classification

📅 2026-01-23

📈 Citations: 0

✨ Influential: 0

career value

166K/year

🤖 AI Summary

This work addresses the high inference cost and latency that hinder the deployment of large language models (LLMs) in zero-shot text classification. To overcome this bottleneck, the authors propose a closed-loop agent framework that leverages the LLM as an intelligent data curator. Within this framework, the LLM dynamically generates training examples, performs error analysis on a lightweight classifier, and synthesizes targeted samples to iteratively refine the training data. Evaluated on four mainstream benchmarks, the approach significantly outperforms standard zero-shot and few-shot baselines, achieving high accuracy while substantially improving inference efficiency. This enables practical, deployable zero-shot classification without sacrificing performance.

Technology Category

Application Category

📝 Abstract

Large language models (LLMs) and high-capacity encoders have advanced zero and few-shot classification, but their inference cost and latency limit practical deployment. We propose training lightweight text classifiers using dynamically generated supervision from an LLM. Our method employs an iterative, agentic loop in which the LLM curates training data, analyzes model successes and failures, and synthesizes targeted examples to address observed errors. This closed-loop generation and evaluation process progressively improves data quality and adapts it to the downstream classifier and task. Across four widely used benchmarks, our approach consistently outperforms standard zero and few-shot baselines. These results indicate that LLMs can serve effectively as data curators, enabling accurate and efficient classification without the operational cost of large-model deployment.

Problem

Research questions and friction points this paper is trying to address.

zero-shot classification

large language models

inference cost

latency

lightweight classifiers

Innovation

Methods, ideas, or system contributions that make the work stand out.

closed-loop agentic framework

zero-shot classification

LLM-as-curator