AutoIntent: AutoML for Text Classification

📅 2025-09-25

📈 Citations: 0

✨ Influential: 0

career value

155K/year

🤖 AI Summary

Current AutoML tools lack end-to-end automation for text classification, particularly in embedding model selection, classifier hyperparameter optimization, decision threshold tuning, multi-label classification, and out-of-distribution (OOD) detection. To address these gaps, we propose a modular, scikit-learn–compatible automated text classification framework. Our method unifies embedding selection, classifier optimization, and dynamic threshold tuning into a single end-to-end pipeline—the first to natively support both multi-label classification and intent-out-of-scope (OOD) detection. The framework enables configurable accuracy-efficiency trade-offs, enhancing adaptability and deployment flexibility. Extensive experiments on standard intent classification benchmarks demonstrate that our approach consistently outperforms state-of-the-art AutoML tools in accuracy, macro-F1 score, and OOD detection performance. The design ensures practical usability, scalability, and seamless integration into existing ML workflows.

Technology Category

Application Category

📝 Abstract

AutoIntent is an automated machine learning tool for text classification tasks. Unlike existing solutions, AutoIntent offers end-to-end automation with embedding model selection, classifier optimization, and decision threshold tuning, all within a modular, sklearn-like interface. The framework is designed to support multi-label classification and out-of-scope detection. AutoIntent demonstrates superior performance compared to existing AutoML tools on standard intent classification datasets and enables users to balance effectiveness and resource consumption.

Problem

Research questions and friction points this paper is trying to address.

Automating end-to-end text classification with model selection and optimization

Supporting multi-label classification and out-of-scope detection capabilities

Balancing classification effectiveness with computational resource consumption

Innovation

Methods, ideas, or system contributions that make the work stand out.

Automated embedding selection and classifier optimization

Modular sklearn-like interface for end-to-end automation

Multi-label classification with out-of-scope detection capability

🔎 Similar Papers

No similar papers found.