🤖 AI Summary
Current AutoML tools lack end-to-end automation for text classification, particularly in embedding model selection, classifier hyperparameter optimization, decision threshold tuning, multi-label classification, and out-of-distribution (OOD) detection. To address these gaps, we propose a modular, scikit-learn–compatible automated text classification framework. Our method unifies embedding selection, classifier optimization, and dynamic threshold tuning into a single end-to-end pipeline—the first to natively support both multi-label classification and intent-out-of-scope (OOD) detection. The framework enables configurable accuracy-efficiency trade-offs, enhancing adaptability and deployment flexibility. Extensive experiments on standard intent classification benchmarks demonstrate that our approach consistently outperforms state-of-the-art AutoML tools in accuracy, macro-F1 score, and OOD detection performance. The design ensures practical usability, scalability, and seamless integration into existing ML workflows.
📝 Abstract
AutoIntent is an automated machine learning tool for text classification tasks. Unlike existing solutions, AutoIntent offers end-to-end automation with embedding model selection, classifier optimization, and decision threshold tuning, all within a modular, sklearn-like interface. The framework is designed to support multi-label classification and out-of-scope detection. AutoIntent demonstrates superior performance compared to existing AutoML tools on standard intent classification datasets and enables users to balance effectiveness and resource consumption.