🤖 AI Summary
To address the high computational cost and overfitting risks of conventional AutoML stemming from global search over fixed, exhaustive search spaces, this paper proposes a meta-learning–based dynamic search space optimization method. It leverages historical task metadata to adaptively prune both model selection and hyperparameter search spaces, enabling dynamic pipeline construction. The approach innovatively integrates meta-feature selection with an interpretable meta-model, balancing search efficiency and generalization capability. Integrated into the Auto-Sklearn framework, empirical evaluation demonstrates an 89% reduction in random search runtime; the preprocessor and classifier search spaces are compressed to 13.8% and 26.9% of their original sizes, respectively, while maintaining state-of-the-art predictive performance. Overall, the method significantly enhances AutoML’s computational efficiency, robustness, and interpretability.
📝 Abstract
Automated machine learning (AutoML) has democratized the design of machine learning based systems, by automating model selection, hyperparameter tuning and feature engineering. However, the high computational cost associated with traditional search and optimization strategies, such as Random Search, Particle Swarm Optimization and Bayesian Optimization, remains a significant challenge. Moreover, AutoML systems typically explore a large search space, which can lead to overfitting. This paper introduces a metalearning method for dynamically designing search spaces for AutoML system. The proposed method uses historical metaknowledge to select promising regions of the search space, accelerating the optimization process. According to experiments conducted for this study, the proposed method can reduce runtime by 89% in Random Search and search space by (1.8/13 preprocessor and 4.3/16 classifier), without compromising significant predictive performance. Moreover, the proposed method showed competitive performance when adapted to Auto-Sklearn, reducing its search space. Furthermore, this study encompasses insights into meta-feature selection, meta-model explainability, and the trade-offs inherent in search space reduction strategies.