Dynamic Design of Machine Learning Pipelines via Metalearning

📅 2025-08-18

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Conventional AutoML searches a fixed, exhaustive search space, which is computationally expensive and can lead to overfitting. This paper proposes a metalearning-based method that designs the search space dynamically: it uses metaknowledge from historical tasks to select promising regions of the search space and so speeds up optimization. The study also examines meta-feature selection, meta-model explainability, and the trade-offs of search space reduction. In the reported experiments, the method reduces Random Search runtime by 89% and shrinks the preprocessor and classifier search spaces to 13.8% and 26.9% of their original sizes, respectively, without a significant loss in predictive performance. When adapted to Auto-Sklearn, it shows competitive performance with a reduced search space.
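
The two search space percentages can be checked against the fractions given in the abstract (1.8/13 preprocessors and 4.3/16 classifiers). Reading those fractions as the number of components retained out of the total available is an interpretation, not something the source states explicitly:

```latex
\frac{1.8}{13} \approx 0.138 = 13.8\%,
\qquad
\frac{4.3}{16} \approx 0.269 = 26.9\%
```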

📝 Abstract

Automated machine learning (AutoML) has democratized the design of machine-learning-based systems by automating model selection, hyperparameter tuning, and feature engineering. However, the high computational cost of traditional search and optimization strategies, such as Random Search, Particle Swarm Optimization, and Bayesian Optimization, remains a significant challenge. Moreover, AutoML systems typically explore a large search space, which can lead to overfitting. This paper introduces a metalearning method for dynamically designing search spaces for AutoML systems. The proposed method uses historical metaknowledge to select promising regions of the search space, accelerating the optimization process. In the experiments conducted for this study, the proposed method reduced Random Search runtime by 89% and reduced the search space (to 1.8/13 preprocessors and 4.3/16 classifiers) without significantly compromising predictive performance. The proposed method also showed competitive performance when adapted to Auto-Sklearn, reducing its search space. Finally, the study offers insights into meta-feature selection, meta-model explainability, and the trade-offs inherent in search space reduction strategies.
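
To make the idea of selecting promising regions of the search space from historical metaknowledge concrete, here is a minimal sketch. It is not the paper's method: the abstract does not describe the meta-model, so the sketch swaps in a simple nearest-neighbour vote over meta-features. The names `HISTORY`, `FULL_SPACE`, and `reduce_search_space`, the two-dimensional meta-features, and all values are hypothetical.

```python
import math
from collections import Counter

# Hypothetical metaknowledge: (meta-features of a past dataset,
# classifiers that performed well on it). Illustrative values only.
HISTORY = [
    ((0.10, 0.20), {"random_forest", "gradient_boosting"}),
    ((0.15, 0.25), {"random_forest", "extra_trees"}),
    ((0.90, 0.80), {"linear_svm", "logistic_regression"}),
    ((0.85, 0.75), {"logistic_regression", "naive_bayes"}),
    ((0.20, 0.10), {"gradient_boosting", "random_forest"}),
]

# Hypothetical full classifier search space.
FULL_SPACE = [
    "decision_tree", "extra_trees", "gradient_boosting", "knn",
    "linear_svm", "logistic_regression", "naive_bayes", "random_forest",
]


def reduce_search_space(meta_features, history, full_space, k=3, min_votes=2):
    """Keep components that did well on at least `min_votes` of the
    `k` past tasks whose meta-features are closest to the new task."""
    nearest = sorted(
        history, key=lambda item: math.dist(item[0], meta_features)
    )[:k]
    votes = Counter(name for _, good in nearest for name in good)
    kept = sorted(name for name, n in votes.items() if n >= min_votes)
    # If no component gathers enough votes, fall back to the full space.
    return kept or list(full_space)


reduced = reduce_search_space((0.12, 0.22), HISTORY, FULL_SPACE)
print(reduced)  # ['gradient_boosting', 'random_forest']
```

An optimizer such as Random Search would then sample only from `reduced` instead of `FULL_SPACE`. The fallback to the full space is a design choice of this sketch, meant to avoid an empty search space when the historical tasks disagree.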

Problem

Research questions and friction points this paper is trying to address.

Reducing AutoML computational cost via metalearning

Dynamically designing search spaces to prevent overfitting

Accelerating optimization using historical metaknowledge

Innovation

Methods, ideas, or system contributions that make the work stand out.

Metalearning dynamically designs AutoML search spaces

Uses historical metaknowledge to accelerate the optimization process

Reduces search space size without compromising predictive performance

🔎 Similar Papers

Online Loss Function Learning

2023-01-30 · arXiv.org · Citations: 5

MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters

2024-02-04 · arXiv.org · Citations: 3
