Minimizing the Weighted Number of Tardy Jobs: Data-Driven Heuristic for Single-Machine Scheduling

📅 2025-08-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the single-machine weighted tardiness minimization problem (1||∑w_jU_j), which seeks to minimize the total weight of tardy jobs. To overcome the significant performance degradation of existing exact algorithms on certain problem instances, we propose a data-driven heuristic approach: leveraging structured features—including job weights, processing times, and due dates—we employ supervised learning to predict critical decision variables, then integrate these predictions with domain-specific heuristics to construct feasible schedules. Systematic model selection and feature engineering enhance generalization across diverse problem distributions. Experimental results demonstrate that our method consistently outperforms state-of-the-art algorithms in three key aspects: average optimality gap, number of provably optimal solutions found, and robustness across heterogeneous data distributions. By bridging machine learning and combinatorial optimization, this work establishes a new paradigm for NP-hard scheduling problems—offering efficiency, interpretability, and practical applicability.

Technology Category

Application Category

📝 Abstract
Existing research on single-machine scheduling is largely focused on exact algorithms, which perform well on typical instances but can significantly deteriorate on certain regions of the problem space. In contrast, data-driven approaches provide strong and scalable performance when tailored to the structure of specific datasets. Leveraging this idea, we focus on a single-machine scheduling problem where each job is defined by its weight, duration, due date, and deadline, aiming to minimize the total weight of tardy jobs. We introduce a novel data-driven scheduling heuristic that combines machine learning with problem-specific characteristics, ensuring feasible solutions, which is a common challenge for ML-based algorithms. Experimental results demonstrate that our approach significantly outperforms the state-of-the-art in terms of optimality gap, number of optimal solutions, and adaptability across varied data scenarios, highlighting its flexibility for practical applications. In addition, we conduct a systematic exploration of ML models, addressing a common gap in similar studies by offering a detailed model selection process and providing insights into why the chosen model is the best fit.
Problem

Research questions and friction points this paper is trying to address.

Minimizing weighted tardy jobs in single-machine scheduling
Developing data-driven heuristic combining ML and scheduling characteristics
Ensuring feasible solutions while outperforming existing algorithms
Innovation

Methods, ideas, or system contributions that make the work stand out.

Data-driven heuristic combining machine learning
Ensures feasible solutions with problem-specific characteristics
Systematic ML model selection for adaptability
🔎 Similar Papers
No similar papers found.
N
Nikolai Antonov
Czech Technical University in Prague, Faculty of Electrical Engineering, Technická 2, Prague, 166 27, Czech Republic
P
Prěmysl Šůcha
Czech Technical University in Prague, CIIRC, Jugoslávských partyzánů 1580/3, Prague, 160 00, Czech Republic
Mikoláš Janota
Mikoláš Janota
CTU Prague
SMTMachine learningQuantifiersFormal Methods
J
Jan Hůla
Czech Technical University in Prague, CIIRC, Jugoslávských partyzánů 1580/3, Prague, 160 00, Czech Republic