π€ AI Summary
Job Shop Scheduling (JSP), an NP-hard combinatorial optimization problem, suffers from error accumulation in conventional sequential decision-making approaches, degrading solution quality. This paper proposes the first self-evaluation-driven parallel solving framework for JSP: it abandons incremental construction and instead jointly models taskβresource dependencies via a heterogeneous graph neural network integrated with a Transformer, enabling concurrent learning of policy generation and solution quality assessment. A novel self-evaluation mechanism enables parallel generation and filtering of task-assignment subsets, optimized end-to-end via reinforcement learning. Evaluated on multiple classical benchmarks, the method achieves average makespan improvements of 3.2%β7.8% over state-of-the-art approaches, demonstrating the effectiveness, robustness, and generalizability of the self-evaluation paradigm in combinatorial optimization.
π Abstract
Combinatorial optimization problems, such as scheduling and route planning, are crucial in various industries but are computationally intractable due to their NP-hard nature. Neural Combinatorial Optimization methods leverage machine learning to address these challenges but often depend on sequential decision-making, which is prone to error accumulation as small mistakes propagate throughout the process. Inspired by self-evaluation techniques in Large Language Models, we propose a novel framework that generates and evaluates subsets of assignments, moving beyond traditional stepwise approaches. Applied to the Job-Shop Scheduling Problem, our method integrates a heterogeneous graph neural network with a Transformer to build a policy model and a self-evaluation function. Experimental validation on challenging, well-known benchmarks demonstrates the effectiveness of our approach, surpassing state-of-the-art methods.