A Production Scheduling Framework for Reinforcement Learning Under Real-World Constraints

📅 2025-06-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Real-world production scheduling confronts multiple dynamic constraints—including material handling logistics, buffer capacity limits, machine failures, setup times, and stochastic processing durations—posing challenges for both modeling generality and evaluation fairness in conventional methods and existing RL frameworks. This paper proposes the first modular, customizable RL-based scheduling framework that uniformly models heterogeneous uncertainties, supports multi-objective optimization, and enables flexible scenario configuration. Leveraging a high-fidelity discrete-event simulation environment as a digital twin, the framework integrates PPO/SAC algorithms, multi-objective reward shaping, enhanced state encoding, and robust training strategies. Evaluated using the open-source toolkit JobShopLab, the proposed RL policies achieve, under diverse disturbances, an average 18.7% reduction in weighted tardiness and a 12.3% improvement in overall equipment effectiveness (OEE) compared to rule-based schedulers.

Technology Category

Application Category

📝 Abstract
The classical Job Shop Scheduling Problem (JSSP) focuses on optimizing makespan under deterministic constraints. Real-world production environments introduce additional complexities that cause traditional scheduling approaches to be less effective. Reinforcement learning (RL) holds potential in addressing these challenges, as it allows agents to learn adaptive scheduling strategies. However, there is a lack of a comprehensive, general-purpose frameworks for effectively training and evaluating RL agents under real-world constraints. To address this gap, we propose a modular framework that extends classical JSSP formulations by incorporating key mbox{real-world} constraints inherent to the shopfloor, including transport logistics, buffer management, machine breakdowns, setup times, and stochastic processing conditions, while also supporting multi-objective optimization. The framework is a customizable solution that offers flexibility in defining problem instances and configuring simulation parameters, enabling adaptation to diverse production scenarios. A standardized interface ensures compatibility with various RL approaches, providing a robust environment for training RL agents and facilitating the standardized comparison of different scheduling methods under dynamic and uncertain conditions. We release JobShopLab as an open-source tool for both research and industrial applications, accessible at: https://github.com/proto-lab-ro/jobshoplab
Problem

Research questions and friction points this paper is trying to address.

Extends JSSP with real-world constraints like transport and breakdowns
Lacks general RL framework for dynamic production scheduling
Needs standardized comparison of scheduling methods under uncertainty
Innovation

Methods, ideas, or system contributions that make the work stand out.

Modular framework for RL scheduling
Incorporates real-world production constraints
Standardized interface for RL compatibility
🔎 Similar Papers
No similar papers found.
J
Jonathan Hoss
Faculty of Management and Engineering, Rosenheim Technical University of Applied Sciences, Germany
F
Felix Schelling
Faculty of Management and Engineering, Rosenheim Technical University of Applied Sciences, Germany
Noah Klarmann
Noah Klarmann
Full Professor, Rosenheim Technical University of Applied Sciences
Artificial IntelligenceMachine LearningReinforcement Learning