Autonomous Resource Management in Microservice Systems via Reinforcement Learning

📅 2025-07-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address resource imbalance, high latency, low throughput, and challenges in coordinated scheduling of multi-dimensional resources (e.g., CPU, memory, storage) in microservice systems under scale expansion and dynamic workloads, this paper proposes a reinforcement learning–based multi-objective adaptive resource scheduling framework. The framework integrates dynamic workload awareness with joint modeling of heterogeneous resources, enabling continuous policy optimization through environment interaction to achieve fine-grained, real-time, and scenario-adaptive autonomous resource management. Experimental results demonstrate that, compared to static and heuristic approaches, the proposed method reduces average response latency by 32.7%, increases throughput by 28.4%, improves resource utilization by 21.5%, and decreases energy consumption by 19.3% across both high- and low-load scenarios. Moreover, it supports simultaneous optimization of latency, throughput, and energy efficiency—achieving true multi-objective coordination.

Technology Category

Application Category

📝 Abstract
This paper proposes a reinforcement learning-based method for microservice resource scheduling and optimization, aiming to address issues such as uneven resource allocation, high latency, and insufficient throughput in traditional microservice architectures. In microservice systems, as the number of services and the load increase, efficiently scheduling and allocating resources such as computing power, memory, and storage becomes a critical research challenge. To address this, the paper employs an intelligent scheduling algorithm based on reinforcement learning. Through the interaction between the agent and the environment, the resource allocation strategy is continuously optimized. In the experiments, the paper considers different resource conditions and load scenarios, evaluating the proposed method across multiple dimensions, including response time, throughput, resource utilization, and cost efficiency. The experimental results show that the reinforcement learning-based scheduling method significantly improves system response speed and throughput under low load and high concurrency conditions, while also optimizing resource utilization and reducing energy consumption. Under multi-dimensional resource conditions, the proposed method can consider multiple objectives and achieve optimized resource scheduling. Compared to traditional static resource allocation methods, the reinforcement learning model demonstrates stronger adaptability and optimization capability. It can adjust resource allocation strategies in real time, thereby maintaining good system performance in dynamically changing load and resource environments.
Problem

Research questions and friction points this paper is trying to address.

Optimize microservice resource allocation using reinforcement learning
Improve system performance under varying load conditions
Reduce latency and energy consumption in dynamic environments
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement learning optimizes microservice resource allocation
Intelligent scheduling adapts to dynamic load conditions
Multi-objective optimization improves performance and efficiency
🔎 Similar Papers
No similar papers found.
Yujun Zou
Yujun Zou
University of California, Berkeley
Large Language ModelsAI Agent
N
Nia Qi
Independent Author
Y
Yingnan Deng
Georgia Institute of Technology
Z
Zhihao Xue
Rose-Hulman Institute of Technology
Ming Gong
Ming Gong
Key laboratory of quantum information, USTC
quantum informationquantum dottopological quantum phase transitionultracold atomsFFLO
W
Wuyang Zhang
University of Massachusetts Amherst