Plasma Shape Control via Zero-shot Generative Reinforcement Learning

📅 2025-10-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Conventional PID controllers exhibit poor adaptability in plasma shape control, while existing reinforcement learning (RL) methods suffer from weak generalization and require task-specific retraining. Method: This paper proposes a zero-shot generative RL framework leveraging large-scale offline PID demonstration data. It innovatively integrates Generative Adversarial Imitation Learning (GAIL) with Hilbert-space representation learning to embed geometric structure into the latent space, enabling zero-shot cross-trajectory policy transfer without online fine-tuning. Contribution/Results: The framework achieves stable and precise tracking of diverse plasma shape reference trajectories on the HL-3 tokamak simulator. It significantly enhances control flexibility and historical data utilization efficiency, offering a scalable, robust paradigm for intelligent control of fusion devices.

Technology Category

Application Category

📝 Abstract
Traditional PID controllers have limited adaptability for plasma shape control, and task-specific reinforcement learning (RL) methods suffer from limited generalization and the need for repetitive retraining. To overcome these challenges, this paper proposes a novel framework for developing a versatile, zero-shot control policy from a large-scale offline dataset of historical PID-controlled discharges. Our approach synergistically combines Generative Adversarial Imitation Learning (GAIL) with Hilbert space representation learning to achieve dual objectives: mimicking the stable operational style of the PID data and constructing a geometrically structured latent space for efficient, goal-directed control. The resulting foundation policy can be deployed for diverse trajectory tracking tasks in a zero-shot manner without any task-specific fine-tuning. Evaluations on the HL-3 tokamak simulator demonstrate that the policy excels at precisely and stably tracking reference trajectories for key shape parameters across a range of plasma scenarios. This work presents a viable pathway toward developing highly flexible and data-efficient intelligent control systems for future fusion reactors.
Problem

Research questions and friction points this paper is trying to address.

Developing versatile plasma shape control without retraining
Overcoming limited generalization in traditional control methods
Creating zero-shot policy for diverse trajectory tracking tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Zero-shot generative reinforcement learning for plasma control
Combining GAIL with Hilbert space representation learning
Foundation policy enables diverse tracking without fine-tuning
🔎 Similar Papers
No similar papers found.
N
Niannian Wu
Zhejiang University, Hangzhou 310058, China
Rongpeng Li
Rongpeng Li
Zhejiang University
Multi-Agent CommunicationsNetGPTMARLNetwork SlicingAI for Fusion
Z
Zongyu Yang
Southwestern Institute of Physics, Chengdu 610043, China
Y
Yong Xiao
Zhejiang University, Hangzhou 310058, China
N
Ning Wei
Zhejiang Lab, Hangzhou 311500, China
Y
Yihang Chen
Southwestern Institute of Physics, Chengdu 610043, China
B
Bo Li
Southwestern Institute of Physics, Chengdu 610043, China
Z
Zhifeng Zhao
Zhejiang Lab, Hangzhou 311500, China
W
Wulyu Zhong
Southwestern Institute of Physics, Chengdu 610043, China