🤖 AI Summary
Prostate cryoablation needle placement planning relies heavily on expert experience, resulting in time-consuming workflows, poor reproducibility, and limited clinical scalability. This work formulates the task as a Markov Decision Process and proposes the first fully automated planning method based on deep reinforcement learning: it requires no expert-annotated data and autonomously optimizes probe configurations in simulation by jointly incorporating anatomical constraints and intraoperative uncertainty—maximizing tumor coverage (Dice score) while minimizing damage to critical structures. Evaluated on 583 retrospective cases, our method achieves a Dice score 8.2 percentage points higher than a geometric optimization baseline, matching expert performance, with planning time reduced to seconds per case. The core contribution is the development of the first end-to-end reinforcement learning framework tailored for cryoablation planning, significantly enhancing automation, safety, and clinical applicability.
📝 Abstract
Cryoablation is a minimally invasive localised treatment for prostate cancer that destroys malignant tissue during de-freezing, while sparing surrounding healthy structures. Its success depends on accurate preoperative planning of cryoprobe placements to fully cover the tumour and avoid critical anatomy. This planning is currently manual, expertise-dependent, and time-consuming, leading to variability in treatment quality and limited scalability. In this work, we introduce Cryo-RL, a reinforcement learning framework that models cryoablation planning as a Markov decision process and learns an optimal policy for cryoprobe placement. Within a simulated environment that models clinical constraints and stochastic intraoperative variability, an agent sequentially selects cryoprobe positions and ice sphere diameters. Guided by a reward function based on tumour coverage, this agent learns a cryoablation strategy that leads to optimal cryoprobe placements without the need for any manually-designed plans. Evaluated on 583 retrospective prostate cancer cases, Cryo-RL achieved over 8 percentage-point Dice improvements compared with the best automated baselines, based on geometric optimisation, and matched human expert performance while requiring substantially less planning time. These results highlight the potential of reinforcement learning to deliver clinically viable, reproducible, and efficient cryoablation plans.