Scalable On-Hardware Training of Quantum Neural Networks and Application to Clinical Data Imputation

📅 2026-06-02

📈 Citations: 0

✨ Influential: 0

career value

202K/year

🤖 AI Summary

This work addresses the scalability bottleneck in hardware-based training of quantum neural networks (QNNs), where conventional parameter-shift rules incur O(n²) circuit evaluations per optimization step. The authors propose an efficient training framework that integrates a structured butterfly circuit architecture—featuring O(n log n) parameters and logarithmic depth—with a layerwise optimization strategy and a parallelized parameter-shift rule leveraging intra-layer commutativity. This combination reduces the per-optimization circuit evaluation complexity to O(log n), marking the first hardware-native QNN training method achieving logarithmic evaluation complexity. The approach is successfully deployed on an IonQ Forte Enterprise trapped-ion device: a 16-qubit model performs clinical data imputation on the MIMIC-III dataset, matching or exceeding classical baselines with lower variance and superior performance in patient survival prediction; furthermore, 32-qubit inference is demonstrated on real hardware.

📝 Abstract

Training quantum neural networks (QNNs) on quantum hardware is currently bottlenecked by the cost of gradient estimation: standard parameter-shift methods require a number of circuit evaluations that grows quadratically with the number of trainable parameters, making hardware-based optimisation impractical beyond small system sizes. In this work, we introduce a training framework that reduces this cost to logarithmic in the number of qubits, making gradient-based QNN optimisation feasible on near-term hardware at increasing scales. Our framework combines three co-designed ingredients: (i) a structured, subspace-preserving Butterfly circuit architecture with $O(n \log n)$ parameters and logarithmic depth; (ii) a layer-wise training strategy that confines on-hardware optimisation to one small, well-structured layer at a time; and (iii) a parallelised parameter-shift rule that exploits the commuting structure within each Butterfly layer to extract all gradients in a constant number of circuit executions. Together these reduce the number of distinct circuit evaluations per optimisation step from $O(n^2)$ to $O(\log n)$. We validate the framework on clinical data imputation using the MIMIC-III electronic health record dataset, a demanding benchmark sensitive to optimisation instability and model variance. Hybrid classical-quantum models are trained directly on IonQ Forte Enterprise trapped-ion hardware at 16 qubits without performance degradation relative to ideal or noisy simulation and via tensor-network simulation at 32 qubits, with 32-qubit inference executed on hardware. The resulting models match or exceed strong classical neural baselines in downstream patient survival prediction while exhibiting reduced variance across runs, demonstrating that the proposed framework enables practical, scalable QNN training under realistic hardware constraints.

Problem

Research questions and friction points this paper is trying to address.

quantum neural networks

gradient estimation

quantum hardware

parameter-shift rule

scalable training

Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantum Neural Networks

On-Hardware Training

Butterfly Circuit