Scalable On-Hardware Training of Quantum Neural Networks and Application to Clinical Data Imputation

📅 2026-06-02
📈 Citations: 0
Influential: 0
📄 PDF

career value

202K/year
🤖 AI Summary
This work addresses the scalability bottleneck in hardware-based training of quantum neural networks (QNNs), where conventional parameter-shift rules incur O(n²) circuit evaluations per optimization step. The authors propose an efficient training framework that integrates a structured butterfly circuit architecture—featuring O(n log n) parameters and logarithmic depth—with a layerwise optimization strategy and a parallelized parameter-shift rule leveraging intra-layer commutativity. This combination reduces the per-optimization circuit evaluation complexity to O(log n), marking the first hardware-native QNN training method achieving logarithmic evaluation complexity. The approach is successfully deployed on an IonQ Forte Enterprise trapped-ion device: a 16-qubit model performs clinical data imputation on the MIMIC-III dataset, matching or exceeding classical baselines with lower variance and superior performance in patient survival prediction; furthermore, 32-qubit inference is demonstrated on real hardware.
📝 Abstract
Training quantum neural networks (QNNs) on quantum hardware is currently bottlenecked by the cost of gradient estimation: standard parameter-shift methods require a number of circuit evaluations that grows quadratically with the number of trainable parameters, making hardware-based optimisation impractical beyond small system sizes. In this work, we introduce a training framework that reduces this cost to logarithmic in the number of qubits, making gradient-based QNN optimisation feasible on near-term hardware at increasing scales. Our framework combines three co-designed ingredients: (i) a structured, subspace-preserving Butterfly circuit architecture with $O(n \log n)$ parameters and logarithmic depth; (ii) a layer-wise training strategy that confines on-hardware optimisation to one small, well-structured layer at a time; and (iii) a parallelised parameter-shift rule that exploits the commuting structure within each Butterfly layer to extract all gradients in a constant number of circuit executions. Together these reduce the number of distinct circuit evaluations per optimisation step from $O(n^2)$ to $O(\log n)$. We validate the framework on clinical data imputation using the MIMIC-III electronic health record dataset, a demanding benchmark sensitive to optimisation instability and model variance. Hybrid classical-quantum models are trained directly on IonQ Forte Enterprise trapped-ion hardware at 16 qubits without performance degradation relative to ideal or noisy simulation and via tensor-network simulation at 32 qubits, with 32-qubit inference executed on hardware. The resulting models match or exceed strong classical neural baselines in downstream patient survival prediction while exhibiting reduced variance across runs, demonstrating that the proposed framework enables practical, scalable QNN training under realistic hardware constraints.
Problem

Research questions and friction points this paper is trying to address.

quantum neural networks
gradient estimation
quantum hardware
parameter-shift rule
scalable training
Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantum Neural Networks
On-Hardware Training
Butterfly Circuit
Parameter-Shift Rule
Scalable Quantum Machine Learning
🔎 Similar Papers
No similar papers found.