Coset Ensemble Decoder for Quantum Error Correction with Algorithm-Hardware Co-Design

πŸ“… 2026-06-09
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Fault-tolerant quantum computing demands decoders that balance high logical accuracy with ultra-low latencyβ€”a trade-off existing approaches struggle to achieve. This work proposes a co-designed algorithm-hardware quantum error correction decoding scheme that introduces, for the first time, a coset ensemble decoding mechanism. It integrates coset-consistent candidate generation, ensemble forest exploration, reverse-order elimination, and lossless graph compression to enhance decoding accuracy while reducing algorithmic complexity. Concurrently, a time-multiplexed hardware architecture is developed, featuring multi-bank memory hashing and hierarchical ID mapping to prevent linear resource scaling with code distance. Evaluated under a circuit-level depolarizing noise model, the proposed approach significantly improves the accuracy-latency trade-off compared to MWPM and Union-Find decoders, achieving up to an 8.2Γ— reduction in FPGA LUT resource consumption.
πŸ“ Abstract
Reliable large-scale quantum computation relies on fault-tolerant architectures, where quantum error correction (QEC) continuously extracts and decodes error syndromes in real time. A critical component in QEC is the decoder, a classical subsystem that must simultaneously deliver high logical accuracy and ultra-low latency. This paper presents a novel algorithm-hardware co-design that improves the accuracy-latency trade-off over existing approaches such as vanilla Minimum-Weight Perfect Matching (MWPM) and Union-Find (UF) decoders. At the algorithmic level, we introduce coset ensemble decoding, which improves UF decoding by explicitly exploiting logically equivalent cosets. Our method performs ensemble forest exploration to generate multiple coset-consistent candidates and aggregates them to approximate coset-level maximum-likelihood decoding. We further reduce computational and memory complexity via reverse-order elimination and lossless graph compression, without sacrificing accuracy. At the hardware level, we design a domain-specific architecture that temporally reuses resources, avoiding the code-distance-proportional resource growth in prior spatial architectures. Several optimizations, such as multi-bank memory hashing and hierarchical ID mapping, are proposed to mitigate pipeline stalls and memory conflicts under highly concurrent access patterns. Under a circuit-level depolarizing noise model, our co-design approach achieves a better accuracy-latency trade-off than prior MWPM- and UF-based decoders, while reducing FPGA LUT consumption by up to 8.2 times compared with reported UF-based decoder resources. The tunable candidate number further exposes a flexible design knob, enabling users to tailor decoding performance to the requirements of different fault-tolerant workloads. Our implementation is publicly available at https://github.com/IMSeonL/coset-ensemble-decoder.
Problem

Research questions and friction points this paper is trying to address.

quantum error correction
decoder
accuracy-latency trade-off
fault-tolerant quantum computation
real-time decoding
Innovation

Methods, ideas, or system contributions that make the work stand out.

coset ensemble decoding
algorithm-hardware co-design
quantum error correction
domain-specific architecture
Union-Find decoder
S
Shuang Liang
Imperial College London, United Kingdom
J
Jubo Xu
Imperial College London, United Kingdom
G
Giulio Bassanino
Imperial College London, United Kingdom
Q
Qianzhou Wang
Imperial College London, United Kingdom
Yidong Zhou
Yidong Zhou
University of California, Davis
Causal InferenceDeep LearningFunctional Data AnalysisRandom ObjectsNonparametric Statistics
Y
Yuncheng Lu
Imperial College London, United Kingdom
Zhiwen Mo
Zhiwen Mo
Imperial College London
GPU ArchitecturePerformance ModelingDataflow Schedule
P
Paul H. J. Kelly
Imperial College London, United Kingdom
Bo Yuan
Bo Yuan
PhD Student in Machine Learning, Georgia Institute of Technology
Markov chain Monte CarloLarge Language Model
Wayne Luk
Wayne Luk
Professor of Computer Engineering, Imperial College London
Hardware and ArchitectutreReconfigurable ComputingDesign Automation
H
Hongxiang Fan
Imperial College London, United Kingdom