CHARME: A chain-based reinforcement learning approach for the minor embedding problem

📅 2024-06-11

🏛️ arXiv.org

📈 Citations: 2

✨ Influential: 0

🤖 AI Summary

Quantum annealing for combinatorial optimization is hindered by minor embedding—a computationally intractable (NP-hard) mapping of logical graphs onto the restricted hardware topology of quantum processing units (QPUs). This paper introduces the first reinforcement learning–based chain embedding framework, integrating a graph neural network (GNN)-driven policy model, an explicit feasibility-preserving constraint mechanism during state transitions, and an order-guided efficient exploration strategy. Compared to fast heuristic methods (e.g., Minorminer, ATOM), our approach achieves significantly higher embedding success rates and solution quality on both synthetic and real-world problem instances. Against high-quality but computationally expensive methods (e.g., OCT), it substantially reduces runtime while maintaining comparable embedding quality. Moreover, our training efficiency markedly surpasses existing deep reinforcement learning–based embedding approaches, effectively alleviating scalability bottlenecks for large-scale problems.

Technology Category

Application Category

📝 Abstract

Quantum Annealing (QA) holds great potential for solving combinatorial optimization problems efficiently. However, the effectiveness of QA algorithms heavily relies on the embedding of problem instances, represented as logical graphs, into the quantum unit processing (QPU) whose topology is in form of a limited connectivity graph, known as the minor embedding Problem. Existing methods for the minor embedding problem suffer from scalability issues when confronted with larger problem sizes. In this paper, we propose a novel approach utilizing Reinforcement Learning (RL) techniques to address the minor embedding problem, named CHARME. CHARME includes three key components: a Graph Neural Network (GNN) architecture for policy modeling, a state transition algorithm ensuring solution validity, and an order exploration strategy for effective training. Through comprehensive experiments on synthetic and real-world instances, we demonstrate that the efficiency of our proposed order exploration strategy as well as our proposed RL framework, CHARME. In details, CHARME yields superior solutions compared to fast embedding methods such as Minorminer and ATOM. Moreover, our method surpasses the OCT-based approach, known for its slower runtime but high-quality solutions, in several cases. In addition, our proposed exploration enhances the efficiency of the training of the CHARME framework by providing better solutions compared to the greedy strategy.

Problem

Research questions and friction points this paper is trying to address.

Solving NP-hard minor embedding for quantum annealing

Reducing qubit usage in quantum optimization problems

Improving scalability of quantum algorithm embeddings

Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement learning for minor embedding problem

Graph neural network for policy modeling

State transition ensuring solution validity

🔎 Similar Papers

No similar papers found.

Authors to Follow