StructMG: A Fast and Scalable Structured Algebraic Multigrid

📅 2025-06-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing multigrid libraries exhibit insufficient speed and scalability for solving large-scale linear systems arising from structured grids. Method: This paper proposes an efficient parallel algebraic multigrid (AMG) preconditioner designed according to three principles: low computational overhead, strong convergence, and high parallelizability. It introduces the first stencil-based, symbolically derived triple-matrix multiplication for Galerkin coarsening, and develops a dependency-preserving unified parallel framework for sparse triangular solves, optimized across ARM and x86 architectures. Contribution/Results: Experiments on real-world applications—including radiation hydrodynamics—demonstrate that the proposed preconditioner achieves average speedups of 7.3×–15.5× over all hypre preconditioners. It significantly improves both strong and weak scaling efficiency, enabling scalable performance on large structured-grid problems.

Technology Category

Application Category

📝 Abstract
Parallel multigrid is widely used as preconditioners in solving large-scale sparse linear systems. However, the current multigrid library still needs more satisfactory performance for structured grid problems regarding speed and scalability. Based on the classical 'multigrid seesaw', we derive three necessary principles for an efficient structured multigrid, which instructs our design and implementation of StructMG, a fast and scalable algebraic multigrid that constructs hierarchical grids automatically. As a preconditioner, StructMG can achieve both low cost per iteration and good convergence when solving large-scale linear systems with iterative methods in parallel. A stencil-based triple-matrix product via symbolic derivation and code generation is proposed for multi-dimensional Galerkin coarsening to reduce grid complexity, operator complexity, and implementation effort. A unified parallel framework of sparse triangular solver is presented to achieve fast convergence and high parallel efficiency for smoothers, including dependence-preserving Gauss-Seidel and incomplete LU methods. Idealized and real-world problems from radiation hydrodynamics, petroleum reservoir simulation, numerical weather prediction, and solid mechanics, are evaluated on ARM and X86 platforms to show StructMG's effectiveness. In comparison to extit{hypre}'s structured and general multigrid preconditioners, StructMG achieves the fastest time-to-solutions in all cases with average speedups of 15.5x, 5.5x, 6.7x, 7.3x over SMG, PFMG, SysPFMG, and BoomerAMG, respectively. StructMG also significantly improves strong and weak scaling efficiencies.
Problem

Research questions and friction points this paper is trying to address.

Improving speed and scalability of structured multigrid methods
Reducing grid and operator complexity in Galerkin coarsening
Enhancing parallel efficiency for smoothers in multigrid preconditioners
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatic hierarchical grid construction for efficiency
Stencil-based triple-matrix product for coarsening
Unified parallel sparse triangular solver framework
🔎 Similar Papers
No similar papers found.
Yi Zong
Yi Zong
School of Computer Science, Fudan University
NLP
P
Peinan Yu
Tsinghua University, Beijing, China
H
Haopeng Huang
Tsinghua University, Beijing, China
Z
Zhengding Hu
University of Science and Technology of China, Hefei, China
X
Xinliang Wang
Huawei Technologies Co., Ltd, Beijing, China
Qin Wang
Qin Wang
ETH Zurich
Domain AdaptationComputer Vision
C
Chensong Zhang
Academy of Mathematics and Systems Science, Beijing, China
X
Xiaowen Xu
Institute of Applied Physics and Computational Mathematics, Beijing, China
J
Jian Sun
CMA Earth System Modeling and Prediction Center, Beijing, China
Y
Yongxiao Zhou
Tsinghua University, Beijing, China
W
Wei Xue
Tsinghua University, Beijing, China