đ¤ AI Summary
Emerging exascale supercomputers demand rigorous co-verification of hardware and software stacks, particularly for complex multi-physics applications. Method: This work introduces an automated benchmarking framework built upon the multi-component flow solver MFC, featuring a cross-platform toolchain that enables input generation, automatic compilation across Intel, Cray, NVIDIA, AMD, and GNU compilers, heterogeneous job scheduling, and fine-grained performance profilingâusing time-to-solution per grid point per time step as the primary metricâacross diverse CPU and five generations of NVIDIA and three generations of AMD GPU architectures. Contribution/Results: The framework lowers barriers to compilerâhardware co-validation while enabling high reuse and joint correctnessâperformance assessment. It has been deployed on over 50 systems, including five flagship exascale platforms (e.g., Frontier, El Capitan), uncovering multiple previously unreported compiler bugs and performance regressionsâproviding empirical evidence to enhance reliability and optimization in the HPC ecosystem.
đ Abstract
Deploying new supercomputers requires testing and evaluation via application codes. Portable, user-friendly tools enable evaluation, and the Multicomponent Flow Code (MFC), a computational fluid dynamics (CFD) code, addresses this need. MFC is adorned with a toolchain that automates input generation, compilation, batch job submission, regression testing, and benchmarking. The toolchain design enables users to evaluate compiler-hardware combinations for correctness and performance with limited software engineering experience. As with other PDE solvers, wall time per spatially discretized grid point serves as a figure of merit. We present MFC benchmarking results for five generations of NVIDIA GPUs, three generations of AMD GPUs, and various CPU architectures, utilizing Intel, Cray, NVIDIA, AMD, and GNU compilers. These tests have revealed compiler bugs and regressions on recent machines such as Frontier and El Capitan. MFC has benchmarked approximately 50 compute devices and 5 flagship supercomputers.