Fast Jet Tagging with MLP-Mixers on FPGAs

πŸ“… 2025-03-05
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the challenge of deploying real-time jet tagging for the Large Hadron Collider (LHC) on resource-constrained hardware such as FPGAs, this work introduces MLP-Mixerβ€”the first application of this architecture to high-energy physics jet classification. We propose a non-permutation-invariant sequence encoding scheme to enable feature-priority scheduling and integrate fine-grained quantization with distributed arithmetic for FPGA co-optimization. Compared to state-of-the-art models, our approach achieves comparable or superior classification accuracy while reducing FPGA resource utilization by 97%, doubling throughput, and cutting inference latency by 50%. This work overcomes the efficiency bottleneck of sequence modeling on FPGAs and establishes a new benchmark for real-time AI inference in high-energy physics.

Technology Category

Application Category

πŸ“ Abstract
We explore the innovative use of MLP-Mixer models for real-time jet tagging and establish their feasibility on resource-constrained hardware like FPGAs. MLP-Mixers excel in processing sequences of jet constituents, achieving state-of-the-art performance on datasets mimicking Large Hadron Collider conditions. By using advanced optimization techniques such as High-Granularity Quantization and Distributed Arithmetic, we achieve unprecedented efficiency. These models match or surpass the accuracy of previous architectures, reduce hardware resource usage by up to 97%, double the throughput, and half the latency. Additionally, non-permutation-invariant architectures enable smart feature prioritization and efficient FPGA deployment, setting a new benchmark for machine learning in real-time data processing at particle colliders.
Problem

Research questions and friction points this paper is trying to address.

Real-time jet tagging using MLP-Mixer models on FPGAs.
Achieving high efficiency with advanced optimization techniques.
Setting benchmarks for ML in particle collider data processing.
Innovation

Methods, ideas, or system contributions that make the work stand out.

MLP-Mixer models for real-time jet tagging
High-Granularity Quantization and Distributed Arithmetic
Non-permutation-invariant architectures for FPGA deployment
πŸ”Ž Similar Papers
No similar papers found.
C
Chang Sun
California Institute of Technology, Pasadena, CA, USA
Jennifer Ngadiuba
Jennifer Ngadiuba
Wilson Fellow, Fermilab
experimental high-energy physicsdata sciencedeep learningartificial intelligenceFPGAs
Maurizio Pierini
Maurizio Pierini
CERN
Particle PhysicsMachine Learning
M
Maria Spiropulu
California Institute of Technology, Pasadena, CA, USA