Frequency-Domain Latent Attention Gating for Cross-Domain Token Aggregation

πŸ“… 2026-06-06
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing cross-domain models rely solely on raw time-domain representations for token aggregation, struggling to effectively capture global and frequency-domain characteristics. This work proposes FLaG, a novel module that uniquely integrates frequency-domain analysis with implicit attention gating: tokens are first mapped into the frequency domain via real-valued FFT, then spectral components are aggregated using learnable implicit queries, and finally, a channel-wise gating mechanism reconstructs an enhanced time-domain representation for downstream pooling. FLaG establishes a universal, plug-and-play aggregation mechanism applicable across modalities and reveals a spectral pattern wherein low-frequency components dominate semantic content while high-frequency ones encode sample-specific details. Experiments demonstrate significant performance gains on ESM2-8M antimicrobial peptide prediction and CIFAR-100 image classification, competitive results on IMDB and GLUE text benchmarks, and multidimensional probing confirms FLaG’s sensitivity to and effective exploitation of spectral structures.
πŸ“ Abstract
Token aggregation is a common bottleneck in models that map token representations to sample-level predictions, yet most pooling methods operate only in the original token domain. We propose FLaG, a plug-in aggregation module that transforms token representations with the real FFT, summarizes spectral components with learnable latent queries, applies a channel-wise gate, and reconstructs enhanced time-domain tokens for final pooling. We evaluate FLaG on antimicrobial peptide (AMP) activity prediction with ESM2, image classification with ResNet18 on CIFAR-10 and CIFAR-100, and text classification with RoBERTa on IMDB and GLUE. FLaG achieves its clearest gains on the ESM2-8M antimicrobial peptide tasks and on CIFAR-100, while remaining competitive with strong text baselines on IMDB and GLUE. Then we probe its behavior on the AMP setting with band knockouts, gate summaries, residue perturbations, latent-query readouts, and structure-proxy stratification. We find that low-frequency bands contribute the most overall, and the remaining higher-band pattern is more sample-specific. The gate acts as a broadly shared spectral reweighting stage and the cross-attention patterns are sample-specific with mild query-wise differentiation, and higher-helix peptides exhibit stronger average spectral sensitivity in both bacteria. The supplementary materials, source code and data are released at https://www.healthinformaticslab.org/supp/ and https://github.com/Kewei2023/AMPCliff/tree/FLaG.
Problem

Research questions and friction points this paper is trying to address.

token aggregation
cross-domain
frequency domain
sample-level prediction
pooling
Innovation

Methods, ideas, or system contributions that make the work stand out.

frequency-domain attention
token aggregation
latent queries
spectral gating
cross-domain representation
πŸ”Ž Similar Papers
No similar papers found.
K
Kewei Li
College of Computer Science and Technology, Jilin University, Changchun, 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, China
R
Rongying Zhang
Institute for Quantitative and Computational Biology, University of California, Los Angeles, 90024, United States of America
Xueli Wang
Xueli Wang
Shanghai Maritime University
Adaptive dynamic programmingoptimal controldiscrete-time system
Xiwen Gong
Xiwen Gong
Assistant Professor of Chemical Engineering, University of Michigan
NanomaterialsOptoelectronicsBioelectronics
Z
Zhongjian Wang
BCPM Data Limited, Chengdu 610041, China
L
Lan Huang
College of Computer Science and Technology, Jilin University, Changchun, 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, China
Ruochi Zhang
Ruochi Zhang
Broad Institute
deep learningcomputational biology
Fengfeng Zhou
Fengfeng Zhou
Bioinformatics, Data Analytics
Big datafeature engineering and selectionhealth informaticsbioinformaticsdata mining