Integrating gene regulatory priors into Transformer attention with scTransformer for interpretable scRNA-seq analysis

📅 2026-06-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing Transformer-based methods for single-cell transcriptomic analysis overlook gene regulatory relationships, limiting model interpretability and robustness. This work proposes scTransformer, which, for the first time, explicitly integrates prior knowledge of gene regulatory networks into the Transformer’s attention mechanism by constraining information flow to align with known regulatory structures, thereby learning biologically meaningful cell representations. Evaluated under both self-supervised and supervised training frameworks on single-nucleus RNA-seq data, scTransformer significantly outperforms standard Transformers: it achieves higher cell-type classification accuracy, enhances separation of cell types in embedding space, and yields attention patterns that closely recapitulate established regulatory programs. By unifying high performance with strong interpretability, scTransformer advances the development of biologically trustworthy foundation models for single-cell genomics.
📝 Abstract
Motivation: Transformer-based models are increasingly applied to large-scale single-cell transcriptomics, showing strong performance through self-supervised learning on millions of cells. However, most existing approaches treat genes as independent features, and largely ignore prior biological knowledge, which limits interpretability and robustness. In this paper, we explore whether explicitly incorporating gene regulatory information can improve both model performance and biological insight. Results: We present scTransformer, the first Transformer-based approach that builds a priori knowledge of biological mechanisms into the model's attention patterns. By constraining information flow according to known regulatory structures, the model learns representations that are more biologically meaningful. We evaluate scTransformer on a disease-relevant single-nucleus RNA-seq dataset using supervised cell-type classification. Compared to standard Transformers, our approach improves classification accuracy, enhances separation of cell types in embedding space, and produces attention patterns consistent with known regulatory programs. Overall, our results demonstrate that embedding biological structure into Transformer models can enhance interpretability without sacrificing performance, offering a principled step toward biologically grounded foundation models for single-cell omics.
Problem

Research questions and friction points this paper is trying to address.

gene regulatory priors
Transformer attention
scRNA-seq analysis
interpretability
biological knowledge integration
Innovation

Methods, ideas, or system contributions that make the work stand out.

scTransformer
gene regulatory priors
Transformer attention
interpretable scRNA-seq
biologically grounded models
🔎 Similar Papers
No similar papers found.
M
Mikele Milia
Department of Information Engineering, University of Padova, Padova, Italy
Louis Fabrice Tshimanga
Louis Fabrice Tshimanga
Università degli Studi di Padova
Deep learningNeuroscience
Henning Müller
Henning Müller
Professor in computer science at HES-SO & in radiology at the University of Geneva
Machine learningmedical informaticsinformation retrievalcontent-based image retrievalmedical imaging
Manfredo Atzori
Manfredo Atzori
HESSO Valais, University of Padova
Biomedical data analysismultimodal machine learningEMG & prosthetics
B
Barbara Di Camillo
Department of Information Engineering, University of Padova, Padova, Italy; Department of Comparative Biomedicine and Food Science, University of Padova, Padova, Italy; Padua Center for Network Medicine, University of Padova, Padova, Italy