scLLM-DSC: LLM-Knowledge Enhanced Cross-Modal Deep Structural Clustering for Single-Cell RNA Sequencing

📅 2026-06-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses a critical limitation in existing single-cell RNA sequencing clustering methods, which predominantly rely on numerical statistical patterns while neglecting the biological semantics of genes, thereby creating semantic blind spots. To overcome this, the authors propose a novel framework that uniquely integrates biological prior knowledge from large language models with graph-guided encoding to construct a knowledge-driven semantic view and a structure-aware topological view. These dual representations are jointly modeled within a unified latent space through a cross-modal contrastive alignment mechanism, enabling deep structural clustering that effectively bridges the inherent gap between generative pretraining and discriminative clustering objectives. Extensive benchmark evaluations demonstrate that the proposed method significantly outperforms eleven state-of-the-art approaches, achieving substantial improvements in clustering accuracy.
📝 Abstract
Clustering is fundamental to scRNA-seq analysis, serving as a cornerstone for identifying cell populations and resolving tissue heterogeneity. However, existing methods focus on mining numerical statistical patterns, suffering from semantic agnosticism by neglecting the intrinsic biological functions encoded by genes. While Large Language Models (LLMs) offer promising semantic capabilities, their direct adaptation to cell clustering is hindered by the structural mismatch between generative pre-training objectives and discriminative downstream tasks. To bridge this gap, we propose scLLM-DSC, a novel LLM-Knowledge Enhanced Cross-Modal Deep Structural Clustering framework. Diverging from data-driven paradigms, scLLM-DSC establishes a semantically-grounded representation by synergizing two views: a Knowledge-Driven Semantic View derived from NCBI gene priors and contextualized Cell2Sentence embeddings, and a Structure-Aware Topological View extracted via a graph-guided encoder. Crucially, we introduce a cross-modal contrastive alignment mechanism to enforce consistency between biological semantics and transcriptomic features within a unified latent space. Extensive benchmarks demonstrate that scLLM-DSC significantly outperforms eleven state-of-the-art baselines in clustering accuracy.
Problem

Research questions and friction points this paper is trying to address.

single-cell RNA sequencing
clustering
semantic agnosticism
Large Language Models
biological function
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-knowledge integration
cross-modal contrastive alignment
semantic-aware clustering
graph-guided encoder
single-cell RNA sequencing
🔎 Similar Papers
2024-04-09International Conference on Database Systems for Advanced ApplicationsCitations: 7
Ping Xu
Ping Xu
Computer Network Information Center, Chinese Academy of Sciences; UCAS
Graph Neural NetworkAI4Bioinformatics
P
Pengjiang Li
Computer Network Information Center, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
T
Tian Du
Computer Network Information Center, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Zaitian Wang
Zaitian Wang
Computer Network Information Center, Chinese Academy of Sciences
Data-centric AILarge Language Models
Jiawei Gu
Jiawei Gu
Sun Yat-sen University
Natural language processingMultimodal reasoning
Ziyue Qiao
Ziyue Qiao
Assistant Professor, Great Bay University
Data MiningGraph Machine LearningKnowledge GraphAI for Science
P
Pengfei Wang
Computer Network Information Center, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
Yuanchun Zhou
Yuanchun Zhou
Computer Network Information Center,CAS
Data MiningBig Data Analysis