Benchmarking Dimensionality Reduction Techniques for Spatial Transcriptomics

📅 2025-09-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the lack of systematic evaluation of dimensionality reduction methods in spatial transcriptomics. We establish a unified framework to benchmark PCA, NMF, autoencoders, VAEs, and hybrid embeddings across multiple parameter configurations. Methodologically, we introduce two biology-driven metrics—Cluster Marker Coherence (CMC) and Marker Exclusion Rate (MER)—and employ Pareto-optimal analysis for principled hyperparameter selection. Comprehensive evaluation integrates reconstruction error, explained variance, clustering consistency, and biological fidelity. Results show that NMF achieves superior marker gene enrichment; VAE attains the best trade-off between reconstruction quality and interpretability; and MER-guided reassignment improves average CMC by 12%, markedly enhancing spatial–molecular alignment. This work establishes a reproducible, scalable paradigm for biologically informed assessment of dimensionality reduction techniques in spatial omics.

Technology Category

Application Category

📝 Abstract
We introduce a unified framework for evaluating dimensionality reduction techniques in spatial transcriptomics beyond standard PCA approaches. We benchmark six methods PCA, NMF, autoencoder, VAE, and two hybrid embeddings on a cholangiocarcinoma Xenium dataset, systematically varying latent dimensions ($k$=5-40) and clustering resolutions ($ρ$=0.1-1.2). Each configuration is evaluated using complementary metrics including reconstruction error, explained variance, cluster cohesion, and two novel biologically-motivated measures: Cluster Marker Coherence (CMC) and Marker Exclusion Rate (MER). Our results demonstrate distinct performance profiles: PCA provides a fast baseline, NMF maximizes marker enrichment, VAE balances reconstruction and interpretability, while autoencoders occupy a middle ground. We provide systematic hyperparameter selection using Pareto optimal analysis and demonstrate how MER-guided reassignment improves biological fidelity across all methods, with CMC scores improving by up to 12% on average. This framework enables principled selection of dimensionality reduction methods tailored to specific spatial transcriptomics analyses.
Problem

Research questions and friction points this paper is trying to address.

Evaluating dimensionality reduction techniques for spatial transcriptomics
Benchmarking six methods on cholangiocarcinoma dataset
Assessing performance using reconstruction and biological metrics
Innovation

Methods, ideas, or system contributions that make the work stand out.

Benchmarking six dimensionality reduction methods
Introducing novel biologically-motivated evaluation metrics
Providing systematic hyperparameter selection framework
🔎 Similar Papers
No similar papers found.
M
Md Ishtyaq Mahmud
Department of Electrical and Computer Engineering, University of Houston, Houston, TX, USA
V
Veena Kochat
Department of Genomic Medicine and MDACC Epigenomics Therapy Initiative (METI), MD Anderson Cancer Center, Houston, TX, USA
S
Suresh Satpati
Department of Genomic Medicine and MDACC Epigenomics Therapy Initiative (METI), MD Anderson Cancer Center, Houston, TX, USA
J
Jagan Mohan Reddy Dwarampudi
Department of Electrical and Computer Engineering, University of Houston, Houston, TX, USA
Kunal Rai
Kunal Rai
Professor, MD Anderson Cancer Center
ChromatinCancerMetastasisMelanomaEpigenetics
Tania Banerjee
Tania Banerjee
University of Houston
AI/MLSmart HealthcareData ScienceHPCIntelligent Transportation Systems