Polyhedra Encoding Transformers: Enhancing Diffusion MRI Analysis Beyond Voxel and Volumetric Embedding

📅 2025-01-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing deep learning methods for diffusion MRI (dMRI) spherical signal modeling underutilize directional priors and neglect the intrinsic spherical geometry. Method: We propose a spherical Transformer architecture based on icosahedral polyhedral encoding. Our approach introduces a novel structured resampling and embedding mechanism that maps gradient directions onto the unit sphere, explicitly incorporating spherical geometric symmetry into both positional encoding and self-attention design, while supporting multi-shell protocol adaptation during training. Contribution/Results: Evaluated on multi-compartment model parameter estimation and fiber orientation distribution (FOD) reconstruction, our method significantly outperforms CNNs and standard Transformers, achieving 12.7%–19.3% higher accuracy. It establishes a new paradigm for dMRI spherical signal modeling that jointly ensures geometric consistency and high representational capacity.

Technology Category

Application Category

📝 Abstract
Diffusion-weighted Magnetic Resonance Imaging (dMRI) is an essential tool in neuroimaging. It is arguably the sole noninvasive technique for examining the microstructural properties and structural connectivity of the brain. Recent years have seen the emergence of machine learning and data-driven approaches that enhance the speed, accuracy, and consistency of dMRI data analysis. However, traditional deep learning models often fell short, as they typically utilize pixel-level or volumetric patch-level embeddings similar to those used in structural MRI, and do not account for the unique distribution of various gradient encodings. In this paper, we propose a novel method called Polyhedra Encoding Transformer (PE-Transformer) for dMRI, designed specifically to handle spherical signals. Our approach involves projecting an icosahedral polygon onto a unit sphere to resample signals from predetermined directions. These resampled signals are then transformed into embeddings, which are processed by a transformer encoder that incorporates orientational information reflective of the icosahedral structure. Through experimental validation with various gradient encoding protocols, our method demonstrates superior accuracy in estimating multi-compartment models and Fiber Orientation Distributions (FOD), outperforming both conventional CNN architectures and standard transformers.
Problem

Research questions and friction points this paper is trying to address.

Deep Learning
Diffusion Weighted MRI
Signal Distribution
Innovation

Methods, ideas, or system contributions that make the work stand out.

Polyhedral Encoding Transformer
Diffusion Weighted MRI
Spherical Signal Processing
🔎 Similar Papers
No similar papers found.
Tianyuan Yao
Tianyuan Yao
Vanderbilt University
Machine Learningmedical image processing
Z
Zhiyuan Li
Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, USA
Praitayini Kanakaraj
Praitayini Kanakaraj
PhD Student, Vanderbilt University
Medical Image AnalysisNeuroimagingImaging InformaticsMachine Learning
D
D. Archer
Department of Neurology, Vanderbilt University Medical Center, Nashville, TN, USA
K
Kurt G. Schilling
Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, USA
L
Lori Beason-Held
Laboratory of Behavioral Neuroscience, National Institute on Aging, Baltimore, MD, USA
S
Susan M. Resnick
Laboratory of Behavioral Neuroscience, National Institute on Aging, Baltimore, MD, USA
B
Bennett A. Landman
Department of Computer Science, Vanderbilt University, Nashville, TN, USA; Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, USA; Department of Neurology, Vanderbilt University Medical Center, Nashville, TN, USA; Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, USA
Yuankai Huo
Yuankai Huo
Computer Science, Vanderbilt University
Medical Image AnalysisDeep LearningData Mining