Bayesian Multivariate Sparse Functional PCA

📅 2025-09-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenge of principal component analysis (PCA) for multivariate sparse functional data—such as longitudinal child growth trajectories—by proposing the first fully Bayesian inferential framework. Methodologically: (i) it explicitly parameterizes principal component functions using orthogonal spline bases; (ii) it ensures posterior identifiability and stability through ordered eigenvalue constraints, parameter-expanded Gibbs sampling, and Procrustes alignment; and (iii) it extends the FAST algorithm to the multivariate sparse setting, incorporating standardized covariates and parallel computation for scalability. The key contributions are a unified treatment of PCA structure learning and uncertainty propagation, yielding substantially improved estimation accuracy and robustness—particularly under weak signal conditions. Extensive simulations and real-world growth data analyses demonstrate consistent superiority over existing methods, balancing theoretical rigor with practical feasibility.

Technology Category

Application Category

📝 Abstract
Functional Principal Components Analysis (FPCA) provides a parsimonious, semi-parametric model for multivariate, sparsely-observed functional data. Frequentist FPCA approaches estimate principal components (PCs) from the data, then condition on these estimates in subsequent analyses. As an alternative, we propose a fully Bayesian inferential framework for multivariate, sparse functional data (MSFAST) which explicitly models the PCs and incorporates their uncertainty. MSFAST builds upon the FAST approach to FPCA for univariate, densely-observed functional data. Like FAST, MSFAST represents PCs using orthonormal splines, samples the orthonormal spline coefficients using parameter expansion, and enforces eigenvalue ordering during model fit. MSFAST extends FAST to multivariate, sparsely-observed data by (1) standardizing each functional covariate to mitigate poor posterior conditioning due to disparate scales; (2) using a better-suited orthogonal spline basis; (3) parallelizing likelihood calculations over covariates; (4) updating parameterizations and priors for computational stability; (5) using a Procrustes-based posterior alignment procedure; and (6) providing efficient prediction routines. We evaluated MSFAST alongside existing implementations using simulations. MSFAST produces uniquely valid inferences and accurate estimates, particularly for smaller signals. MSFAST is motivated by and applied to a study of child growth, with an accompanying vignette illustrating the implementation step-by-step.
Problem

Research questions and friction points this paper is trying to address.

Develops Bayesian framework for multivariate sparse functional data
Models principal components with uncertainty in FPCA
Extends FAST method to handle sparse multivariate observations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bayesian multivariate sparse functional PCA framework
Orthonormal splines with parameter expansion sampling
Procrustes-based posterior alignment for computational stability
🔎 Similar Papers
No similar papers found.
J
Joseph Sartini
Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205
S
Scott Zeger
Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205
Ciprian Crainiceanu
Ciprian Crainiceanu
Professor of Biostatistics, Johns Hopkins University
StatisticsBiostatisticsSleepNeuroscienceFunctional data analysis