🤖 AI Summary
This paper addresses the challenge of principal component analysis (PCA) for multivariate sparse functional data—such as longitudinal child growth trajectories—by proposing the first fully Bayesian inferential framework. Methodologically: (i) it explicitly parameterizes principal component functions using orthogonal spline bases; (ii) it ensures posterior identifiability and stability through ordered eigenvalue constraints, parameter-expanded Gibbs sampling, and Procrustes alignment; and (iii) it extends the FAST algorithm to the multivariate sparse setting, incorporating standardized covariates and parallel computation for scalability. The key contributions are a unified treatment of PCA structure learning and uncertainty propagation, yielding substantially improved estimation accuracy and robustness—particularly under weak signal conditions. Extensive simulations and real-world growth data analyses demonstrate consistent superiority over existing methods, balancing theoretical rigor with practical feasibility.
📝 Abstract
Functional Principal Components Analysis (FPCA) provides a parsimonious, semi-parametric model for multivariate, sparsely-observed functional data. Frequentist FPCA approaches estimate principal components (PCs) from the data, then condition on these estimates in subsequent analyses. As an alternative, we propose a fully Bayesian inferential framework for multivariate, sparse functional data (MSFAST) which explicitly models the PCs and incorporates their uncertainty. MSFAST builds upon the FAST approach to FPCA for univariate, densely-observed functional data. Like FAST, MSFAST represents PCs using orthonormal splines, samples the orthonormal spline coefficients using parameter expansion, and enforces eigenvalue ordering during model fit. MSFAST extends FAST to multivariate, sparsely-observed data by (1) standardizing each functional covariate to mitigate poor posterior conditioning due to disparate scales; (2) using a better-suited orthogonal spline basis; (3) parallelizing likelihood calculations over covariates; (4) updating parameterizations and priors for computational stability; (5) using a Procrustes-based posterior alignment procedure; and (6) providing efficient prediction routines. We evaluated MSFAST alongside existing implementations using simulations. MSFAST produces uniquely valid inferences and accurate estimates, particularly for smaller signals. MSFAST is motivated by and applied to a study of child growth, with an accompanying vignette illustrating the implementation step-by-step.