Bayesian Group Regularization in Generalized Linear Models with a Continuous Spike-and-Slab Prior

šŸ“… 2020-07-14
šŸ“ˆ Citations: 1
✨ Influential: 0
šŸ“„ PDF
šŸ¤– AI Summary
This paper addresses Bayesian group-sparse variable selection in high-dimensional generalized linear models (GLMs), unifying logistic, Poisson, negative binomial, and Gaussian regression under canonical or non-canonical link functions. We propose a Bayesian group-regularization framework based on continuous spike-and-slab priors. For the first time, we establish that its maximum a posteriori (MAP) estimator achieves the same minimax Lā‚‚ convergence rate as the full posterior distribution, while the posterior contraction rate strictly dominates that of group Lasso. Computationally, the method integrates an EM algorithm with MCMC sampling to ensure both feasibility and theoretical rigor. Extensive simulations and real-data analysis on HIV drug resistance prediction demonstrate that the proposed approach significantly enhances robustness, statistical accuracy, and interpretability in identifying high-dimensional protein sequence features.
šŸ“ Abstract
We study Bayesian group-regularized estimation in high-dimensional generalized linear models (GLMs) under a continuous spike-and-slab prior. Our framework covers both canonical and non-canonical link functions and subsumes logistic, Poisson, negative binomial, and Gaussian regression with group sparsity. We obtain the minimax L2 convergence rate for both a maximum a posteriori (MAP) estimator and the full posterior distribution under our prior. Our theoretical results thus justify the use of the posterior mode as a point estimator. The posterior distribution also contracts at the same rate as the MAP estimator, an attractive feature of our approach which is not the case for the group lasso. For computation, we propose expectation-maximization (EM) and Markov chain Monte Carlo (MCMC) algorithms. We illustrate our method through simulations and a real data application on predicting human immunodeficiency virus (HIV) drug resistance from protein sequences.
Problem

Research questions and friction points this paper is trying to address.

Bayesian group-regularized estimation in high-dimensional GLMs
Minimax L2 convergence rate for MAP and posterior
EM and MCMC algorithms for computation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bayesian group-regularized estimation in GLMs
Continuous spike-and-slab prior for sparsity
EM and MCMC algorithms for computation
šŸ”Ž Similar Papers
No similar papers found.