pared: Model selection using multi-objective optimization

📅 2025-05-27
📈 Citations: 0
Influential: 0
📄 PDF

career value

220K/year
🤖 AI Summary
In statistical modeling, conventional penalized model selection relies on a single criterion (e.g., prediction error), failing to jointly optimize sparsity, interpretability, and smoothness. To address this, we propose a Bayesian multi-objective model selection framework that—novelty—integrates Gaussian process surrogate models into multi-objective hyperparameter tuning, enabling systematic identification of the Pareto-optimal front. The method accommodates mainstream penalized estimators, including elastic net and fused lasso. Implemented in the R package *pared*, it automates Pareto-front discovery and supports interactive visualization via *ggplot2* and *plotly*. Extensive experiments on synthetic and real-world datasets demonstrate that our framework significantly improves the simultaneous optimization of goodness-of-fit, sparsity, and smoothness. By unifying these competing objectives within a coherent probabilistic framework, it establishes a new paradigm for interpretable machine learning.

Technology Category

Application Category

📝 Abstract
Motivation: Model selection is a ubiquitous challenge in statistics. For penalized models, model selection typically entails tuning hyperparameters to maximize a measure of fit or minimize out-of-sample prediction error. However, these criteria fail to reflect other desirable characteristics, such as model sparsity, interpretability, or smoothness. Results: We present the R package pared to enable the use of multi-objective optimization for model selection. Our approach entails the use of Gaussian process-based optimization to efficiently identify solutions that represent desirable trade-offs. Our implementation includes popular models with multiple objectives including the elastic net, fused lasso, fused graphical lasso, and group graphical lasso. Our R package generates interactive graphics that allow the user to identify hyperparameter values that result in fitted models which lie on the Pareto frontier. Availability: We provide the R package pared and vignettes illustrating its application to both simulated and real data at https://github.com/priyamdas2/pared.
Problem

Research questions and friction points this paper is trying to address.

Model selection lacks consideration for sparsity and interpretability
Multi-objective optimization improves trade-offs in model selection
R package enables efficient Pareto frontier visualization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-objective optimization for model selection
Gaussian process-based optimization for trade-offs
Interactive graphics for Pareto frontier analysis
🔎 Similar Papers
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
P
Priyam Das
Department of Biostatistics, Virgina Commonwealth University
S
Sarah Robinson
Department of Statistics, Rice University
Christine B. Peterson
Christine B. Peterson
Associate Professor of Biostatistics, University of Texas MD Anderson Cancer Center
graphical modelsvariable selectionBayesian statisticsmicrobiome data