Statistical learning for constrained functional parameters in infinite-dimensional models with applications in fair machine learning

📅 2024-04-15
🏛️ arXiv.org
📈 Citations: 2
Influential: 0
📄 PDF

career value

191K/year
🤖 AI Summary
This paper addresses functional parameter estimation in infinite-dimensional statistical models subject to equality/inequality constraints—such as fairness criteria, moment conditions, or calibration requirements. We propose a Lagrange multiplier–based penalized risk minimization framework that yields closed-form estimators for constrained functional parameters, marking the first explicit, fairness-driven modeling approach in nonparametric and semiparametric settings. Our method follows a two-stage plug-in paradigm compatible with arbitrary black-box learners. Theoretical analysis integrates statistical functional theory with constrained optimization, establishing estimator consistency and asymptotic normality. Empirical evaluation under multiple fairness constraints—including statistical parity and equal opportunity—demonstrates substantial fairness improvement without compromising predictive accuracy.

Technology Category

Application Category

📝 Abstract
Constrained learning has become increasingly important, especially in the realm of algorithmic fairness and machine learning. In these settings, predictive models are developed specifically to satisfy pre-defined notions of fairness. Here, we study the general problem of constrained statistical machine learning through a statistical functional lens. We consider learning a function-valued parameter of interest under the constraint that one or several pre-specified real-valued functional parameters equal zero or are otherwise bounded. We characterize the constrained functional parameter as the minimizer of a penalized risk criterion using a Lagrange multiplier formulation. We show that closed-form solutions for the optimal constrained parameter are often available, providing insight into mechanisms that drive fairness in predictive models. Our results also suggest natural estimators of the constrained parameter that can be constructed by combining estimates of unconstrained parameters of the data generating distribution. Thus, our estimation procedure for constructing fair machine learning algorithms can be applied in conjunction with any statistical learning approach and off-the-shelf software. We demonstrate the generality of our method by explicitly considering a number of examples of statistical fairness constraints and implementing the approach using several popular learning approaches.
Problem

Research questions and friction points this paper is trying to address.

Estimating function-valued parameters under constraints in infinite-dimensional models
Addressing constrained learning problems with structural requirements in statistics
Developing a framework for optimal risk and constraint satisfaction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Lagrange-type penalized risk formulation
Constraint-specific path in parameter space
Recursive representations for tractable estimation
🔎 Similar Papers
No similar papers found.
Razieh Nabi
Razieh Nabi
Rollins Assistant Professor of Biostatistics, Emory University
Causal InferenceMissing DataAlgorithmic FairnessGraphical ModelsSemiparametric Statistics
N
N. Hejazi
Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
M
M. J. Laan
Division of Biostatistics, School of Public Health, University of California, Berkeley, Berkeley, CA, USA
D
David C. Benkeser
Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, USA