Learning to optimize with guarantees: a complete characterization of linearly convergent algorithms

📅 2025-08-01
📈 Citations: 0
Influential citations: 0
📄 PDF
🤖 AI Summary
Linearly convergent optimization algorithms often exhibit suboptimal average-case performance on the problem instances that arise in practice, yet their worst-case convergence guarantees, essential in safety-critical applications, must be preserved. Method: We propose a guarantee-preserving framework grounded in nonsmooth composite optimization that, for the first time, fully characterizes the class of admissible update-rule transformations preserving linear convergence. This unified characterization covers foundational algorithms, including gradient descent, Nesterov's accelerated method, and projection-based methods, as special cases. Contribution/Results: The framework strictly maintains theoretical linear convergence rates while substantially accelerating empirical convergence in demanding settings such as solving ill-conditioned systems of linear equations and model predictive control. Experiments demonstrate substantial improvements in average-case solution quality under finite iteration budgets, achieving both rigorous worst-case guarantees and strong practical performance.
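For reference, "linear convergence" here is the standard geometric contraction property; a minimal statement in standard notation (not taken verbatim from the paper):

```latex
% Linear (geometric) convergence of iterates x_k to a minimizer x^\star
% with contraction factor \rho; the paper characterizes all update-rule
% modifications that keep such a bound intact.
\|x_{k+1} - x^\star\| \le \rho\, \|x_k - x^\star\|, \qquad \rho \in (0,1),
\qquad\text{so that}\qquad \|x_k - x^\star\| \le \rho^k \|x_0 - x^\star\|.
```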

📝 Abstract
In high-stakes engineering applications, optimization algorithms must come with provable worst-case guarantees over a mathematically defined class of problems. Designing for the worst case, however, inevitably sacrifices performance on the specific problem instances that often occur in practice. We address the problem of augmenting a given linearly convergent algorithm to improve its average-case performance on a restricted set of target problems (for example, tailoring an off-the-shelf model predictive control (MPC) solver to a specific dynamical system) while preserving its worst-case guarantees across the entire problem class. Toward this goal, we characterize the class of algorithms that achieve linear convergence for classes of nonsmooth composite optimization problems. In particular, starting from a baseline linearly convergent algorithm, we derive all, and only, the modifications to its update rule that maintain its convergence properties. Our results apply to augmenting legacy algorithms such as gradient descent for nonconvex, gradient-dominated functions; Nesterov's accelerated method for strongly convex functions; and projected methods for optimization over polyhedral feasibility sets. We showcase the effectiveness of the approach on optimization problems with tight iteration budgets, in applications to ill-conditioned systems of linear equations and MPC for linear systems.
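To make the augmentation idea concrete, here is a minimal sketch in the spirit of the abstract, not the paper's actual characterization: a heuristic update (a hypothetical heavy-ball-style step) is accepted only when it contracts the gradient norm at least as fast as a plain gradient step provably does on a strongly convex quadratic; otherwise the method falls back to the baseline. All names and constants (`heuristic_step`, `safeguarded_step`, the problem data) are illustrative assumptions.

```python
import numpy as np

# Ill-conditioned least-squares instance: f(x) = 0.5 * ||A x - b||^2.
rng = np.random.default_rng(0)
n = 50
U, _ = np.linalg.qr(rng.standard_normal((n, n)))
A = U @ np.diag(np.logspace(0, 3, n)) @ U.T   # cond(A) ~ 1e3
b = rng.standard_normal(n)

H = A.T @ A                                   # Hessian of f

def grad(x):                                  # gradient of f
    return H @ x - A.T @ b

eigs = np.linalg.eigvalsh(H)
mu, L = eigs[0], eigs[-1]                     # strong convexity / smoothness
rho = 1.0 - mu / L                            # baseline contraction of ||grad||

def heuristic_step(x, x_prev):
    # Hypothetical augmentation: a heavy-ball-style extrapolated step.
    # Any learned or hand-tuned update could play this role.
    return x - (1.8 / L) * grad(x) + 0.5 * (x - x_prev)

def safeguarded_step(x, x_prev):
    # Accept the heuristic candidate only if it contracts the gradient
    # norm at least as fast as gradient descent provably does; otherwise
    # fall back to the baseline step. For quadratics, x - (1/L) * grad(x)
    # gives grad_next = (I - H/L) @ grad(x), hence
    # ||grad_next|| <= rho * ||grad(x)||, so the worst-case linear rate
    # survives regardless of what the heuristic proposes.
    cand = heuristic_step(x, x_prev)
    if np.linalg.norm(grad(cand)) <= rho * np.linalg.norm(grad(x)):
        return cand
    return x - (1.0 / L) * grad(x)

x_prev = np.zeros(n)
x = np.zeros(n)
for _ in range(200):
    x, x_prev = safeguarded_step(x, x_prev), x
print("final gradient norm:", np.linalg.norm(grad(x)))
```

The safeguard test is the design point: the paper derives a priori the full set of modifications for which such a guarantee holds, whereas this sketch enforces it per iteration with an explicit check.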
Problem

Research questions and friction points this paper is trying to address.

Enhancing average-case performance while preserving worst-case guarantees
Characterizing linearly convergent algorithms for nonsmooth composite optimization
Tailoring optimization algorithms for specific applications like MPC
Innovation

Methods, ideas, or system contributions that make the work stand out.

Augmenting algorithms for better average-case performance
Preserving worst-case guarantees in optimization
Characterizing linearly convergent algorithm modifications
Andrea Martin
School of Electrical Engineering and Computer Science, and Digital Futures, KTH Royal Institute of Technology, Sweden
Ian R. Manchester
Professor, University of Sydney. Director, ACFR. Director, ARIAM Hub.
robotics, nonlinear control, system identification, robust machine learning
Luca Furieri
Department of Engineering Science, University of Oxford, United Kingdom