Constructive interpolation and generalization rates for neural ODEs: a control perspective

📅 2026-05-29
📈 Citations: 0
Influential: 0
📄 PDF

career value

187K/year
🤖 AI Summary
This work investigates the interpolation–generalization trade-off of semi-autonomous neural ordinary differential equations (SA-NODEs) in supervised regression from a control-theoretic perspective. By introducing the novel notion of simultaneous cell controllability (SCC), the authors constructively demonstrate that SA-NODEs can achieve exact interpolation while emulating nonparametric estimators, thereby yielding quantifiable generalization bounds. The theoretical analysis reveals that explicit time dependence is a crucial mechanism enabling SA-NODEs to simultaneously attain strong interpolation capability and favorable generalization performance, highlighting inherent structural limitations of autonomous neural ODEs. The derived generalization rates match those of classical histogram and nearest-neighbor estimators, and empirical results further confirm the superiority of SA-NODEs in test error.
📝 Abstract
We study supervised regression with neural ODEs (NODEs) from a control-theoretic perspective to derive explicit population-risk bounds. We focus on a widely used class of non-autonomous models with constant parameters and explicit time dependence, which we call semi-autonomous NODEs (SA-NODEs). We constructively prove that SA-NODEs are capable of \emph{exact} interpolation of admissible finite datasets, and even satisfy a stronger property that we call \emph{simultaneous cell controllability} (SCC): their flows can map prescribed disjoint cells into arbitrarily small target balls. This property is the mechanism that upgrades interpolation into quantitative generalization, by allowing SA-NODEs to emulate piecewise-constant nonparametric estimators. Consequently, our risk bounds recover the rates of histogram and nearest-neighbor estimators, provided the network width satisfies a conservative scaling with the sample size. Numerical experiments show that trained SA-NODEs achieve competitive -- often lower -- test errors than these baselines. Finally, we show that the explicit time dependence is essential. Although two-layer autonomous NODEs can interpolate geometrically nondegenerate datasets, structural obstructions prevent them from achieving SCC. These limitations, further confirmed numerically, support the view that SA-NODEs provide a minimal effective architecture for learning.
Problem

Research questions and friction points this paper is trying to address.

neural ODEs
interpolation
generalization
controllability
supervised regression
Innovation

Methods, ideas, or system contributions that make the work stand out.

neural ODEs
simultaneous cell controllability
generalization bounds
semi-autonomous dynamics
nonparametric estimation
🔎 Similar Papers
2023-12-09arXiv.orgCitations: 2
A
Antonio Álvarez-López
Chair for Dynamics, Control, Machine Learning, and Numerics (Alexander von Humboldt Professorship), Department of Mathematics, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058 Erlangen, Germany; Departamento de Matemáticas, Universidad Autónoma de Madrid, 28049 Madrid, Spain
L
Lorenzo Liverani
Chair for Dynamics, Control, Machine Learning, and Numerics (Alexander von Humboldt Professorship), Department of Mathematics, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058 Erlangen, Germany
Enrique Zuazua
Enrique Zuazua
FAU Erlangen/Humboldt Prof. & Deusto-Bilbao & UAM-Madrid & Sherpa.ai Chief Algorithm Scientist
Applied MathematicsPartial Differential EquationsNumericsControlMachine Learning