🤖 AI Summary
Functional ANOVA decomposition suffers from non-uniqueness in high-dimensional settings due to multiple valid decompositions, leading to unstable neural network estimates of component functions and severely undermining the reliability of interpretable AI. To address this, we propose ANOVA-NODE—a novel neural architecture that, for the first time, enforces strict uniqueness of functional ANOVA decomposition within the Neural Oblivious Decision Ensembles (NODE) framework by incorporating tensor-product structure and orthogonality constraints. We theoretically establish its optimal approximation rate and component stability. Empirically, ANOVA-NODE achieves significantly higher stability of estimated components under data perturbations and random weight initialization compared to state-of-the-art methods, while maintaining high approximation accuracy for smooth functions. This work delivers the first neural implementation of functional ANOVA decomposition that simultaneously provides rigorous theoretical guarantees and empirically demonstrated robustness—enabling reproducible, verifiable functional decomposition in practice.
📝 Abstract
Interpretability for machine learning models is becoming more and more important as machine learning models become more complex. The functional ANOVA model, which decomposes a high-dimensional function into a sum of lower dimensional functions so called components, is one of the most popular tools for interpretable AI, and recently, various neural network models have been developed for estimating each component in the functional ANOVA model. However, such neural networks are highly unstable when estimating components since the components themselves are not uniquely defined. That is, there are multiple functional ANOVA decompositions for a given function. In this paper, we propose a novel interpretable model which guarantees a unique functional ANOVA decomposition and thus is able to estimate each component stably. We call our proposed model ANOVA-NODE since it is a modification of Neural Oblivious Decision Ensembles (NODE) for the functional ANOVA model. Theoretically, we prove that ANOVA-NODE can approximate a smooth function well. Additionally, we experimentally show that ANOVA-NODE provides much more stable estimation of each component and thus much more stable interpretation when training data and initial values of the model parameters vary than existing neural network models do.