🤖 AI Summary
This work addresses the challenge of enforcing input-dependent logical and mixed-integer linear rule constraints in neural networks under data-scarce conditions, where conventional approaches often fail to guarantee strict constraint satisfaction. The authors propose an end-to-end differentiable framework that encodes domain knowledge as disjunctive constraints, derives their convex hull representations via hierarchical convex relaxations, and embeds them as differentiable optimization layers. This approach enables the first exact, hard enforcement of input-dependent mixed-integer logical rules within neural networks while preserving differentiability—overcoming limitations of existing soft-penalty or non-differentiable post-processing methods. Empirical results on real-world datasets demonstrate that the model achieves 100% rule compliance without compromising predictive accuracy.
📝 Abstract
Many learning tasks in science and engineering are characterized by sparse datasets, which limits the effectiveness of purely data-driven approaches. At the same time, these problems are often accompanied by rich domain knowledge derived from physical laws, operational requirements, and expert heuristics. Such knowledge is frequently expressed as rules involving logical propositions and linear inequalities. Existing neuro-symbolic methods typically enforce these rules approximately through soft penalties, assume input-independent rules when designing specialized architectures, or rely on non-differentiable post-processing at inference time to achieve hard constraint satisfaction. While recent advances in differentiable optimization layers enable end-to-end feasibility enforcement within neural networks, extending these approaches to logical or mixed-integer rules remains challenging due to inherent nonconvexity. In this work, we propose a unified end-to-end framework for enforcing hard, input-dependent mixed integer linear constraints within neural networks. Our approach represents rules as disjunctive constraints and applies hierarchical convex relaxations to obtain convex hull formulations. These relaxations yield tractable linear constraints that can be embedded as differentiable optimization layers while enabling exact rule satisfaction. We demonstrate the effectiveness of the proposed framework on real-world datasets, achieving perfect rule satisfaction and strong predictive performance.