A mathematical certification for positivity conditions in Neural Networks with applications to partial monotonicity and Trustworthy AI
Alejandro Polo-Molina, David Alfaya, Jose Portela
TL;DR
The paper tackles the challenge of certifying partial monotonicity and related properties for black-box neural networks without constraining architectures. It introduces LipVor, a positivity-certification algorithm leveraging Lipschitz continuity to extend pointwise positivity to neighborhoods and a Voronoi-based global coverage test, providing a finite-step certificate or counterexamples. It additionally derives an upper bound for the Lipschitz constant of the partial derivatives of an ANN, enabling derivative-based monotonicity certification, and extends the framework to training unconstrained monotonic networks via an $oldsymbol{\varepsilon}$-monotonic penalty with iterative certification. Through heat-equation, ESL, and AutoMPG case studies, LipVor demonstrates the ability to certify partial monotonicity, locate counterexamples, and guide model refinement, highlighting potential for trustworthy AI in regulated domains. The work suggests broader applicability to convexity and other properties, and discusses computational strategies to scale Voronoi-based certification in practice.
Abstract
Artificial Neural Networks (ANNs) have become a powerful tool for modeling complex relationships in large-scale datasets. However, their black-box nature poses trustworthiness challenges. In certain situations, ensuring trust in predictions might require following specific partial monotonicity constraints. However, certifying if an already-trained ANN is partially monotonic is challenging. Therefore, ANNs are often disregarded in some critical applications, such as credit scoring, where partial monotonicity is required. To address this challenge, this paper presents a novel algorithm (LipVor) that certifies if a black-box model, such as an ANN, is positive based on a finite number of evaluations. Consequently, since partial monotonicity can be expressed as a positivity condition on partial derivatives, LipVor can certify whether an ANN is partially monotonic. To do so, for every positively evaluated point, the Lipschitzianity of the black-box model is used to construct a specific neighborhood where the function remains positive. Next, based on the Voronoi diagram of the evaluated points, a sufficient condition is stated to certify if the function is positive in the domain. Unlike prior methods, our approach certifies partial monotonicity without constrained architectures or piece-wise linear activations. Therefore, LipVor could open up the possibility of using unconstrained ANN in some critical fields. Moreover, some other properties of an ANN, such as convexity, can be posed as positivity conditions, and therefore, LipVor could also be applied.
