Smooth Min-Max Monotonic Networks
Christian Igel
TL;DR
Monotonicity constraints are valuable for plausibility and fairness, but MM networks suffer from training difficulties due to silent neurons and zero gradients. The authors introduce Smooth Min-Max (SMM) networks by replacing max/min with smooth LogSumExp operators, yielding differentiable, end-to-end trainable monotone models that preserve MM's asymptotic approximation guarantees. Empirical results across univariate, multivariate, and real-world partial-monotone tasks show SMM achieving state-of-the-art or competitive generalization with lower complexity and robust training dynamics. Overall, SMM serves as a drop-in monotone module that combines simplicity, efficiency, and strong empirical performance, challenging the need for more complex monotone architectures. The work highlights the practical impact of smooth monotone designs for scientific modelling and responsible AI applications.
Abstract
Monotonicity constraints are powerful regularizers in statistical modelling. They can support fairness in computer-aided decision making and increase plausibility in data-driven scientific models. The seminal min-max (MM) neural network architecture ensures monotonicity, but often gets stuck in undesired local optima during training because of partial derivatives of the MM nonlinearities being zero. We propose a simple modification of the MM network using strictly-increasing smooth minimum and maximum functions that alleviates this problem. The resulting smooth min-max (SMM) network module inherits the asymptotic approximation properties from the MM architecture. It can be used within larger deep learning systems trained end-to-end. The SMM module is conceptually simple and computationally less demanding than state-of-the-art neural networks for monotonic modelling. Our experiments show that this does not come with a loss in generalization performance compared to alternative neural and non-neural approaches.
