Smoothness and monotonicity constraints for neural networks using ICEnet
Ronald Richman, Mario Wüthrich
TL;DR
The paper tackles the lack of principled smoothness and monotonicity constraints in neural networks for actuarial tasks by introducing ICEnet, which augments data with pseudo-data reflecting constrained variables and trains a single FCN to produce both standard predictions and ICE-based constraint outputs. The loss combines the predictive deviance $L^D$ with a smoothing term $L_2$ driven by third differences and a monotonicity term $L_3$ driven by first differences, implemented via a shared, time-distributed network. Empirical results on a French MTPL dataset show that monotonicity constraints can improve out-of-sample performance and yield more commercially interpretable ICE/PDP curves, while smoothing constraints may trade a bit of accuracy for stronger behavioral guarantees. A Local ICEnet variant further reduces computation while preserving much of the constraint-related benefits, making the approach practical for real-world deployment in actuarial pricing tasks.
Abstract
Deep neural networks have become an important tool for use in actuarial tasks, due to the significant gains in accuracy provided by these techniques compared to traditional methods, but also due to the close connection of these models to the Generalized Linear Models (GLMs) currently used in industry. Whereas constraining GLM parameters relating to insurance risk factors to be smooth or exhibit monotonicity is trivial, methods to incorporate such constraints into deep neural networks have not yet been developed. This is a barrier for the adoption of neural networks in insurance practice since actuaries often impose these constraints for commercial or statistical reasons. In this work, we present a novel method for enforcing constraints within deep neural network models, and we show how these models can be trained. Moreover, we provide example applications using real-world datasets. We call our proposed method ICEnet to emphasize the close link of our proposal to the individual conditional expectation (ICE) model interpretability technique.
