Model Monitoring: A General Framework with an Application to Non-life Insurance Pricing
Alexej Brauer, Paul Menzel, Mario V. Wüthrich
TL;DR
The paper tackles how non-life insurance pricing models degrade under concept drift in production data. It develops a monitoring framework that fuses a Gini-based risk-ranking test with Murphy-decomposition auto-calibration tests, underpinned by new asymptotic theory for the Gini score and bootstrap variance. The framework is model-agnostic and demonstrated on a modified motor-claim dataset with controlled drift, guiding refitting or recalibration decisions. Practical considerations and potential extensions for adaptive windowing, recurrent drift, and multi-method drift attribution are discussed to support deployment in real-world governance.
Abstract
Maintaining the predictive performance of pricing models is challenging when insurance portfolios and data-generating mechanisms evolve over time. Focusing on non-life insurance, we adopt the concept-drift terminology from machine learning and distinguish virtual drift from real concept drift in an actuarial setting. Methodologically, we (i) formalize deviance loss and Murphy's score decomposition to assess global and local auto-calibration; (ii) study the Gini score as a rank-based performance measure, derive its asymptotic distribution, and develop a consistent bootstrap estimator of its asymptotic variance; and (iii) combine these results into a statistically grounded, model-agnostic monitoring framework that integrates a Gini-based ranking drift test with global and local auto-calibration tests. An application to a modified motor insurance portfolio with controlled concept-drift scenarios illustrates how the framework guides decisions on refitting or recalibrating pricing models.
