Conformal prediction for frequency-severity modeling
Helton Graziadei, Paulo C. Marques F., Eduardo F. L. de Melo, Rodrigo S. Targino
TL;DR
This paper develops a model-agnostic conformal prediction framework to construct finite-sample prediction intervals for two-stage frequency-severity insurance models, applicable to both parametric and machine-learning severity predictors. It extends split conformal prediction to jointly handle frequency and severity with a residual-based conformity scheme, guaranteeing coverage at $1-\alpha$ up to a tunable finite-sample bound, and introduces a two-stage out-of-bag extension using random forests to avoid calibration data altogether. Through synthetic data and real insurance datasets (MTPL Belgium and Brazilian crop insurance), the approach demonstrates comparable coverage with substantially narrower intervals when using random forests for severity, and further gains when employing the out-of-bag variant. The work concludes that conformal prediction provides reliable uncertainty quantification for frequency-severity modeling, with practical impact for risk pricing and reserving, and offers open-source software to reproduce the results.
Abstract
We present a model-agnostic framework for the construction of prediction intervals of insurance claims, with finite sample statistical guarantees, extending the technique of split conformal prediction to the domain of two-stage frequency-severity modeling. The framework effectiveness is showcased with simulated and real datasets using classical parametric models and contemporary machine learning methods. When the underlying severity model is a random forest, we extend the two-stage split conformal prediction algorithm, showing how the out-of-bag mechanism can be leveraged to eliminate the need for a calibration set in the conformal procedure.
