Automated Machine Learning in Insurance

Panyi Dong; Zhiyu Quan

TL;DR

The proposed AutoML workflow is tailored to insurance applications: a balancing step in data preprocessing, ensemble pipelines, and customized loss functions are designed to address the unique challenges of the insurance domain.

Abstract

Machine Learning (ML) has gained popularity in actuarial research and insurance industry applications. However, the performance of most ML tasks depends heavily on data preprocessing, model selection, and hyperparameter optimization, all of which demand substantial domain knowledge, experience, and manual labor. Automated Machine Learning (AutoML) aims to complete the full life cycle of ML tasks automatically and to provide state-of-the-art ML models without human intervention or supervision. This paper introduces an AutoML workflow that allows users without domain knowledge or prior experience to achieve robust and effortless ML deployment by writing only a few lines of code. The proposed AutoML is tailored to insurance applications, with features such as a balancing step in data preprocessing, ensemble pipelines, and customized loss functions. These features are designed to address the unique challenges of the insurance domain, including the imbalanced nature of common insurance datasets. The full code and documentation are available in the GitHub repository (https://github.com/PanyiDong/InsurAutoML).
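To make the "balancing step" concrete, the following is a minimal sketch of one common balancing technique, random oversampling of the minority class. It is an illustration only and does not use or reproduce the InsurAutoML API; the function name `random_oversample`, its parameters, and the toy data are all hypothetical, and the paper's own balancing methods may differ.

```python
import random
from collections import Counter


def random_oversample(features, labels, seed=0):
    """Resample minority-class rows with replacement until every class
    has as many rows as the largest class (hypothetical helper)."""
    rng = random.Random(seed)

    # Group feature rows by their class label.
    by_class = {}
    for x, y in zip(features, labels):
        by_class.setdefault(y, []).append(x)

    # Every class is grown to the size of the largest class.
    target = max(len(rows) for rows in by_class.values())

    out_x, out_y = [], []
    for y, rows in by_class.items():
        # Draw the missing rows at random, with replacement.
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        for x in rows + extra:
            out_x.append(x)
            out_y.append(y)
    return out_x, out_y


# Toy portfolio (made-up values): label 1 = policy with a claim, 0 = no claim.
features = [[25, 1.0], [40, 0.5], [33, 1.0], [58, 0.8], [47, 1.0], [61, 0.3]]
labels = [0, 0, 0, 0, 0, 1]

balanced_x, balanced_y = random_oversample(features, labels)
print(Counter(balanced_y))
```

Oversampling should be applied to the training split only, so that duplicated rows do not leak into the test set used for evaluation.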

Paper Structure (19 sections, 31 equations, 9 figures, 10 tables, 4 algorithms)


Introduction
The Concept of AutoML
Insurance Domain-Specific AutoML
Model Pipeline
Automated Optimization
Ensemble Model
Loss Functions
AutoML in Action
French Motor Third-Party Liability
Wisconsin Local Government Property Insurance Fund
Australian Automobile Insurance
AutoML as a Benchmark
Conclusion
Notations
Three ensemble strategies
...and 4 more sections

Figures (9)

Figure 1: An illustration of AutoML workflow
Figure 2: Train/Test deviance and runtime on freMTPL2freq dataset
Figure 3: Train/Test error and runtime on LGPIF dataset
Figure 4: Train/Test error and runtime on ausprivauto occurrence dataset
Figure 5: Train/Test deviance and runtime on ausprivauto frequency dataset
...and 4 more figures
