Additive Model Boosting: New Insights and Path(ologie)s

Rickmer Schulte; David Rügamer

TL;DR

This work investigates boosted additive models (BAMs), addressing theoretical gaps in the understanding of their convergence and implicit regularization. It derives exact parameter-path results for $L_2$-Boosting variants, connects greedy and block-wise BAMs to generalized coordinate descent with GSQ updates, and proves linear convergence under the $\mu$-Polyak-Łojasiewicz ($\mu$-PL) condition and $L$-smoothness, with specific results for regression splines, CSS, and exponential-family losses. The analysis reveals pathologies: boosting with penalized base learners converges toward the unpenalized fit, and boosting can fail to converge in certain exponential-family settings. Both findings inform the practical choice of step size and penalties. Numerical experiments validate the theory and illustrate its implications for model selection and penalty design, as well as potential avenues for inference based on boosting paths.

Abstract

Additive models (AMs) have sparked a lot of interest in machine learning recently, allowing the incorporation of interpretable structures into a wide range of model classes. Many commonly used approaches to fit a wide variety of potentially complex additive models build on the idea of boosting additive models. While boosted additive models (BAMs) work well in practice, certain theoretical aspects are still poorly understood, including general convergence behavior and what optimization problem is being solved when accounting for the implicit regularizing nature of boosting. In this work, we study the solution paths of BAMs and establish connections with other approaches for certain classes of problems. Along these lines, we derive novel convergence results for BAMs, which yield crucial insights into the inner workings of the method. While our results generally provide reassuring theoretical evidence for the practical use of BAMs, they also uncover some "pathologies" of boosting for certain additive model classes concerning their convergence behavior that require caution in practice. We empirically validate our theoretical findings through several numerical experiments.
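
One of the convergence pathologies summarized above, namely that boosting with a penalized base learner drifts toward the unpenalized fit, can be seen in a toy setting. The sketch below is not taken from the paper: it assumes a single covariate, a ridge-penalized linear base learner, and squared-error loss, and the data, the penalty `lam`, the step size `nu`, and the function name `l2_boost_ridge_1d` are all hypothetical choices made for illustration.

```python
def l2_boost_ridge_1d(x, y, lam=5.0, nu=0.1, n_steps=500):
    """L2-Boosting with one ridge-penalized linear base learner (toy sketch).

    Each step fits the penalized learner to the current residuals and adds
    a nu-scaled copy of that fit to the running coefficient.
    """
    sxx = sum(xi * xi for xi in x)
    beta = 0.0
    for _ in range(n_steps):
        residuals = [yi - beta * xi for xi, yi in zip(x, y)]
        # Ridge estimate on the residuals: x'r / (x'x + lam).
        beta_hat = sum(xi * ri for xi, ri in zip(x, residuals)) / (sxx + lam)
        beta += nu * beta_hat
    return beta


# Hypothetical toy data, roughly y = 2 * x plus noise.
x = [0.5, -1.0, 1.5, 2.0, -0.5, 1.0]
y = [1.1, -1.9, 3.2, 3.9, -1.2, 2.1]

sxx = sum(xi * xi for xi in x)
sxy = sum(xi * yi for xi, yi in zip(x, y))

beta_ols = sxy / sxx            # unpenalized least-squares fit
beta_ridge = sxy / (sxx + 5.0)  # one-shot ridge fit with the same penalty
beta_boost = l2_boost_ridge_1d(x, y)

print(f"boosted: {beta_boost:.6f}  OLS: {beta_ols:.6f}  ridge: {beta_ridge:.6f}")
```

In this toy case the error relative to the least-squares coefficient shrinks by the factor $1 - \nu \, x^\top x / (x^\top x + \lambda)$ at every step, so the boosted coefficient approaches the unpenalized fit rather than the ridge fit. The penalty only slows the path down, which is why early stopping, not the penalty alone, acts as the regularizer here.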
