Functional Frank-Wolfe Boosting for General Loss Functions

Chu Wang; Yingfei Wang; Weinan E; Robert Schapire

TL;DR

Boosting can overfit, especially in regression, which motivates an $l_1$-constrained approach. The authors develop FWBoost, a functional Frank-Wolfe boosting algorithm for general loss functions. The resulting $l_1$-regularized framework can be rewritten as a variant of AdaBoost under exponential loss and converges at an $O(1/t)$ rate. They establish Rademacher-based generalization bounds that do not depend on the number of boosting iterations and provide an away-steps variant for sparsity. Empirical results on UCI datasets show that FWBoost maintains test performance as the number of rounds increases and can outperform regularized gradient boosting, consistent with the theory. The work offers a practical boosting method with theoretical guarantees for a broad class of loss functions.

Abstract

Boosting is a generic learning method for classification and regression. Yet as the number of base hypotheses grows, the test performance of boosting can deteriorate. Overfitting is an important and ubiquitous phenomenon, especially in regression settings. To avoid overfitting, we consider using $l_1$ regularization. We propose a novel Frank-Wolfe type boosting algorithm (FWBoost) applicable to general loss functions. With exponential loss, the FWBoost algorithm can be rewritten as a variant of AdaBoost for binary classification. FWBoost algorithms have exactly the same form as existing boosting methods: they make calls to a base learning algorithm and differ only in how the weights are updated. This direct connection between boosting and Frank-Wolfe yields a new algorithm that is as practical as existing boosting methods but comes with new guarantees and rates of convergence. Experimental results show that the test performance of FWBoost does not degrade as the number of boosting rounds grows, which is consistent with the theoretical analysis.
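To make the connection between boosting and Frank-Wolfe concrete, here is a minimal sketch of a generic Frank-Wolfe loop over an $l_1$ ball of ensemble weights, using squared loss and decision stumps. It is not the paper's FWBoost procedure. The stump dictionary, the function names `stump_predictions` and `fw_boost`, the `radius` parameter, and the textbook step size $2/(t+2)$ are all assumptions chosen for illustration.

```python
import numpy as np


def stump_predictions(X):
    """Build a finite dictionary of decision stumps (an illustrative choice).

    Returns an array of shape (n_hypotheses, n_samples) whose rows hold the
    +1/-1 outputs of one stump "x_j > threshold" on the training samples.
    """
    rows = []
    for j in range(X.shape[1]):
        for thr in np.unique(X[:, j]):
            rows.append(np.where(X[:, j] > thr, 1.0, -1.0))
    return np.array(rows)


def fw_boost(X, y, radius=2.0, n_rounds=300):
    """Generic Frank-Wolfe on squared loss over {coef : ||coef||_1 <= radius}.

    Hypothetical helper, not the authors' algorithm.
    """
    H = stump_predictions(X)
    coef = np.zeros(H.shape[0])  # signed ensemble weights
    losses = []
    for t in range(n_rounds):
        F = coef @ H  # ensemble predictions on the training samples
        losses.append(0.5 * np.mean((F - y) ** 2))
        grad = (F - y) / len(y)  # gradient of the loss w.r.t. the predictions
        scores = H @ grad  # inner product of each stump with the gradient
        k = int(np.argmax(np.abs(scores)))
        if scores[k] == 0.0:
            break  # no stump correlates with the gradient
        # Linear minimization over the l1 ball: the minimizer is a signed,
        # scaled single stump (a vertex of the ball). This plays the role of
        # the call to the base learner.
        vertex = np.zeros_like(coef)
        vertex[k] = -radius * np.sign(scores[k])
        gamma = 2.0 / (t + 2.0)  # standard Frank-Wolfe step size (assumed)
        coef = (1.0 - gamma) * coef + gamma * vertex  # stays inside the ball
    return coef, H, losses


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.uniform(-1, 1, size=(80, 2))
    y = np.sign(X[:, 0]) + 0.5 * X[:, 1]
    coef, H, losses = fw_boost(X, y)
    print("l1 norm of weights:", np.abs(coef).sum())
    print("first / last training loss:", losses[0], losses[-1])
```

Because every iterate is a convex combination of points inside the ball, the $l_1$ norm of the weights never exceeds `radius`, no matter how many rounds are run. In this sketch the constraint, rather than early stopping, is what limits the complexity of the ensemble.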
