Table of Contents
Fetching ...

Perturbing the Derivative: Wild Refitting for Model-Free Evaluation of Machine Learning Models under Bregman Losses

Haichen Hu, David Simchi-Levi

TL;DR

The paper addresses the challenge of obtaining high-probability excess-risk bounds for complex, opaque models trained with empirical risk minimization under general Bregman losses, without relying on global function-class structure. It introduces Wild Refitting with Bregman Loss, a model-free procedure that perturbs derivatives rather than outputs, using a randomized Rademacher scheme to form wild responses and retrain a new predictor. The core contribution is a non-asymptotic, high-probability bound on the excess risk that depends on the wild optimism term and several controllable components, valid under fixed design and black-box access to the trainer. This framework enables principled model evaluation for deep neural networks and generative models, with potential extensions to random design and high-dimensional settings, and invites empirical validation on real-world opaque models.

Abstract

We study the excess risk evaluation of classical penalized empirical risk minimization (ERM) with Bregman losses. We show that by leveraging the idea of wild refitting, one can efficiently upper bound the excess risk through the so-called "wild optimism," without relying on the global structure of the underlying function class. This property makes our approach inherently model-free. Unlike conventional analysis, our framework operates with just one dataset and black-box access to the training procedure. The method involves randomized Rademacher symmetrization and constructing artificially modified outputs by perturbation in the derivative space with appropriate scaling, upon which we retrain a second predictor for excess risk estimation. We establish high-probability performance guarantee under the fixed design setting, demonstrating that wild refitting under Bregman losses, with an appropriately chosen wild noise scale, yields a valid upper bound on the excess risk. Thus, our work is promising for theoretically evaluating modern opaque ML models, such as deep neural networks and generative models, where the function class is too complex for classical learning theory and empirical process techniques.

Perturbing the Derivative: Wild Refitting for Model-Free Evaluation of Machine Learning Models under Bregman Losses

TL;DR

The paper addresses the challenge of obtaining high-probability excess-risk bounds for complex, opaque models trained with empirical risk minimization under general Bregman losses, without relying on global function-class structure. It introduces Wild Refitting with Bregman Loss, a model-free procedure that perturbs derivatives rather than outputs, using a randomized Rademacher scheme to form wild responses and retrain a new predictor. The core contribution is a non-asymptotic, high-probability bound on the excess risk that depends on the wild optimism term and several controllable components, valid under fixed design and black-box access to the trainer. This framework enables principled model evaluation for deep neural networks and generative models, with potential extensions to random design and high-dimensional settings, and invites empirical validation on real-world opaque models.

Abstract

We study the excess risk evaluation of classical penalized empirical risk minimization (ERM) with Bregman losses. We show that by leveraging the idea of wild refitting, one can efficiently upper bound the excess risk through the so-called "wild optimism," without relying on the global structure of the underlying function class. This property makes our approach inherently model-free. Unlike conventional analysis, our framework operates with just one dataset and black-box access to the training procedure. The method involves randomized Rademacher symmetrization and constructing artificially modified outputs by perturbation in the derivative space with appropriate scaling, upon which we retrain a second predictor for excess risk estimation. We establish high-probability performance guarantee under the fixed design setting, demonstrating that wild refitting under Bregman losses, with an appropriately chosen wild noise scale, yields a valid upper bound on the excess risk. Thus, our work is promising for theoretically evaluating modern opaque ML models, such as deep neural networks and generative models, where the function class is too complex for classical learning theory and empirical process techniques.

Paper Structure

This paper contains 20 sections, 19 theorems, 131 equations, 1 figure, 1 algorithm.

Key Result

Proposition 3.2

If $\phi$ is $\beta$-smooth, then $\forall\ y$, $D_{\phi}(\cdot,y)$ is $\beta$-smooth with respect to the first variable.

Figures (1)

  • Figure 1: Illustration of Lemma \ref{['lemma:W_n<Opt']}

Theorems & Definitions (28)

  • Proposition 3.2
  • Proposition 3.3
  • Proposition 3.4
  • Example 3.5
  • Example 3.6
  • Example 3.7
  • Proposition 3.8
  • Remark 4.1
  • Lemma 5.1
  • Theorem 5.2
  • ...and 18 more