Efficient Ensembles Improve Training Data Attribution
Junwei Deng, Ting-Wei Li, Shichang Zhang, Jiaqi Ma
TL;DR
This work targets the efficiency gap in training data attribution (TDA) for large-scale models by moving beyond costly fully independent ensembles of gradient-based methods. It introduces two efficient ensemble strategies—Dropout Ensemble and LoRA Ensemble—that replace expensive independent retraining with cost-effective variants, achieving substantial reductions in training time, serving time, and space while preserving attribution efficacy as measured by LDS. Empirical results across datasets (MNIST-10, CIFAR-2, MAESTRO) and models (MLPs, ResNet-9, Music Transformer) demonstrate that the proposed methods outperform naive ensembles and maintain strong TDA performance, including in generative settings. These approaches advance the Pareto frontier of TDA methods, enabling scalable, accurate data attribution in large-scale and edge contexts, with potential applicability to a wide range of gradient-based TDA techniques.
Abstract
Training data attribution (TDA) methods aim to quantify the influence of individual training data points on the model predictions, with broad applications in data-centric AI, such as mislabel detection, data selection, and copyright compensation. However, existing methods in this field, which can be categorized as retraining-based and gradient-based, have struggled with the trade-off between computational efficiency and attribution efficacy. Retraining-based methods can accurately attribute complex non-convex models but are computationally prohibitive, while gradient-based methods are efficient but often fail for non-convex models. Recent research has shown that augmenting gradient-based methods with ensembles of multiple independently trained models can achieve significantly better attribution efficacy. However, this approach remains impractical for very large-scale applications. In this work, we discover that expensive, fully independent training is unnecessary for ensembling the gradient-based methods, and we propose two efficient ensemble strategies, DROPOUT ENSEMBLE and LORA ENSEMBLE, alternative to naive independent ensemble. These strategies significantly reduce training time (up to 80%), serving time (up to 60%), and space cost (up to 80%) while maintaining similar attribution efficacy to the naive independent ensemble. Our extensive experimental results demonstrate that the proposed strategies are effective across multiple TDA methods on diverse datasets and models, including generative settings, significantly advancing the Pareto frontier of TDA methods with better computational efficiency and attribution efficacy.
