Bayesian Influence Functions for Hessian-Free Data Attribution
Philipp Alexander Kreer, Wilson Wu, Maxwell Adam, Zach Furman, Jesse Hoogland
TL;DR
This work introduces the local Bayesian influence function (BIF), a Hessian-free training data attribution method that replaces Hessian inversion with covariance estimation over a localized posterior, enabling scalable data attribution for deep networks with billions of parameters. By leveraging SGLD-based covariance estimation, the local BIF captures higher-order interactions in the loss landscape and reduces to the classical influence function in non-singular settings, providing a principled generalization for modern DNNs. Empirically, the method matches or exceeds state-of-the-art Hessian-based baselines on retraining-prediction benchmarks, offers fine-grained per-token attribution in language models, and scales more favorably as model size grows. The approach is architecture-agnostic, provides interpretable visualizations, and opens avenues for dynamic, checkpoint-level data attribution, with practical trade-offs in sampling cost and hyperparameter sensitivity.
Abstract
Classical influence functions face significant challenges when applied to deep neural networks, primarily due to non-invertible Hessians and high-dimensional parameter spaces. We propose the local Bayesian influence function (BIF), an extension of classical influence functions that replaces Hessian inversion with loss landscape statistics that can be estimated via stochastic-gradient MCMC sampling. This Hessian-free approach captures higher-order interactions among parameters and scales efficiently to neural networks with billions of parameters. We demonstrate state-of-the-art results on predicting retraining experiments.
