Fast Data Attribution for Text-to-Image Models
Sheng-Yu Wang, Aaron Hertzmann, Alexei A Efros, Richard Zhang, Jun-Yan Zhu
TL;DR
The paper addresses the data attribution bottleneck in text-to-image models by distilling a slow unlearning-based attribution method into a fast, embedding-based retrieval framework. It introduces a two-stage data curation workflow and a learning-to-rank objective that maps synthesized prompts to training-image embeddings, enabling rapid identification of influential training data with minimal overhead. Across MSCOCO and Stable Diffusion on LAION data, the approach delivers substantial speedups (up to thousands of times faster than prior methods) while achieving competitive attribution performance, with a combined DINO+CLIP-Text feature space performing best. The method scales to hundreds of millions of training samples and large-scale models, offering a practical path toward real-world data attribution and potential compensation mechanisms, albeit with limitations tied to the teacher-based derivation and the need for further validation on extremely large models.
Abstract
Data attribution for text-to-image models aims to identify the training images that most significantly influenced a generated output. Existing attribution methods involve considerable computational resources for each query, making them impractical for real-world applications. We propose a novel approach for scalable and efficient data attribution. Our key idea is to distill a slow, unlearning-based attribution method to a feature embedding space for efficient retrieval of highly influential training images. During deployment, combined with efficient indexing and search methods, our method successfully finds highly influential images without running expensive attribution algorithms. We show extensive results on both medium-scale models trained on MSCOCO and large-scale Stable Diffusion models trained on LAION, demonstrating that our method can achieve better or competitive performance in a few seconds, faster than existing methods by 2,500x - 400,000x. Our work represents a meaningful step towards the large-scale application of data attribution methods on real-world models such as Stable Diffusion.
