What happens when generative AI models train recursively on each others' outputs?
Hung Anh Vu, Galen Reeves, Emily Wenger
TL;DR
The paper addresses how generative AI models might be trained on outputs from other models, proposing a formal interactive training framework parameterized by $\alpha$ and $\beta$. It derives exact mean and covariance recursions for a linear-Gaussian surrogate (via matrices $\bm{M}_t$, $\bm{C}_t$, $\bm{P}_t$, $\bm{Q}_t$) and identifies conditions for unbiasedness and convergence (notably $\rho(\bm{Q})<1$ and cases where $S$ is proportional to $\tilde{S}$). Empirically, two-model experiments with synthetic data corroborate the theory: mixing real and synthetic data can improve cross-task transfer but tends to homogenize representations, with optimal performance near $\alpha=\beta=0.5$. The findings have implications for data curation and model-update strategies in multi-model ecosystems, highlighting both potential benefits and risks of data-mediated interactions in future AI systems.
Abstract
The internet serves as a common source of training data for generative AI (genAI) models but is increasingly populated with AI-generated content. This duality raises the possibility that future genAI models may be trained on other models' generated outputs. Prior work has studied consequences of models training on their own generated outputs, but limited work has considered what happens if models ingest content produced by other models. Given society's increasing dependence on genAI tools, understanding such data-mediated model interactions is critical. This work provides empirical evidence for how data-mediated interactions might unfold in practice, develops a theoretical model for this interactive training process, and experimentally validates the theory. We find that data-mediated interactions can benefit models by exposing them to novel concepts perhaps missed in original training data, but also can homogenize their performance on shared tasks.
