ForTIFAI: Fending Off Recursive Training Induced Failure for AI Model Collapse
Soheil Zibakhsh Shabgahi, Pedram Aghazadeh, Azalia Mirhoseini, Farinaz Koushanfar
TL;DR
This work tackles the problem of model collapse in an era of rising synthetic data by introducing Truncated Cross Entropy (TCE), a confidence-aware loss that masks high-confidence predictions to preserve tail diversity during recursive training. The authors demonstrate that TCE substantially delays collapse, enabling models to tolerate over 2x more synthetic data and maintaining knowledge retention and distributional fidelity across multiple architectures and datasets. A comprehensive evaluation framework simulates continual synthetic data accumulation and measures time-to-failure, KR-test accuracy, and KL divergence to validate robustness. The findings suggest that simple, loss-based interventions can significantly extend the practical lifetime of generative models in synthetic-data-dominated regimes, with broad applicability beyond language modeling to GMMs and VAEs as well.
Abstract
The increasing reliance on generative AI models is rapidly increasing the volume of synthetic data, with some projections suggesting that most available new data for training could be machine-generated by 2030. This shift to a mainly synthetic content presents a critical challenge: repeated training in synthetic data leads to a phenomenon known as model collapse, where model performance degrades over generations of training, eventually rendering the models ineffective. While the causes of model collapse are increasingly understood, effective mitigation strategies remain scarce. We address this challenge by leveraging a key insight: auto-regressive models tend to generate text sequences to which they assign high confidence (i.e., high log-likelihood). Based on this observation, we introduce the Truncated-Cross-Entropy (TCE) loss function. TCE mitigates collapse by selectively ignoring high-confidence tokens during training, effectively filtering out likely machine-generated artifacts from the learning process. Our experiments demonstrate that models trained with TCE not only learn effectively but also exhibit significantly increased resilience, tolerating over 2.3x more synthetic data before the onset of collapse. In addition, we provide an open-source benchmark for collapse dynamics in mixed-data settings. Our results demonstrate that confidence-aware training objectives can substantially delay collapse onset, offering a practical and generalizable tool for model robustness under synthetic-data exposure.
