Beyond In-Place Corruption: Insertion and Deletion In Denoising Probabilistic Models
Daniel D. Johnson, Jacob Austin, Rianne van den Berg, Daniel Tarlow
TL;DR
This work extends denoising diffusion probabilistic models to non-in-place corruptions by introducing insertions and deletions into the forward process, enabling edits that can align sequences more flexibly. A probabilistic framework with PFST-based forward representations and a two-headed transformer for the reverse process yields tractable log-likelihood estimates and practical generation capabilities. Experiments on arithmetic sequences show improved likelihood and accuracy with moderate insert/delete rates, while text8 experiments demonstrate the model's ability to fix spelling errors and perform human-like edits without fine-tuning. The approach broadens the applicability of diffusion-based sequence generation to more realistic editing tasks and suggests several future directions for non-in-place edits in other modalities.
Abstract
Denoising diffusion probabilistic models (DDPMs) have shown impressive results on sequence generation by iteratively corrupting each example and then learning to map corrupted versions back to the original. However, previous work has largely focused on in-place corruption, adding noise to each pixel or token individually while keeping their locations the same. In this work, we consider a broader class of corruption processes and denoising models over sequence data that can insert and delete elements, while still being efficient to train and sample from. We demonstrate that these models outperform standard in-place models on an arithmetic sequence task, and that when trained on the text8 dataset they can be used to fix spelling errors without any fine-tuning.
