Bezier Distillation
Ling Feng, SK Yang
TL;DR
The paper tackles error accumulation in Rectified Flow's distribution-to-distribution transport by introducing Bezier distillation, which inserts one or more guiding distributions and uses Bezier curve interpolation to shape the transport path. It formalizes quadratic and cubic Bezier-guided distillation with multi-teacher support, and derives drift-fitting objectives that enable direct prediction of the end state from the current state. The approach yields faster inference and improved sample quality with fewer Rectified Flow iterations, while maintaining alignment to the real data distribution $\\pi_1$. This framework offers a stable, efficient mechanism for distribution transfer in continuous-time generative models and suggests promising extensions to multimodal generation and optimal transport-based methods.
Abstract
In Rectified Flow, by obtaining the rectified flow several times, the mapping relationship between distributions can be distilled into a neural network, and the target distribution can be directly predicted by the straight lines of the flow. However, during the pairing process of the mapping relationship, a large amount of error accumulation will occur, resulting in a decrease in performance after multiple rectifications. In the field of flow models, knowledge distillation of multi - teacher diffusion models is also a problem worthy of discussion in accelerating sampling. I intend to combine multi - teacher knowledge distillation with Bezier curves to solve the problem of error accumulation. Currently, the related paper is being written by myself.
