Provable Sample-Efficient Transfer Learning Conditional Diffusion Models via Representation Learning
Ziheng Cheng, Tianyu Xie, Shiyue Zhang, Cheng Zhang
TL;DR
The paper tackles the data inefficiency of conditional diffusion models by introducing a representation-learning framework that captures a shared low-dimensional conditioning space across tasks. It proves generalization and end-to-end distribution-estimation guarantees for transfer/meta-learning CDMs, leveraging deep ReLU networks to approximate both the score function and condition representations. The results show improved sample efficiency in few-shot and meta-learning regimes, with theoretical rates and practical experiments on conditioned diffusion and MNIST image restoration. This work lays a foundation for principled transfer learning in unsupervised diffusion-based generation, bridging representation learning with probabilistic modeling and offering insights for real-world applications with limited target data.
Abstract
While conditional diffusion models have achieved remarkable success in various applications, they require abundant data to train from scratch, which is often infeasible in practice. To address this issue, transfer learning has emerged as an essential paradigm in small data regimes. Despite its empirical success, the theoretical underpinnings of transfer learning conditional diffusion models remain unexplored. In this paper, we take the first step towards understanding the sample efficiency of transfer learning conditional diffusion models through the lens of representation learning. Inspired by practical training procedures, we assume that there exists a low-dimensional representation of conditions shared across all tasks. Our analysis shows that with a well-learned representation from source tasks, the samplecomplexity of target tasks can be reduced substantially. In addition, we investigate the practical implications of our theoretical results in several real-world applications of conditional diffusion models. Numerical experiments are also conducted to verify our results.
