Learning Single Index Models with Diffusion Priors
Anqi Tang, Youming Chen, Shuchen Xue, Zhaoqiang Liu
TL;DR
This work tackles the problem of recovering a high-dimensional signal from nonlinear, possibly discontinuous measurements in a semi-parametric single index model by leveraging diffusion priors. It introduces SIM-DMIS, a method that uses one round of unconditional diffusion model sampling plus partial inversion to reconstruct the signal without knowing the link function $f$. The authors provide theoretical guarantees under mild assumptions and demonstrate substantial reconstruction quality improvements over state-of-the-art baselines while dramatically reducing neural function evaluations across FFHQ, ImageNet, and CIFAR-10 datasets. The approach offers a practical, efficient framework for nonlinear signal recovery with diffusion priors and broad potential impact in imaging and related inverse problems.
Abstract
Diffusion models (DMs) have demonstrated remarkable ability to generate diverse and high-quality images by efficiently modeling complex data distributions. They have also been explored as powerful generative priors for signal recovery, resulting in a substantial improvement in the quality of reconstructed signals. However, existing research on signal recovery with diffusion models either focuses on specific reconstruction problems or is unable to handle nonlinear measurement models with discontinuous or unknown link functions. In this work, we focus on using DMs to achieve accurate recovery from semi-parametric single index models, which encompass a variety of popular nonlinear models that may have {\em discontinuous} and {\em unknown} link functions. We propose an efficient reconstruction method that only requires one round of unconditional sampling and (partial) inversion of DMs. Theoretical analysis on the effectiveness of the proposed methods has been established under appropriate conditions. We perform numerical experiments on image datasets for different nonlinear measurement models. We observe that compared to competing methods, our approach can yield more accurate reconstructions while utilizing significantly fewer neural function evaluations.
