Adaptive Inference-Time Scaling via Cyclic Diffusion Search
Gyubin Lee, Truong Nhat Nguyen Bao, Jaesik Yoon, Dongwoo Lee, Minsu Kim, Yoshua Bengio, Sungjin Ahn
TL;DR
Adaptive Bi-directional Cyclic Diffusion (ABCD) rethinks inference-time compute by casting diffusion-model inference as a flexible search. It combines Cyclic Diffusion Search, Automatic Exploration-Exploitation Balancing, and Adaptive Thinking Time to allocate computation adaptively per instance and terminate when further gains are unlikely. Across planning, maze solving, Sudoku, molecule generation, and text-to-image tasks, ABCD yields stronger performance under comparable or lower compute than fixed-schedule baselines. This work demonstrates that instance-aware inference-time scaling can significantly improve both efficiency and outcome quality in diffusion-based reasoning and generation.
Abstract
Diffusion models have demonstrated strong generative capabilities across domains ranging from image synthesis to complex reasoning tasks. However, most inference-time scaling methods rely on fixed denoising schedules, limiting their ability to allocate computation based on instance difficulty or task-specific demands adaptively. We introduce the challenge of adaptive inference-time scaling-dynamically adjusting computational effort during inference-and propose Adaptive Bi-directional Cyclic Diffusion (ABCD), a flexible, search-based inference framework. ABCD refines outputs through bi-directional diffusion cycles while adaptively controlling exploration depth and termination. It comprises three components: Cyclic Diffusion Search, Automatic Exploration-Exploitation Balancing, and Adaptive Thinking Time. Experiments show that ABCD improves performance across diverse tasks while maintaining computational efficiency.
