FlowSteer: Conditioning Flow Field for Consistent Image Restoration
Tharindu Wickremasinghe, Chenyang Qi, Harshana Weligampola, Zhengzhong Tu, Stanley H. Chan
TL;DR
FlowSteer addresses the mismatch between flow-based image generation priors and pixel-level fidelity required in restoration. By introducing a simple, training-free scheduler that explicitly gates forward-model conditioning during the reconstruction path, FlowSteer reconciles measurement fidelity with rich generative priors in zero-shot fashion. The method, demonstrated on colorization, super-resolution, deblurring, and denoising, leverages a pre-trained flow model (Flux FLUX2024) with implicit attentional guidance augmented by a sparse fidelity-update schedule, yielding higher PSNR/LPIPS and preserved identity. The results highlight FlowSteer’s practicality as a universal, post-hoc conditioning mechanism for off-the-shelf flow models, with clear guidelines for schedule design and notable ablations clarifying the role of timing and artifacts.
Abstract
Flow-based text-to-image (T2I) models excel at prompt-driven image generation, but falter on Image Restoration (IR), often "drifting away" from being faithful to the measurement. Prior work mitigate this drift with data-specific flows or task-specific adapters that are computationally heavy and not scalable across tasks. This raises the question "Can't we efficiently manipulate the existing generative capabilities of a flow model?" To this end, we introduce FlowSteer (FS), an operator-aware conditioning scheme that injects measurement priors along the sampling path,coupling a frozed flow's implicit guidance with explicit measurement constraints. Across super-resolution, deblurring, denoising, and colorization, FS improves measurement consistency and identity preservation in a strictly zero-shot setting-no retrained models, no adapters. We show how the nature of flow models and their sensitivities to noise inform the design of such a scheduler. FlowSteer, although simple, achieves a higher fidelity of reconstructed images, while leveraging the rich generative priors of flow models.
