Efficient Image Restoration via Latent Consistency Flow Matching
Elad Cohen, Idan Achituve, Idit Diamant, Arnon Netzer, Hai Victor Habi
TL;DR
ELIR tackles efficient image restoration by operating in a compact latent space and guiding restoration with Latent Consistency Flow Matching (LCFM), which blends latent flow modeling with consistency constraints to minimize neural function evaluations. A lightweight convolutional architecture—enabled by a Tiny AutoEncoder, a small coarse estimator, and a U-Net-based velocity field—yields model sizes around tens of millions of parameters and高 FPS, suitable for edge devices. The method achieves competitive distortion and perceptual quality across blind face restoration, super-resolution, denoising, and inpainting while significantly reducing computational cost relative to diffusion and flow-based baselines. Theoretical bounds on Wasserstein-2 distance link encoder-decoder and vector-field errors to reconstruction quality, and extensive ablations demonstrate the efficiency and importance of the coarse estimator, encoder fine-tuning, and multi-segment latent training.
Abstract
Recent advances in generative image restoration (IR) have demonstrated impressive results. However, these methods are hindered by their substantial size and computational demands, rendering them unsuitable for deployment on edge devices. This work introduces ELIR, an Efficient Latent Image Restoration method. ELIR addresses the distortion-perception trade-off within the latent space and produces high-quality images using a latent consistency flow-based model. In addition, ELIR introduces an efficient and lightweight architecture. Consequently, ELIR is 4$\times$ smaller and faster than state-of-the-art diffusion and flow-based approaches for blind face restoration, enabling a deployment on resource-constrained devices. Comprehensive evaluations of various image restoration tasks and datasets show that ELIR achieves competitive performance compared to state-of-the-art methods, effectively balancing distortion and perceptual quality metrics while significantly reducing model size and computational cost. The code is available at: https://github.com/eladc-git/ELIR
