Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network
Wenjie Li, Heng Guo, Xuannan Liu, Kongming Liang, Jiani Hu, Zhanyu Ma, Jun Guo
TL;DR
This work tackles face super-resolution (FSR) by mitigating distortions caused by downsampling in encoder–decoder models. It introduces WFEN, a wavelet-based feature enhancement network that uses discrete wavelet transform to split features into low- and high-frequency bands and a full-domain Transformer (FDT) to comprehensively capture local, regional, and global facial information. The main contributions are the Wavelet Feature Downsample (WFD) and Wavelet Feature Upgrade (WFU) for distortion-free downsampling and upsampling, and the Full-domain Transformer (FDT) for cross-scale feature fusion, achieving a favorable balance between performance, model size, and speed. Extensive experiments on CelebA, Helen, and SCface demonstrate improved fidelity (PSNR/SSIM/LPIPS/VIF) and identity preservation with lower computational cost, highlighting the method’s practical potential for real-world FSR and surveillance applications.
Abstract
Face super-resolution aims to reconstruct a high-resolution face image from a low-resolution face image. Previous methods typically employ an encoder-decoder structure to extract facial structural features, where the direct downsampling inevitably introduces distortions, especially to high-frequency features such as edges. To address this issue, we propose a wavelet-based feature enhancement network, which mitigates feature distortion by losslessly decomposing the input feature into high and low-frequency components using the wavelet transform and processing them separately. To improve the efficiency of facial feature extraction, a full domain Transformer is further proposed to enhance local, regional, and global facial features. Such designs allow our method to perform better without stacking many modules as previous methods did. Experiments show that our method effectively balances performance, model size, and speed. Code link: https://github.com/PRIS-CV/WFEN.
