FCVSR: A Frequency-aware Method for Compressed Video Super-Resolution
Qiang Zhu, Fan Zhang, Feiyu Chen, Shuyuan Zhu, David Bull, Bing Zeng
TL;DR
FCVSR tackles compressed video super-resolution by leveraging frequency-domain information through a motion-guided adaptive alignment (MGAA) and a multi-frequency feature refinement (MFFR). A frequency-aware loss, combining spatial and contrastive components, guides the restoration of fine high-frequency details. The key contributions—MGAA for motion-aware frequency-domain alignment, MFFR for subband-specific refinement, and the frequency-aware contrastive loss—collectively yield improved PSNR/SSIM/VMAF while maintaining low to moderate complexity. This approach offers practical benefits for improving the quality of compressed videos in real-world pipelines, especially where decoding artifacts and motion dynamics are challenging.
Abstract
Compressed video super-resolution (SR) aims to generate high-resolution (HR) videos from the corresponding low-resolution (LR) compressed videos. Recently, some compressed video SR methods attempt to exploit the spatio-temporal information in the frequency domain, showing great promise in super-resolution performance. However, these methods do not differentiate various frequency subbands spatially or capture the temporal frequency dynamics, potentially leading to suboptimal results. In this paper, we propose a deep frequency-based compressed video SR model (FCVSR) consisting of a motion-guided adaptive alignment (MGAA) network and a multi-frequency feature refinement (MFFR) module. Additionally, a frequency-aware contrastive loss is proposed for training FCVSR, in order to reconstruct finer spatial details. The proposed model has been evaluated on three public compressed video super-resolution datasets, with results demonstrating its effectiveness when compared to existing works in terms of super-resolution performance (up to a 0.14dB gain in PSNR over the second-best model) and complexity.
