QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution

Bowen Chai; Zheng Chen; Libo Zhu; Wenbo Li; Yong Guo; Yulun Zhang

QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution

Bowen Chai, Zheng Chen, Libo Zhu, Wenbo Li, Yong Guo, Yulun Zhang

TL;DR

QuantVSR tackles the practical deployment barrier of diffusion-based video super-resolution by introducing a low-bit post-training quantization framework. It combines a spatio-temporal complexity aware (STCA) mechanism with a learnable bias alignment (LBA) module to enable a dual-branch quantization layer that preserves FP performance while using 4- to 6-bit quantization. The method allocates layer-specific ranks based on calibration data, jointly refines the FP and low-bit branches, and trains a small bias adaptor to mitigate quantization bias; experiments show near FP performance at 4-bit and significant improvements over existing quantization methods on both synthetic and real-world VSR datasets. This work enables efficient, real-world deployment of diffusion-based VSR models on edge devices and resource-constrained settings, with code available at the provided repository.

Abstract

Diffusion models have shown superior performance in real-world video super-resolution (VSR). However, the slow processing speeds and heavy resource consumption of diffusion models hinder their practical application and deployment. Quantization offers a potential solution for compressing the VSR model. Nevertheless, quantizing VSR models is challenging due to their temporal characteristics and high fidelity requirements. To address these issues, we propose QuantVSR, a low-bit quantization model for real-world VSR. We propose a spatio-temporal complexity aware (STCA) mechanism, where we first utilize the calibration dataset to measure both spatial and temporal complexities for each layer. Based on these statistics, we allocate layer-specific ranks to the low-rank full-precision (FP) auxiliary branch. Subsequently, we jointly refine the FP and low-bit branches to achieve simultaneous optimization. In addition, we propose a learnable bias alignment (LBA) module to reduce the biased quantization errors. Extensive experiments on synthetic and real-world datasets demonstrate that our method obtains comparable performance with the FP model and significantly outperforms recent leading low-bit quantization methods. Code is available at: https://github.com/bowenchai/QuantVSR.

QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution

TL;DR

Abstract

QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)