Table of Contents
Fetching ...

CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement

Qiang Zhu, Jinhua Hao, Yukang Ding, Yu Liu, Qiao Mo, Ming Sun, Chao Zhou, Shuyuan Zhu

TL;DR

The Coding Priors-Guided Aggregation (CPGA) network to utilize temporal and spatial information from coding priors to remedy the shortage of previous datasets on the lack of coding information is proposed.

Abstract

Recently, numerous approaches have achieved notable success in compressed video quality enhancement (VQE). However, these methods usually ignore the utilization of valuable coding priors inherently embedded in compressed videos, such as motion vectors and residual frames, which carry abundant temporal and spatial information. To remedy this problem, we propose the Coding Priors-Guided Aggregation (CPGA) network to utilize temporal and spatial information from coding priors. The CPGA mainly consists of an inter-frame temporal aggregation (ITA) module and a multi-scale non-local aggregation (MNA) module. Specifically, the ITA module aggregates temporal information from consecutive frames and coding priors, while the MNA module globally captures spatial information guided by residual frames. In addition, to facilitate research in VQE task, we newly construct the Video Coding Priors (VCP) dataset, comprising 300 videos with various coding priors extracted from corresponding bitstreams. It remedies the shortage of previous datasets on the lack of coding information. Experimental results demonstrate the superiority of our method compared to existing state-of-the-art methods. The code and dataset will be released at https://github.com/VQE-CPGA/CPGA.git .

CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement

TL;DR

The Coding Priors-Guided Aggregation (CPGA) network to utilize temporal and spatial information from coding priors to remedy the shortage of previous datasets on the lack of coding information is proposed.

Abstract

Recently, numerous approaches have achieved notable success in compressed video quality enhancement (VQE). However, these methods usually ignore the utilization of valuable coding priors inherently embedded in compressed videos, such as motion vectors and residual frames, which carry abundant temporal and spatial information. To remedy this problem, we propose the Coding Priors-Guided Aggregation (CPGA) network to utilize temporal and spatial information from coding priors. The CPGA mainly consists of an inter-frame temporal aggregation (ITA) module and a multi-scale non-local aggregation (MNA) module. Specifically, the ITA module aggregates temporal information from consecutive frames and coding priors, while the MNA module globally captures spatial information guided by residual frames. In addition, to facilitate research in VQE task, we newly construct the Video Coding Priors (VCP) dataset, comprising 300 videos with various coding priors extracted from corresponding bitstreams. It remedies the shortage of previous datasets on the lack of coding information. Experimental results demonstrate the superiority of our method compared to existing state-of-the-art methods. The code and dataset will be released at https://github.com/VQE-CPGA/CPGA.git .
Paper Structure (17 sections, 5 equations, 8 figures, 6 tables)

This paper contains 17 sections, 5 equations, 8 figures, 6 tables.

Figures (8)

  • Figure 1: Comparison between previous methods and our CPGA. Compared with previous methods, the coding priors are extracted from the bitstream to input into our enhancement model for VQE.
  • Figure 2: Some examples of LQ frames, motion vectors, predictive frames and residual frames in proposed VCP dataset.
  • Figure 3: The architecture of CPGA. The MVs and predictive frames are fed into the ITA module to obtain the temporally-aggregated feature, and current residual frame is utilized in the MNA module to obtain the spatially-aggregated feature.
  • Figure 4: The structure of non-local aggregation unit (NLAU).
  • Figure 5: The structure of the quality enhancement (QE) module and the shift channel attention block (SCAB).
  • ...and 3 more figures