CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task
Kangzhen Yang, Tao Hu, Kexin Dai, Genggeng Chen, Yu Cao, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan
TL;DR
CRNet tackles unified restoration and enhancement from multi-exposure inputs to produce HDR-like images by aligning five exposures and jointly refining high-frequency details. It introduces frequency separation via pooling and a Multi-Branch Block to fuse high- and low-frequency content, plus a Convolutional Enhancement Block with 7$\times$7 depthwise separable convolutions and an inverted bottleneck ConvFFN to enlarge receptive fields. The architecture processes concatenated, aligned frames and fuses refined features with a reference frame through three High-Frequency Enhancement Modules, guided by a mu-law tone-mapped L1 loss $L = \lVert T(H) - T(\hat{H}) \rVert_1$ with $T(x) = \frac{\log(1 + \mu x)}{\log(1+\mu)}$ and $\mu = 5000$. Evaluated on track 1 of the Bracketing Image Restoration Challenge, CRNet achieves third place and outperforms previous SOTA models in both PSNR_$\mu$ and SSIM_$\mu$, while maintaining lower computational costs. The results demonstrate improved edge and texture preservation, especially in high-frequency regions, enabling robust unified restoration of motion-blurred, noisy, and under/overexposed scenes.
Abstract
In real-world scenarios, images captured often suffer from blurring, noise, and other forms of image degradation, and due to sensor limitations, people usually can only obtain low dynamic range images. To achieve high-quality images, researchers have attempted various image restoration and enhancement operations on photographs, including denoising, deblurring, and high dynamic range imaging. However, merely performing a single type of image enhancement still cannot yield satisfactory images. In this paper, to deal with the challenge above, we propose the Composite Refinement Network (CRNet) to address this issue using multiple exposure images. By fully integrating information-rich multiple exposure inputs, CRNet can perform unified image restoration and enhancement. To improve the quality of image details, CRNet explicitly separates and strengthens high and low-frequency information through pooling layers, using specially designed Multi-Branch Blocks for effective fusion of these frequencies. To increase the receptive field and fully integrate input features, CRNet employs the High-Frequency Enhancement Module, which includes large kernel convolutions and an inverted bottleneck ConvFFN. Our model secured third place in the first track of the Bracketing Image Restoration and Enhancement Challenge, surpassing previous SOTA models in both testing metrics and visual quality.
