Table of Contents
Fetching ...

Heterogeneous window transformer for image denoising

Chunwei Tian, Menghua Zheng, Chia-Wen Lin, Zhiwu Li, David Zhang

TL;DR

HWformer addresses image denoising by bridging long-range pixel interactions and local detail preservation using a heterogeneous window transformer. It introduces heterogeneous global windows (GTEBlock) and a Transformer direction enhancement block (TDEBlock) with horizontal/vertical shifts and a sparse FFN to capture both global context and local neighboring patches without increasing denoising time. The architecture achieves competitive or state-of-the-art performance on synthetic and real-noise datasets while requiring only about 30% of Restormer’s denoising time, making it suitable for mobile devices. Ablation studies validate window size, directional shifts, and sparsity as key factors for performance and efficiency.

Abstract

Deep networks can usually depend on extracting more structural information to improve denoising results. However, they may ignore correlation between pixels from an image to pursue better denoising performance. Window transformer can use long- and short-distance modeling to interact pixels to address mentioned problem. To make a tradeoff between distance modeling and denoising time, we propose a heterogeneous window transformer (HWformer) for image denoising. HWformer first designs heterogeneous global windows to capture global context information for improving denoising effects. To build a bridge between long and short-distance modeling, global windows are horizontally and vertically shifted to facilitate diversified information without increasing denoising time. To prevent the information loss phenomenon of independent patches, sparse idea is guided a feed-forward network to extract local information of neighboring patches. The proposed HWformer only takes 30% of popular Restormer in terms of denoising time.

Heterogeneous window transformer for image denoising

TL;DR

HWformer addresses image denoising by bridging long-range pixel interactions and local detail preservation using a heterogeneous window transformer. It introduces heterogeneous global windows (GTEBlock) and a Transformer direction enhancement block (TDEBlock) with horizontal/vertical shifts and a sparse FFN to capture both global context and local neighboring patches without increasing denoising time. The architecture achieves competitive or state-of-the-art performance on synthetic and real-noise datasets while requiring only about 30% of Restormer’s denoising time, making it suitable for mobile devices. Ablation studies validate window size, directional shifts, and sparsity as key factors for performance and efficiency.

Abstract

Deep networks can usually depend on extracting more structural information to improve denoising results. However, they may ignore correlation between pixels from an image to pursue better denoising performance. Window transformer can use long- and short-distance modeling to interact pixels to address mentioned problem. To make a tradeoff between distance modeling and denoising time, we propose a heterogeneous window transformer (HWformer) for image denoising. HWformer first designs heterogeneous global windows to capture global context information for improving denoising effects. To build a bridge between long and short-distance modeling, global windows are horizontally and vertically shifted to facilitate diversified information without increasing denoising time. To prevent the information loss phenomenon of independent patches, sparse idea is guided a feed-forward network to extract local information of neighboring patches. The proposed HWformer only takes 30% of popular Restormer in terms of denoising time.
Paper Structure (12 sections, 1 equation, 6 figures, 12 tables, 3 algorithms)

This paper contains 12 sections, 1 equation, 6 figures, 12 tables, 3 algorithms.

Figures (6)

  • Figure 1: Network architecture of HWformer.
  • Figure 2: Visual comparisons with state-of-the-art methods on gray image denoising. The sample comes from Urban100huang2015single.
  • Figure 3: Visual comparisons with state-of-the-art methods on gray image denoising. The samples come from Urban100huang2015single.
  • Figure 4: XVisual comparisons with state-of-the-art methods on color image denoising. The samples come from Urban100huang2015single.
  • Figure 5: Visual comparisons with state-of-the-art methods on color image denoising. The samples come from Urban100huang2015single.
  • ...and 1 more figures