CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer
Feiyang Jia, Zhineng Chen, Ziying Song, Lin Liu, Caiyan Jia
TL;DR
CWT-Net tackles the challenge of preserving multi-scale structural information in histopathology image super-resolution by coupling a dedicated SR branch with a Wavelet Transform branch that extracts cross-scale high-frequency features. A Transformer module enables cross-scale fusion, guided by a Wavelet Reconstruction (WR) block so WT information can be leveraged during training and testing, and a new MLCamSR dataset provides cross-scale, undegraded information for robust learning. Empirical results show state-of-the-art PSNR/SSIM gains and qualitative improvements in high-frequency detail, with demonstrated benefits to downstream diagnostic classification. This framework offers a practical path for pathology image enhancement and potential pre-training priors for related medical imaging tasks.
Abstract
Super-resolution (SR) aims to enhance the quality of low-resolution images and has been widely applied in medical imaging. We found that the design principles of most existing methods are influenced by SR tasks based on real-world images and do not take into account the significance of the multi-level structure in pathological images, even if they can achieve respectable objective metric evaluations. In this work, we delve into two super-resolution working paradigms and propose a novel network called CWT-Net, which leverages cross-scale image wavelet transform and Transformer architecture. Our network consists of two branches: one dedicated to learning super-resolution and the other to high-frequency wavelet features. To generate high-resolution histopathology images, the Transformer module shares and fuses features from both branches at various stages. Notably, we have designed a specialized wavelet reconstruction module to effectively enhance the wavelet domain features and enable the network to operate in different modes, allowing for the introduction of additional relevant information from cross-scale images. Our experimental results demonstrate that our model significantly outperforms state-of-the-art methods in both performance and visualization evaluations and can substantially boost the accuracy of image diagnostic networks.
