No-Clean-Reference Image Super-Resolution: Application to Electron Microscopy
Mohammad Khateri, Morteza Ghahremani, Alejandra Sierra, Jussi Tohka
TL;DR
This work tackles the challenge of obtaining clean, high-resolution 3D electron microscopy images over large brain volumes by proposing EMSR, a deep-learning SR framework that can be trained with no-clean references. EMSR integrates edge-attention and Vision Transformer blocks with a noise-robust, self-supervised design and uses a multi-term loss to learn from corrupted LR/HR pairs as well as denoised references. Through experiments on nine brain datasets, EMSR demonstrates competitive or superior performance to several SR baselines, with the ability to train effectively using real, denoised, or synthetically degraded pairs. The approach enables faster, less costly EM imaging while providing high-quality reconstructions suitable for downstream neuroanatomical analysis, and highlights the potential of synthetic degradations to augment EM SR under matched conditions.
Abstract
The inability to acquire clean high-resolution (HR) electron microscopy (EM) images over a large brain tissue volume hampers many neuroscience studies. To address this challenge, we propose a deep-learning-based image super-resolution (SR) approach to computationally reconstruct clean HR 3D-EM with a large field of view (FoV) from noisy low-resolution (LR) acquisition. Our contributions are I) Investigating training with no-clean references for $\ell_2$ and $\ell_1$ loss functions; II) Introducing a novel network architecture, named EMSR, for enhancing the resolution of LR EM images while reducing inherent noise; and, III) Comparing different training strategies including using acquired LR and HR image pairs, i.e., real pairs with no-clean references contaminated with real corruptions, the pairs of synthetic LR and acquired HR, as well as acquired LR and denoised HR pairs. Experiments with nine brain datasets showed that training with real pairs can produce high-quality super-resolved results, demonstrating the feasibility of training with non-clean references for both loss functions. Additionally, comparable results were observed, both visually and numerically, when employing denoised and noisy references for training. Moreover, utilizing the network trained with synthetically generated LR images from HR counterparts proved effective in yielding satisfactory SR results, even in certain cases, outperforming training with real pairs. The proposed SR network was compared quantitatively and qualitatively with several established SR techniques, showcasing either the superiority or competitiveness of the proposed method in mitigating noise while recovering fine details.
