Table of Contents
Fetching ...

A Deep-Learning-Based Label-free No-Reference Image Quality Assessment Metric: Application in Sodium MRI Denoising

Shuaiyu Yuan, Tristan Whitmarsh, Dimitri A Kessler, Otso Arponen, Mary A McLean, Gabrielle Baxter, Frank Riemer, Aneurin J Kennerley, William J Brackenbury, Fiona J Gilbert, Joshua D Kaggie

TL;DR

This work tackles the lack of ground-truth references for evaluating sodium MRI denoising by introducing Model Specialization Metric (MSM), a label-free no-reference IQA method that leverages the degradation in DL model predictions when input data diverges from training distributions. MSM computes the distance between an input image and the model’s prediction, using backbones such as U-net and SwinIR, and is trained on clean images to map ground truth to itself. Across proton T$_1$w and sodium MRI datasets, MSM demonstrates competitive SRCC/PLCC performance against FR- and NR-IQA baselines and shows substantial agreement with expert judgments for denoised sodium MR images, particularly with the SwinIR backbone (average $\kappa$ ≈ 0.653). The results support MSM as a practical, ground-truth-free tool for QC of denoised sodium MRI, with implications for clinical reliability and workflow efficiency, and motivate future hybrid architectures to further improve robustness across distortions.

Abstract

New multinuclear MRI techniques, such as sodium MRI, generally suffer from low image quality due to an inherently low signal. Postprocessing methods, such as image denoising, have been developed for image enhancement. However, the assessment of these enhanced images is challenging especially considering when there is a lack of high resolution and high signal images as reference, such as in sodium MRI. No-reference Image Quality Assessment (NR-IQA) metrics are approaches to solve this problem. Existing learning-based NR-IQA metrics rely on labels derived from subjective human opinions or metrics like Signal-to-Noise Ratio (SNR), which are either time-consuming or lack accurate ground truths, resulting in unreliable assessment. We note that deep learning (DL) models have a unique characteristic in that they are specialized to a characteristic training set, meaning that deviations between the input testing data from the training data will reduce prediction accuracy. Therefore, we propose a novel DL-based NR-IQA metric, the Model Specialization Metric (MSM), which does not depend on ground-truth images or labels. MSM measures the difference between the input image and the model's prediction for evaluating the quality of the input image. Experiments conducted on both simulated distorted proton T1-weighted MR images and denoised sodium MR images demonstrate that MSM exhibits a superior evaluation performance on various simulated noises and distortions. MSM also has a substantial agreement with the expert evaluations, achieving an averaged Cohen's Kappa coefficient of 0.6528, outperforming the existing NR-IQA metrics.

A Deep-Learning-Based Label-free No-Reference Image Quality Assessment Metric: Application in Sodium MRI Denoising

TL;DR

This work tackles the lack of ground-truth references for evaluating sodium MRI denoising by introducing Model Specialization Metric (MSM), a label-free no-reference IQA method that leverages the degradation in DL model predictions when input data diverges from training distributions. MSM computes the distance between an input image and the model’s prediction, using backbones such as U-net and SwinIR, and is trained on clean images to map ground truth to itself. Across proton Tw and sodium MRI datasets, MSM demonstrates competitive SRCC/PLCC performance against FR- and NR-IQA baselines and shows substantial agreement with expert judgments for denoised sodium MR images, particularly with the SwinIR backbone (average ≈ 0.653). The results support MSM as a practical, ground-truth-free tool for QC of denoised sodium MRI, with implications for clinical reliability and workflow efficiency, and motivate future hybrid architectures to further improve robustness across distortions.

Abstract

New multinuclear MRI techniques, such as sodium MRI, generally suffer from low image quality due to an inherently low signal. Postprocessing methods, such as image denoising, have been developed for image enhancement. However, the assessment of these enhanced images is challenging especially considering when there is a lack of high resolution and high signal images as reference, such as in sodium MRI. No-reference Image Quality Assessment (NR-IQA) metrics are approaches to solve this problem. Existing learning-based NR-IQA metrics rely on labels derived from subjective human opinions or metrics like Signal-to-Noise Ratio (SNR), which are either time-consuming or lack accurate ground truths, resulting in unreliable assessment. We note that deep learning (DL) models have a unique characteristic in that they are specialized to a characteristic training set, meaning that deviations between the input testing data from the training data will reduce prediction accuracy. Therefore, we propose a novel DL-based NR-IQA metric, the Model Specialization Metric (MSM), which does not depend on ground-truth images or labels. MSM measures the difference between the input image and the model's prediction for evaluating the quality of the input image. Experiments conducted on both simulated distorted proton T1-weighted MR images and denoised sodium MR images demonstrate that MSM exhibits a superior evaluation performance on various simulated noises and distortions. MSM also has a substantial agreement with the expert evaluations, achieving an averaged Cohen's Kappa coefficient of 0.6528, outperforming the existing NR-IQA metrics.
Paper Structure (26 sections, 9 equations, 3 figures, 5 tables)

This paper contains 26 sections, 9 equations, 3 figures, 5 tables.

Figures (3)

  • Figure 1: (Left) An example of Model Specialization. A reference image was distorted using Gaussian noises with varying noise levels from 0 to 0.25. The graph plots the PSNR values of U-net denoised results against the noise levels of the corresponding noisy images. (Right) A graph showing the association between the noise levels of input image and the PSNR value difference between the input and the denoised images. The legend $\sigma$ indicates the noise levels at which the U-net was trained, specifically 0, 0.05, 0.10, 0.15, and 0.20.
  • Figure 2: Examples of (a) proton T$_1$w MR image with additional Gaussian and Rician noise, with standard deviations of 0.05, 0.10 and 0.15, and blurred using a Gaussian and a motion blur filter with filter sizes of 3, 7 and 11; (b) Synthetic MR images generated from DDPM at timesteps of 40, 30, 20, 10 and 0.
  • Figure 3: Examples of an authentic sodium MR image and corresponding denoising results of median filter, BM3D, DnCNN, U-net and ResUnet.