Table of Contents
Fetching ...

VHU-Net: Variational Hadamard U-Net for Body MRI Bias Field Correction

Xin Zhu, Ahmet Enis Cetin, Gorkem Durak, Batuhan Gundogdu, Ziliang Hong, Hongyi Pan, Ertugrul Aktas, Elif Keles, Hatice Savas, Aytekin Oto, Hiten Patel, Adam B. Murphy, Ashley Ross, Frank Miller, Baris Turkbey, Ulas Bagci

TL;DR

VHU-Net addresses bias field correction in body MRI by projecting data into the Hadamard domain and enforcing spectral sparsity via a variational ELBO objective. The encoder uses ConvHTBlocks to isolate low-frequency bias components, while the decoder employs an inverse HT-reconstructed transformer and a hypernetwork for global, frequency-aware reconstruction of the bias field. The method demonstrates superior intensity uniformity and downstream segmentation performance across abdominal, prostate, and breast MRI datasets, including multi-center evaluations, with fast inference suitable for clinical use. The combination of HT-based feature extraction, Transformer-based inter-channel attention, and variational regularization yields robust, interpretable bias correction with strong generalization capabilities.

Abstract

Bias field artifacts in magnetic resonance imaging (MRI) scans introduce spatially smooth intensity inhomogeneities that degrade image quality and hinder downstream analysis. To address this challenge, we propose a novel variational Hadamard U-Net (VHU-Net) for effective body MRI bias field correction. The encoder comprises multiple convolutional Hadamard transform blocks (ConvHTBlocks), each integrating convolutional layers with a Hadamard transform (HT) layer. Specifically, the HT layer performs channel-wise frequency decomposition to isolate low-frequency components, while a subsequent scaling layer and semi-soft thresholding mechanism suppress redundant high-frequency noise. To compensate for the HT layer's inability to model inter-channel dependencies, the decoder incorporates an inverse HT-reconstructed transformer block, enabling global, frequency-aware attention for the recovery of spatially consistent bias fields. The stacked decoder ConvHTBlocks further enhance the capacity to reconstruct the underlying ground-truth bias field. Building on the principles of variational inference, we formulate a new evidence lower bound (ELBO) as the training objective, promoting sparsity in the latent space while ensuring accurate bias field estimation. Comprehensive experiments on body MRI datasets demonstrate the superiority of VHU-Net over existing state-of-the-art methods in terms of intensity uniformity. Moreover, the corrected images yield substantial downstream improvements in segmentation accuracy. Our framework offers computational efficiency, interpretability, and robust performance across multi-center datasets, making it suitable for clinical deployment.

VHU-Net: Variational Hadamard U-Net for Body MRI Bias Field Correction

TL;DR

VHU-Net addresses bias field correction in body MRI by projecting data into the Hadamard domain and enforcing spectral sparsity via a variational ELBO objective. The encoder uses ConvHTBlocks to isolate low-frequency bias components, while the decoder employs an inverse HT-reconstructed transformer and a hypernetwork for global, frequency-aware reconstruction of the bias field. The method demonstrates superior intensity uniformity and downstream segmentation performance across abdominal, prostate, and breast MRI datasets, including multi-center evaluations, with fast inference suitable for clinical use. The combination of HT-based feature extraction, Transformer-based inter-channel attention, and variational regularization yields robust, interpretable bias correction with strong generalization capabilities.

Abstract

Bias field artifacts in magnetic resonance imaging (MRI) scans introduce spatially smooth intensity inhomogeneities that degrade image quality and hinder downstream analysis. To address this challenge, we propose a novel variational Hadamard U-Net (VHU-Net) for effective body MRI bias field correction. The encoder comprises multiple convolutional Hadamard transform blocks (ConvHTBlocks), each integrating convolutional layers with a Hadamard transform (HT) layer. Specifically, the HT layer performs channel-wise frequency decomposition to isolate low-frequency components, while a subsequent scaling layer and semi-soft thresholding mechanism suppress redundant high-frequency noise. To compensate for the HT layer's inability to model inter-channel dependencies, the decoder incorporates an inverse HT-reconstructed transformer block, enabling global, frequency-aware attention for the recovery of spatially consistent bias fields. The stacked decoder ConvHTBlocks further enhance the capacity to reconstruct the underlying ground-truth bias field. Building on the principles of variational inference, we formulate a new evidence lower bound (ELBO) as the training objective, promoting sparsity in the latent space while ensuring accurate bias field estimation. Comprehensive experiments on body MRI datasets demonstrate the superiority of VHU-Net over existing state-of-the-art methods in terms of intensity uniformity. Moreover, the corrected images yield substantial downstream improvements in segmentation accuracy. Our framework offers computational efficiency, interpretability, and robust performance across multi-center datasets, making it suitable for clinical deployment.

Paper Structure

This paper contains 45 sections, 37 equations, 14 figures, 10 tables.

Figures (14)

  • Figure 1: (a) Observed abdominal MRI image; (b) Bias-corrected image; (c) Bias field; (d–f) Frequency spectra corresponding to (a–c), respectively.
  • Figure 2: (a) VHU-Net architecture; (b) Overview of bias field correction workflow.
  • Figure 3: (a) ConvHTBlock; (b) HT layer
  • Figure 4: Visualization of threshold functions
  • Figure 5: IHT-reconstructed transformer block.
  • ...and 9 more figures