MRNet: Multifaceted Resilient Networks for Medical Image-to-Image Translation
Hyojeong Lee, Youngwan Jo, Inpyo Hong, Sanghyun Park
TL;DR
MRNet tackles the challenge of cross-modality medical image translation by integrating SAM-based frequency-aware features into a UNet-like architecture and coupling it with a dual-mask correction mechanism and a multi-term loss. The method leverages a Hierarchical SAM-Based Encoder to fuse SAM features at multiple resolutions and employs a multimask framework to refine feature selection, achieving state-of-the-art results on MRI-to-CT and MRI-to-MRI tasks. Quantitative gains are demonstrated across PSNR and SSIM, with MRNet outperforming strong baselines such as Pix2Pix, CycleGAN, TransUNet, Swin-T, ResViT, and PPT, while ablation confirms the importance of mask count and mask-based loss. The work offers practical implications for improved anatomical fidelity in clinical translation pipelines and suggests directions for efficiency and plane-aware extensions with clinical validation metrics.
Abstract
We propose a Multifaceted Resilient Network(MRNet), a novel architecture developed for medical image-to-image translation that outperforms state-of-the-art methods in MRI-to-CT and MRI-to-MRI conversion. MRNet leverages the Segment Anything Model (SAM) to exploit frequency-based features to build a powerful method for advanced medical image transformation. The architecture extracts comprehensive multiscale features from diverse datasets using a powerful SAM image encoder and performs resolution-aware feature fusion that consistently integrates U-Net encoder outputs with SAM-derived features. This fusion optimizes the traditional U-Net skip connection while leveraging transformer-based contextual analysis. The translation is complemented by an innovative dual-mask configuration incorporating dynamic attention patterns and a specialized loss function designed to address regional mapping mismatches, preserving both the gross anatomy and tissue details. Extensive validation studies have shown that MRNet outperforms state-of-the-art architectures, particularly in maintaining anatomical fidelity and minimizing translation artifacts.
