Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis

Tao Song; Yicheng Wu; Minhao Hu; Xiangde Luo; Linda Wei; Guotai Wang; Yi Guo; Feng Xu; Shaoting Zhang

Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis

Tao Song, Yicheng Wu, Minhao Hu, Xiangde Luo, Linda Wei, Guotai Wang, Yi Guo, Feng Xu, Shaoting Zhang

TL;DR

This work targets the challenge of synthesizing missing MRI modalities from multi-modal data under imperfect cross-modality alignment. It introduces AGI-Net, a plug-in convolutional framework featuring Cross Group Attention and Group-wise Rolling to model intra- and inter-modality relationships and adapt convolutional kernels per modality group. Through extensive experiments on IXI and BraTS2023, AGI-Net achieves state-of-the-art results across 2D and 3D multimodal synthesis tasks, demonstrates robustness to misalignment, and demonstrates improved brain-tumor segmentation when using synthesized modalities. The approach offers a scalable, efficient path to better multimodal MRI synthesis with practical implications for clinical workflows.

Abstract

Multimodal MR image synthesis aims to generate missing modality images by effectively fusing and mapping from a subset of available MRI modalities. Most existing methods adopt an image-to-image translation paradigm, treating multiple modalities as input channels. However, these approaches often yield sub-optimal results due to the inherent difficulty in achieving precise feature- or semantic-level alignment across modalities. To address these challenges, we propose an Adaptive Group-wise Interaction Network (AGI-Net) that explicitly models both inter-modality and intra-modality relationships for multimodal MR image synthesis. Specifically, feature channels are first partitioned into predefined groups, after which an adaptive rolling mechanism is applied to conventional convolutional kernels to better capture feature and semantic correspondences between different modalities. In parallel, a cross-group attention module is introduced to enable effective feature fusion across groups, thereby enhancing the network's representational capacity. We validate the proposed AGI-Net on the publicly available IXI and BraTS2023 datasets. Experimental results demonstrate that AGI-Net achieves state-of-the-art performance in multimodal MR image synthesis tasks, confirming the effectiveness of its modality-aware interaction design. We release the relevant code at: https://github.com/zunzhumu/Adaptive-Group-wise-Interaction-Network-for-Multimodal-MRI-Synthesis.git.

Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis

TL;DR

Abstract

Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)