Generative Autoregressive Transformers for Model-Agnostic Federated MRI Reconstruction
Valiyeh A. Nezhad, Gokberk Elmas, Bilal Kabas, Fuat Arslan, Emine U. Saritas, Tolga Çukur
TL;DR
This work addresses the challenge of cross-site generalization in MRI reconstruction under privacy constraints by proposing FedGAT, a model-agnostic federated learning framework. It decouples collaborative learning into a federated GAT prior, built from a frozen VAE and a site-conditioned autoregressive transformer, and a second tier where each site trains a reconstruction model on a mix of local and synthetic data generated by the prior. Site prompts and multi-scale autoregression enable controlled, high-fidelity synthesis across sites, while augmentation with synthetic data improves generalization without sharing raw data. Experiments on multi-institutional datasets demonstrate that FedGAT surpasses state-of-the-art FL baselines in both within-site and cross-site reconstruction, validating its ability to support heterogeneous architectures and scalable collaboration in privacy-preserving MRI studies.
Abstract
While learning-based models hold great promise for MRI reconstruction, single-site models trained on limited local datasets often show poor generalization. This has motivated collaborative training across institutions via federated learning (FL)-a privacy-preserving framework that aggregates model updates instead of sharing raw data. Conventional FL requires architectural homogeneity, restricting sites from using models tailored to their resources or needs. To address this limitation, we propose FedGAT, a model-agnostic FL technique that first collaboratively trains a global generative prior for MR images, adapted from a natural image foundation model composed of a variational autoencoder (VAE) and a transformer that generates images via spatial-scale autoregression. We fine-tune the transformer module after injecting it with a lightweight site-specific prompting mechanism, keeping the VAE frozen, to efficiently adapt the model to multi-site MRI data. In a second tier, each site independently trains its preferred reconstruction model by augmenting local data with synthetic MRI data from other sites, generated by site-prompting the tuned prior. This decentralized augmentation improves generalization while preserving privacy. Experiments on multi-institutional datasets show that FedGAT outperforms state-of-the-art FL baselines in both within- and cross-site reconstruction performance under model-heterogeneous settings.
