Deterministic Medical Image Translation via High-fidelity Brownian Bridges
Qisheng He, Nicholas Summerfield, Peiyong Wang, Carri Glide-Hurst, Ming Dong
TL;DR
The paper addresses the need for deterministic, high-fidelity medical image translation. It introduces HiFi-BBrg, a dual-mapping framework combining a conditional Brownian bridge generator Γ and a reconstruction cGAN Π, trained with a diffusion loss, fidelity loss, and adversarial loss to ensure reversibility and high fidelity. The forward process follows the Brownian bridge dynamics $dX = -\frac{X-X_T}{1-t}\,dt + 2\sqrt{1-t}\,dW(t)$, and sampling reduces to a deterministic one-step $X_T - \epsilon_\theta(X_T,X_T,T)$, yielding low-variance trajectories. Empirical results on iSEG 2017, BraTS 2018, and Prostate MRI demonstrate state-of-the-art performance in multi-modal translation and multi-image super-resolution, with deterministic outputs. The work highlights potential extensions to unpaired data through bilateral HiFi-BBrg architectures for broader clinical adoption.
Abstract
Recent studies have shown that diffusion models produce superior synthetic images when compared to Generative Adversarial Networks (GANs). However, their outputs are often non-deterministic and lack high fidelity to the ground truth due to the inherent randomness. In this paper, we propose a novel High-fidelity Brownian bridge model (HiFi-BBrg) for deterministic medical image translations. Our model comprises two distinct yet mutually beneficial mappings: a generation mapping and a reconstruction mapping. The Brownian bridge training process is guided by the fidelity loss and adversarial training in the reconstruction mapping. This ensures that translated images can be accurately reversed to their original forms, thereby achieving consistent translations with high fidelity to the ground truth. Our extensive experiments on multiple datasets show HiFi-BBrg outperforms state-of-the-art methods in multi-modal image translation and multi-image super-resolution.
