Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima, Jan Van Eijgen, Lennert Beeckmans, Thomas Jacobs, Moti Freiman, Luis Filipe Nakayama, Ingeborg Stalmans, Chaim Baskin, Joachim A. Behar
TL;DR
This work tackles generalization gaps in retinal vessel segmentation due to limited annotated data by introducing RLAD, a diffusion-based framework that generates layout-conditioned retinal fundus images. RLAD operates in a frozen VAE latent space and conditions generation on multiple retinal structures (AV, CD, L) extracted from real images, enabling realistic, controllable data augmentation for AV segmentation. The approach yields robustness gains across backbones (up to 8.1% in reported metrics) and introduces REYIA, a large AV-segmented dataset of 586 images, with code and data to support reproducibility. Overall, RLAD demonstrates that targeted synthetic data can substantially improve segmentation generalization and paves the way for extending layout-aware diffusion to other medical imaging tasks.
Abstract
Generalization in medical segmentation models is challenging due to limited annotated datasets and imaging variability. To address this, we propose Retinal Layout-Aware Diffusion (RLAD), a novel diffusion-based framework for generating controllable layout-aware images. RLAD conditions image generation on multiple key layout components extracted from real images, ensuring high structural fidelity while enabling diversity in other components. Applied to retinal fundus imaging, we augmented the training datasets by synthesizing paired retinal images and vessel segmentations conditioned on extracted blood vessels from real images, while varying other layout components such as lesions and the optic disc. Experiments demonstrated that RLAD-generated data improved generalization in retinal vessel segmentation by up to 8.1%. Furthermore, we present REYIA, a comprehensive dataset comprising 586 manually segmented retinal images. To foster reproducibility and drive innovation, both our code and dataset will be made publicly accessible.
