Self-supervised Brain Lesion Generation for Effective Data Augmentation of Medical Images
Jiayu Huo, Sebastien Ourselin, Rachel Sparks
TL;DR
The paper tackles data scarcity in brain lesion segmentation by introducing a three-stage framework that combines self-supervised lesion synthesis, Soft Poisson Blending for seamless lesion insertion, and prototype-consistency regularization to align real and synthetic lesion features. It leverages a two-stage adversarial autoencoder to generate plausible lesion masks and textures, uses SPB to produce realistic composites, and enforces feature alignment with prototype losses during segmentation training. Across ATLAS v2.0 and Shift MS datasets, the approach achieves state-of-the-art segmentation performance and excels on small datasets where conventional augmentation struggles, including a Dice improvement from 50.36% to 60.23% on ATLAS v2.0. The work demonstrates that high-fidelity synthetic data, when properly blended and aligned in feature space, can substantially boost brain lesion segmentation and generalization, with potential for extension to other modalities and organs.
Abstract
Accurate brain lesion delineation is important for planning neurosurgical treatment. Automatic brain lesion segmentation methods based on convolutional neural networks have demonstrated remarkable performance. However, neural network performance is constrained by the lack of large-scale well-annotated training datasets. In this manuscript, we propose a comprehensive framework to efficiently generate new samples for training a brain lesion segmentation model. We first train a lesion generator, based on an adversarial autoencoder, in a self-supervised manner. Next, we utilize a novel image composition algorithm, Soft Poisson Blending, to seamlessly combine synthetic lesions and brain images to obtain training samples. Finally, to effectively train the brain lesion segmentation model with augmented images we introduce a new prototype consistence regularization to align real and synthetic features. Our framework is validated by extensive experiments on two public brain lesion segmentation datasets: ATLAS v2.0 and Shift MS. Our method outperforms existing brain image data augmentation schemes. For instance, our method improves the Dice from 50.36% to 60.23% compared to the U-Net with conventional data augmentation techniques for the ATLAS v2.0 dataset.
