Ultrasound Image Generation using Latent Diffusion Models
Benoit Freiche, Anthony El-Khoury, Ali Nasiri-Sarvi, Mahdi S. Hosseini, Damien Garcia, Adrian Basarab, Mathieu Boily, Hassan Rivaz
TL;DR
This work tackles the scarcity of publicly available ultrasound data by generating synthetic US images using a fine-tuned latent diffusion model. It leverages Stable Diffusion trained on the BUSI breast US dataset and adds ControlNet to condition outputs on segmentation masks, providing targeted, controllable generation. Qualitative results show realistic anatomy and pathologies, while quantitative experiments demonstrate that synthetic images can improve downstream classification performance, indicating the model captures essential US statistics. The study highlights the potential of synthetic US data for training and evaluation, with future directions including memory-efficient fine-tuning, multi-dataset training, physics-based simulations, and broader community release.
Abstract
Diffusion models for image generation have been a subject of increasing interest due to their ability to generate diverse, high-quality images. Image generation has immense potential in medical imaging because open-source medical images are difficult to obtain compared to natural images, especially for rare conditions. The generated images can be used later to train classification and segmentation models. In this paper, we propose simulating realistic ultrasound (US) images by successive fine-tuning of large diffusion models on different publicly available databases. To do so, we fine-tuned Stable Diffusion, a state-of-the-art latent diffusion model, on BUSI (Breast US Images) an ultrasound breast image dataset. We successfully generated high-quality US images of the breast using simple prompts that specify the organ and pathology, which appeared realistic to three experienced US scientists and a US radiologist. Additionally, we provided user control by conditioning the model with segmentations through ControlNet. We will release the source code at http://code.sonography.ai/ to allow fast US image generation to the scientific community.
