CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation

Elay Dahan; Hedda Cohen Indelman; Angeles M. Perez-Agosto; Carmit Shiran; Gopal Avinash; Doron Shaked; Nati Daniel

CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation

Elay Dahan, Hedda Cohen Indelman, Angeles M. Perez-Agosto, Carmit Shiran, Gopal Avinash, Doron Shaked, Nati Daniel

TL;DR

This work introduces Context-Semantic Guidance (CSG), a dual-conditioning diffusion framework for de novo musculoskeletal ultrasound image generation that jointly controls anatomy via semantic masks and texture via context guidance. By combining a fine-tuned StyleGAN mask generator with context-aware texture selection and a paired latent diffusion translator, CSG produces high-fidelity images including pathological findings. Three-fold validation shows improved segmentation performance, higher fidelity to real images (lower FID and related metrics), and realistic appearance in Turing tests, relative to prior methods. An extension enables text-guided geometry editing and texture augmentation to broaden the variability space, potentially enhancing robustness of ultrasound AI systems.

Abstract

The use of synthetic images in medical imaging Artificial Intelligence (AI) solutions has been shown to be beneficial in addressing the limited availability of diverse, unbiased, and representative data. Despite the extensive use of synthetic image generation methods, controlling the semantics variability and context details remains challenging, limiting their effectiveness in producing diverse and representative medical image datasets. In this work, we introduce a scalable semantic and context-conditioned generative model, coined CSG (Context-Semantic Guidance). This dual conditioning approach allows for comprehensive control over both structure and appearance, advancing the synthesis of realistic and diverse ultrasound images. We demonstrate the ability of CSG to generate findings (pathological anomalies) in musculoskeletal (MSK) ultrasound images. Moreover, we test the quality of the synthetic images using a three-fold validation protocol. The results show that the synthetic images generated by CSG improve the performance of semantic segmentation models, exhibit enhanced similarity to real images compared to the baseline methods, and are undistinguishable from real images according to a Turing test. Furthermore, we demonstrate an extension of the CSG that allows enhancing the variability space of images by synthetically generating augmentations of anatomical geometries and textures.

CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation

TL;DR

Abstract

CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)