EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation

Hadrien Reynaud; Alberto Gomez; Paul Leeson; Qingjie Meng; Bernhard Kainz

TL;DR

EchoFlow introduces a privacy-preserving pipeline for cardiac ultrasound synthesis. It learns a domain-specific latent space with an adversarial variational autoencoder, then generates both images and videos through latent flow matching. A latent re-identification module filters generated images by anatomy to prevent leakage of real patient data, while downstream ejection fraction (EF) regression demonstrates that models trained exclusively on EchoFlow synthetic data can match real-data performance. The framework is validated across multiple public echocardiogram datasets, showing that scaling model size and training time closes the gap between synthetic and real data in clinical tasks. By releasing both models and synthetic datasets, EchoFlow provides a foundation for privacy-compliant research in medical ultrasound and points toward broader use of synthetic data in healthcare AI.
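
The re-identification screening described above can be illustrated with a minimal sketch. This is not the authors' implementation: the summary only states that a latent re-identification model filters generated images. The cosine-similarity measure, the `threshold` value, and the function name `privacy_filter` are all assumptions made for illustration, and the toy 2-D vectors stand in for embeddings that a trained model would produce.

```python
import numpy as np

def privacy_filter(synthetic, real, threshold=0.9):
    """Return a boolean mask that is True where a synthetic embedding is kept.

    Hypothetical rule (an assumption, not from the paper): reject a synthetic
    sample when its cosine similarity to ANY real embedding reaches `threshold`.
    """
    # Normalise rows to unit length so dot products are cosine similarities.
    s = synthetic / np.linalg.norm(synthetic, axis=1, keepdims=True)
    r = real / np.linalg.norm(real, axis=1, keepdims=True)
    # Highest similarity of each synthetic sample to any real sample.
    max_sim = (s @ r.T).max(axis=1)
    return max_sim < threshold

# Toy embeddings: the first synthetic vector nearly duplicates a real one,
# the second sits halfway between the two real vectors.
real = np.array([[1.0, 0.0], [0.0, 1.0]])
synthetic = np.array([[1.0, 0.01], [1.0, 1.0]])
mask = privacy_filter(synthetic, real)
kept = synthetic[mask]
```

In this toy setup the near-duplicate is rejected and the second sample is kept. In practice the threshold would need to be calibrated on real data rather than fixed by hand.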

Abstract

Advances in deep learning have significantly enhanced medical image analysis, yet the availability of large-scale medical datasets remains constrained by patient privacy concerns. We present EchoFlow, a novel framework designed to generate high-quality, privacy-preserving synthetic echocardiogram images and videos. EchoFlow comprises four key components: an adversarial variational autoencoder for defining an efficient latent representation of cardiac ultrasound images, a latent image flow matching model for generating accurate latent echocardiogram images, a latent re-identification model to ensure privacy by filtering images anatomically, and a latent video flow matching model for animating latent images into realistic echocardiogram videos conditioned on ejection fraction. We rigorously evaluate our synthetic datasets on the clinically relevant task of ejection fraction regression and demonstrate, for the first time, that downstream models trained exclusively on EchoFlow-generated synthetic datasets achieve performance parity with models trained on real datasets. We release our models and synthetic datasets, enabling broader, privacy-compliant research in medical ultrasound imaging at https://huggingface.co/spaces/HReynaud/EchoFlow.
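
To make the idea of sampling with flow matching concrete, the following is a minimal sketch of the generic technique, not the EchoFlow code. It integrates a velocity field from noise at t = 0 to a sample at t = 1 with fixed-step Euler updates. The Euler solver, the step count, the function names, and the closed-form toy velocity field (which assumes a straight-line path to a single fixed `target`) are all assumptions; in a real system the velocity would come from a trained network, optionally conditioned on quantities such as ejection fraction.

```python
import numpy as np

def euler_sample(velocity, x0, steps=100):
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 with Euler steps."""
    x = x0.copy()
    dt = 1.0 / steps
    for i in range(steps):
        t = i * dt
        x = x + dt * velocity(x, t)
    return x

# Toy stand-in for a learned model (an assumption for illustration):
# for a straight-line path x_t = (1 - t) * x0 + t * target, the velocity
# that carries any point x at time t to `target` is (target - x) / (1 - t).
target = np.array([2.0, -1.0, 0.5])

def toy_velocity(x, t):
    return (target - x) / (1.0 - t)

rng = np.random.default_rng(0)
noise = rng.standard_normal(3)        # starting point at t = 0
sample = euler_sample(toy_velocity, noise)
```

Because the toy trajectories are straight lines, the Euler integration lands on `target` up to floating-point error. A learned velocity field generally has curved trajectories, so the number of steps trades off speed against sample quality.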
