Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training

Luca Ciampi; Gabriele Lagani; Giuseppe Amato; Fabrizio Falchi

Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training

Luca Ciampi, Gabriele Lagani, Giuseppe Amato, Fabrizio Falchi

TL;DR

The paper tackles label-efficient biomedical image segmentation by introducing a diffusion-inspired semi-supervised framework that fuses a two-pathway, unsupervised teacher pretraining with a cross pseudo-supervision teacher–student co-training loop. It further strengthens pseudo-label quality through multi-round diffusion-based refinements, yielding strong performance on multiple 2D datasets (GlaS, PH2, HMEPS) and a 3D MRI dataset (LA) under limited annotations. Key contributions include cycle-consistency-based unsupervised teacher pretraining, CPS-driven semi-supervised co-training, and iterative diffusion rounds with alignment and reconstruction losses, all validated against SOTA baselines. The method improves data efficiency in medical image segmentation and demonstrates robust applicability to both 2D and 3D data, with code provided for reproducibility.

Abstract

Supervised deep learning for semantic segmentation has achieved excellent results in accurately identifying anatomical and pathological structures in medical images. However, it often requires large annotated training datasets, which limits its scalability in clinical settings. To address this challenge, semi-supervised learning is a well-established approach that leverages both labeled and unlabeled data. In this paper, we introduce a novel semi-supervised teacher-student framework for biomedical image segmentation, inspired by the recent success of generative models. Our approach leverages denoising diffusion probabilistic models (DDPMs) to generate segmentation masks by progressively refining noisy inputs conditioned on the corresponding images. The teacher model is first trained in an unsupervised manner using a cycle-consistency constraint based on noise-corrupted image reconstruction, enabling it to generate informative semantic masks. Subsequently, the teacher is integrated into a co-training process with a twin-student network. The student learns from ground-truth labels when available and from teacher-generated pseudo-labels otherwise, while the teacher continuously improves its pseudo-labeling capabilities. Finally, to further enhance performance, we introduce a multi-round pseudo-label generation strategy that iteratively improves the pseudo-labeling process. We evaluate our approach on multiple biomedical imaging benchmarks, spanning multiple imaging modalities and segmentation tasks. Experimental results show that our method consistently outperforms state-of-the-art semi-supervised techniques, highlighting its effectiveness in scenarios with limited annotated data. The code to replicate our experiments can be found at https://github.com/ciampluca/diffusion_semi_supervised_biomedical_image_segmentation

Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training

TL;DR

Abstract

Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)