Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation
Feng Zhou, Yanjie Zhou, Longjie Wang, Yun Peng, David E. Carlson, Liyun Tu
TL;DR
This work tackles one-shot medical image segmentation (MIS) with a reconstruction-guided distillation framework. It combines a registration-based data augmentation network with a teacher–student distillation paradigm in which a reconstruction-focused teacher guides a segmentation-focused student through feature alignment and a cosine-similarity-based hint loss. The approach learns anatomy-rich representations from real unlabeled images and uses a lightweight model at inference for efficient deployment. On brain MRI, abdominal CT, and vertebrae CT datasets, it achieves better generalization than state-of-the-art one-shot methods. The results indicate strong potential for robust MIS in data-scarce and cross-modality scenarios, with implications for broader clinical applicability and downstream diagnostic tasks.
Abstract
Traditional one-shot medical image segmentation (MIS) methods use registration networks to propagate labels from a reference atlas or rely on comprehensive sampling strategies to generate synthetic labeled data for training. However, these methods often struggle with registration errors and low-quality synthetic images, leading to poor performance and generalization. To overcome these limitations, we introduce a novel one-shot MIS framework based on knowledge distillation, which allows the network to directly 'see' real images through a distillation process guided by image reconstruction. The framework focuses on the anatomical structures in a single labeled image and a few unlabeled images. A registration-based data augmentation network creates realistic labeled samples, while a feature distillation module, guided by the teacher network, helps the student network learn segmentation from these samples. During inference, the streamlined student network accurately segments new images. Evaluations on three public datasets (OASIS for T1 brain MRI, BCV for abdomen CT, and VerSe for vertebrae CT) show superior segmentation performance and generalization across different medical image datasets and modalities compared to leading methods. Our code is available at https://github.com/NoviceFodder/OS-MedSeg.
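
To make the feature-distillation idea concrete, the following is a minimal sketch of a cosine-similarity-based hint loss between teacher and student features. The exact formulation used in this work is not given in this section, so the form below (one minus cosine similarity, averaged over paired feature vectors) is an assumption, and the names `cosine_similarity` and `hint_loss` are hypothetical. Pure Python is used only to keep the example self-contained; a real implementation would operate on framework tensors.

```python
import math


def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # eps guards against division by zero for all-zero vectors.
    return dot / max(norm_a * norm_b, eps)


def hint_loss(student_feats, teacher_feats):
    """Assumed hint loss: mean of (1 - cosine similarity) over paired vectors.

    Each argument is a list of flattened feature vectors, one per sample
    (or per spatial location). The loss is 0 when every student vector points
    in the same direction as its teacher counterpart, and 2 when opposite.
    """
    if len(student_feats) != len(teacher_feats):
        raise ValueError("student and teacher must provide the same number of vectors")
    losses = [
        1.0 - cosine_similarity(s, t)
        for s, t in zip(student_feats, teacher_feats)
    ]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    teacher = [[1.0, 0.0, 2.0], [0.5, 0.5, 0.0]]
    student = [[2.0, 0.0, 4.0], [0.0, 0.0, 1.0]]
    print(hint_loss(student, teacher))
```

Because cosine similarity ignores vector magnitude, a loss of this form penalizes only directional disagreement between student and teacher features. This is one plausible reason to prefer it over a plain L2 hint when the two networks are trained for different objectives (reconstruction versus segmentation).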
