ModelMix: A New Model-Mixup Strategy to Minimize Vicinal Risk across Tasks for Few-scribble based Cardiac Segmentation
Ke Zhang, Vishal M. Patel
TL;DR
The paper tackles the annotation bottleneck in medical image segmentation by leveraging few-scribble labels and exploiting cross-task correlations. It introduces ModelMix, a model-mixup strategy that linearly interpolates encoder parameters from task-specific models to form virtual mixed models, guided by vicinal risk minimization. Two vicinal regularization terms are proposed: an unsupervised consistency term and a scribble-supervised term, plus an invariant loss on image-level augmentations. On three cardiac MRI datasets (ACDC, MSCMRseg, MyoPS), ModelMix achieves state-of-the-art performance under few-scribble supervision, demonstrating effective cross-task priors and robustness.
Abstract
Pixel-level dense labeling is both resource-intensive and time-consuming, whereas weak labels such as scribble present a more feasible alternative to full annotations. However, training segmentation networks with weak supervision from scribbles remains challenging. Inspired by the fact that different segmentation tasks can be correlated with each other, we introduce a new approach to few-scribble supervised segmentation based on model parameter interpolation, termed as ModelMix. Leveraging the prior knowledge that linearly interpolating convolution kernels and bias terms should result in linear interpolations of the corresponding feature vectors, ModelMix constructs virtual models using convex combinations of convolutional parameters from separate encoders. We then regularize the model set to minimize vicinal risk across tasks in both unsupervised and scribble-supervised way. Validated on three open datasets, i.e., ACDC, MSCMRseg, and MyoPS, our few-scribble guided ModelMix significantly surpasses the performance of the state-of-the-art scribble supervised methods.
