Model Agnostic Preference Optimization for Medical Image Segmentation
Yunseong Nam, Jiwon Jang, Dongkyu Won, Sang Hyun Park, Soopil Kim
TL;DR
The paper tackles domain shifts and data scarcity in medical image segmentation by proposing MAPO, a model-agnostic, dropout-driven preference optimization framework. MAPO generates diverse predictions via dropout, creates preference pairs online, and optimizes with a Direct Preference Optimization objective combined with standard segmentation losses, ensuring stable training. Across multiple 2D and 3D datasets and architectures (CNNs, Transformers, hybrids), MAPO yields consistent Dice improvements and reduced boundary errors (ASD), while also stabilizing optimization on challenging datasets. The approach eliminates reliance on architecture-specific sampling techniques and demonstrates strong practical potential for robust, generalizable medical image segmentation.
Abstract
Preference optimization offers a scalable supervision paradigm based on relative preference signals, yet prior attempts in medical image segmentation remain model-specific and rely on low-diversity prediction sampling. In this paper, we propose MAPO (Model-Agnostic Preference Optimization), a training framework that utilizes Dropout-driven stochastic segmentation hypotheses to construct preference-consistent gradients without direct ground-truth supervision. MAPO is fully architecture- and dimensionality-agnostic, supporting 2D/3D CNN and Transformer-based segmentation pipelines. Comprehensive evaluations across diverse medical datasets reveal that MAPO consistently enhances boundary adherence, reduces overfitting, and yields more stable optimization dynamics compared to conventional supervised training.
