MedicoSAM: Robust Improvement of SAM for Medical Imaging
Anwai Archit, Luca Freckmann, Constantin Pape
TL;DR
This work systematically evaluates how finetuning Segment Anything Model (SAM) on large medical datasets affects interactive and semantic segmentation across 2D and 3D modalities. It introduces MedicoSAM, a finetuned model with a full iterative training objective that balances box and mask prompts, achieving robust improvements in interactive segmentation while preserving compatibility with annotation tools. The study also explores domain-specific pretraining for semantic segmentation, finding modest gains that sometimes lag behind strong 3D baselines like nnU-Net. Publicly releasing MedicoSAM, the paper highlights practical pathways to adapt foundation models to medical imaging while stressing the importance of maintaining tool interoperability for real-world annotation workflows.
Abstract
Medical image segmentation is an important analysis task in clinical practice and research. Deep learning has massively advanced the field, but current approaches are mostly based on models trained for a specific task. Training such models or adapting them to a new condition is costly due to the need for (manually) labeled data. The emergence of vision foundation models, especially Segment Anything, offers a path to universal segmentation for medical images, overcoming these issues. Here, we study how to improve Segment Anything for medical images by comparing different finetuning strategies on a large and diverse dataset. We evaluate the finetuned models on a wide range of interactive and (automatic) semantic segmentation tasks. We find that the performance can be clearly improved for interactive segmentation. However, semantic segmentation does not benefit from pretraining on medical images. Our best model, MedicoSAM, is publicly available at https://github.com/computational-cell-analytics/medico-sam. We show that it is compatible with existing tools for data annotation and believe that it will be of great practical value.
