LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation
Weibin Liao, Yinghao Zhu, Xinyuan Wang, Chengwei Pan, Yasha Wang, Liantao Ma
TL;DR
This paper tackles the computational burden of high-capacity segmentation models by introducing LightM-UNet, a lightweight UNet variant that substitutes CNN/Transformer components with Mamba-based blocks to capture global context with linear complexity. The architecture employs Residual Vision Mamba Layers and a Vision State-Space Module within an encoder–bottleneck–decoder framework, enabling deep semantic feature extraction while maintaining a small parameter footprint (~1M). Extensive experiments on 2D Montgomery&Shenzhen and 3D LiTS demonstrate state-of-the-art performance with dramatic reductions in parameters and computation compared to nnU-Net and U-Mamba. The work substantiates the viability of Mamba as a lightweight backbone for medical image segmentation and highlights its potential for mobile health applications.
Abstract
UNet and its variants have been widely used in medical image segmentation. However, these models, especially those based on Transformer architectures, pose challenges due to their large number of parameters and computational loads, making them unsuitable for mobile health applications. Recently, State Space Models (SSMs), exemplified by Mamba, have emerged as competitive alternatives to CNN and Transformer architectures. Building upon this, we employ Mamba as a lightweight substitute for CNN and Transformer within UNet, aiming at tackling challenges stemming from computational resource limitations in real medical settings. To this end, we introduce the Lightweight Mamba UNet (LightM-UNet) that integrates Mamba and UNet in a lightweight framework. Specifically, LightM-UNet leverages the Residual Vision Mamba Layer in a pure Mamba fashion to extract deep semantic features and model long-range spatial dependencies, with linear computational complexity. Extensive experiments conducted on two real-world 2D/3D datasets demonstrate that LightM-UNet surpasses existing state-of-the-art literature. Notably, when compared to the renowned nnU-Net, LightM-UNet achieves superior segmentation performance while drastically reducing parameter and computation costs by 116x and 21x, respectively. This highlights the potential of Mamba in facilitating model lightweighting. Our code implementation is publicly available at https://github.com/MrBlankness/LightM-UNet.
