Enhancing Medical Image Segmentation via Heat Conduction Equation
Rong Wu, Yim-Sang Yu
TL;DR
This work tackles the challenge of efficient global context modeling in medical image segmentation by introducing a hybrid architecture, UMH, that combines Mamba state-space modules for long-range reasoning with heat conduction–based diffusion in the bottleneck via Heat Conduction Operators. The frequency-domain diffusion is implemented through DCT/IDCT transforms with an adaptively learned diffusion coefficient, achieving a computational profile of $O(N^{1.5})$ while maintaining interpretability. Across multimodal abdominal CT/MRI datasets, UMH delivers state-of-the-art or competitive results in 3D segmentation and demonstrates robustness for boundary refinement and global context propagation. Ablation studies confirm the complementary benefits of Mamba and HCO components, highlighting a scalable pathway that blends physics-inspired diffusion with modern sequence models for biomedical image analysis.
Abstract
Medical image segmentation has been significantly advanced by deep learning architectures, notably U-Net variants. However, existing models struggle to achieve efficient global context modeling and long-range dependency reasoning under practical computational budgets simultaneously. In this work, we propose a novel hybrid architecture utilizing U-Mamba with Heat Conduction Equation. Our model combines Mamba-based state-space modules for efficient long-range reasoning with Heat Conduction Operators (HCOs) in the bottleneck layers, simulating frequency-domain thermal diffusion for enhanced semantic abstraction. Experimental results on multimodal abdominal CT and MRI datasets demonstrate that the proposed model consistently outperforms strong baselines, validating its effectiveness and generalizability. It suggest that blending state-space dynamics with heat-based global diffusion offers a scalable and interpretable solution for medical segmentation tasks.
