PolypFlow: Reinforcing Polyp Segmentation with Flow-Driven Dynamics
Pu Wang, Huaizhi Ma, Zhihua Zhang, Zhuoran Zheng
TL;DR
PolypFlow tackles the challenge of segmenting polyps with irregular shapes and boundaries by introducing a flow-matching refinement that solves the ODE $d\phi_t/dt = v_t(\phi_t)$ to progressively align coarse predictions with ground truth. The vector field $v$ blends self-attention-driven weighting with frequency-domain priors (DCT) and is integrated with U-Net features to produce an interpretable, trajectory-based refinement over $N=10$ steps, guided by a loss that combines weighted IoU and BCE terms. This approach yields boundary-aware robustness and clear visualization of the refinement process, and achieves state-of-the-art performance across five benchmarks with strong generalization to unseen datasets. The results indicate substantial gains from incorporating flow dynamics and frequency-aware features, suggesting practical impact for reliable, edge-friendly polyp segmentation in diverse clinical conditions.
Abstract
Accurate polyp segmentation remains challenging due to irregular lesion morphologies, ambiguous boundaries, and heterogeneous imaging conditions. While U-Net variants excel at local feature fusion, they often lack explicit mechanisms to model the dynamic evolution of segmentation confidence under uncertainty. Inspired by the interpretable nature of flow-based models, we present \textbf{PolypFLow}, a flow-matching enhanced architecture that injects physics-inspired optimization dynamics into segmentation refinement. Unlike conventional cascaded networks, our framework solves an ordinary differential equation (ODE) to progressively align coarse initial predictions with ground truth masks through learned velocity fields. This trajectory-based refinement offers two key advantages: 1) Interpretable Optimization: Intermediate flow steps visualize how the model corrects under-segmented regions and sharpens boundaries at each ODE-solver iteration, demystifying the ``black-box" refinement process; 2) Boundary-Aware Robustness: The flow dynamics explicitly model gradient directions along polyp edges, enhancing resilience to low-contrast regions and motion artifacts. Numerous experimental results show that PolypFLow achieves a state-of-the-art while maintaining consistent performance in different lighting scenarios.
