CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection
Xi Zhao, Wei Feng, Zheng Zhang, Jingjing Lv, Xin Zhu, Zhangang Lin, Jinghe Hu, Jingping Shao
TL;DR
CBNet introduces a lightweight plug-and-play framework for segmentation-based scene text detection that improves kernel segmentation through a global–local context module and reconstructs text boundaries with a boundary-guided expansion driven by a learnable distance map. The global context models cross-instance pixel relationships, while the local context uses per-text-distance cues to refine segmentation; together they produce a stronger text kernel. The boundary-guided expansion uses a predicted distance map to adaptively grow contours, achieving favorable accuracy–speed trade-offs and simpler post-processing. Across curve, multi-oriented, and multilingual benchmarks, CBNet consistently improves performance with minimal parameter overhead, demonstrating strong generalization and practical impact for real-time text detection systems.
Abstract
Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion. However, the segmentation process only considers each pixel independently, and the expansion process is difficult to achieve a favorable accuracy-speed trade-off. In this paper, we propose a Context-aware and Boundary-guided Network (CBN) to tackle these problems. In CBN, a basic text detector is firstly used to predict initial segmentation results. Then, we propose a context-aware module to enhance text kernel feature representations, which considers both global and local contexts. Finally, we introduce a boundary-guided module to expand enhanced text kernels adaptively with only the pixels on the contours, which not only obtains accurate text boundaries but also keeps high speed, especially on high-resolution output maps. In particular, with a lightweight backbone, the basic detector equipped with our proposed CBN achieves state-of-the-art results on several popular benchmarks, and our proposed CBN can be plugged into several segmentation-based methods. Code is available at https://github.com/XiiZhao/cbn.pytorch.
