Cycle Pixel Difference Network for Crisp Edge Detection
Changsong Liu, Wei Zhang, Yanyan Liu, Mingyang Li, Wenlin Li, Yimeng Fan, Xiangnan Bai, Liang Zhang
TL;DR
Cycle Pixel Difference Network (CPD-Net) targets two persistent edge-detection challenges: reliance on large pre-trained weights and producing thick edges. It introduces Cycle Pixel Difference Convolution (CPDC) to encode edge priors from four directions, enabling scratch training, and couples this with a Multi-scale Information Enhancement Module (MSEM) and a Dual Residual Connection (DRC) decoder to sharpen edge localization. A Hybrid Focal Loss combines focal Tversky and focal loss to address pixel imbalance, further improving contour fidelity. Evaluations on BSDS500, NYUD-V2, BIPED, and CID show competitive performance without pretraining and good edge crispness, with CPD-Net achieving strong results while maintaining a compact, efficient model. The work proposes a practical, resource-efficient edge detector that closely adheres to ground-truth contours and offers potential for real-time applications in constrained environments.
Abstract
Edge detection, as a fundamental task in computer vision, has garnered increasing attention. The advent of deep learning has significantly advanced this field. However, recent deep learning-based methods generally face two significant issues: 1) reliance on large-scale pre-trained weights, and 2) generation of thick edges. We construct a U-shape encoder-decoder model named CPD-Net that successfully addresses these two issues simultaneously. In response to issue 1), we propose a novel cycle pixel difference convolution (CPDC), which effectively integrates edge prior knowledge with modern convolution operations, consequently successfully eliminating the dependence on large-scale pre-trained weights. As for issue 2), we construct a multi-scale information enhancement module (MSEM) and a dual residual connection-based (DRC) decoder to enhance the edge location ability of the model, thereby generating crisp and clean contour maps. Comprehensive experiments conducted on four standard benchmarks demonstrate that our method achieves competitive performance on the BSDS500 dataset (ODS=0.813 and AC=0.352), NYUD-V2 (ODS=0.760 and AC=0.223), BIPED dataset (ODS=0.898 and AC=0.426), and CID (ODS=0.59). Our approach provides a novel perspective for addressing these challenges in edge detection.
