Holistically-Nested Edge Detection
Saining Xie, Zhuowen Tu
TL;DR
The paper tackles edge and boundary detection in natural images by introducing holistically-nested edge detection (HED), a fully convolutional, image-to-image network trained with deep supervision on multiple side outputs. By trimming a VGG-16 backbone and adding side-output branches at multiple depths, HED learns rich multi-scale representations and fuses them with a learnable weighted layer, yielding accurate edges with practical speed. Extensive experiments on BSDS500 and NYUDv2 demonstrate state-of-the-art performance and robust performance gains from multi-scale supervision, consensus labeling, and RGB-D depth encoding via HHA features. The approach achieves ~0.4 s per image on GPUs and benefits from additional training data, highlighting its practical impact for real-world edge detection tasks.
Abstract
We develop a new edge detection algorithm that tackles two important issues in this long-standing vision problem: (1) holistic image training and prediction; and (2) multi-scale and multi-level feature learning. Our proposed method, holistically-nested edge detection (HED), performs image-to-image prediction by means of a deep learning model that leverages fully convolutional neural networks and deeply-supervised nets. HED automatically learns rich hierarchical representations (guided by deep supervision on side responses) that are important in order to approach the human ability resolve the challenging ambiguity in edge and object boundary detection. We significantly advance the state-of-the-art on the BSD500 dataset (ODS F-score of .782) and the NYU Depth dataset (ODS F-score of .746), and do so with an improved speed (0.4 second per image) that is orders of magnitude faster than some recent CNN-based edge detection algorithms.
