Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network
Rasha Alshawi, Md Meftahul Ferdaus, Mahdi Abdelguerfi, Kendall Niles, Ken Pathak, Steve Sloan
TL;DR
This work tackles the challenge of imbalanced defect data in culvert and sewer segmentation by proposing the Enhanced Feature Pyramid Network (E-FPN), a dual-pathway architecture that combines a sparsely connected, Inception-like bottom-up block with depth-wise separable convolutions and a high-resolution top-down fusion path. It further mitigates imbalance through class decomposition and targeted data augmentation, validated on the Culvert-Sewer Defects dataset and an aerial drone dataset, achieving substantial IoU gains and dramatically reduced parameter counts. Key contributions include a rigorous ablation study showing the benefit of multi-scale depthwise blocks, and the demonstration that combining CD and DA yields the largest performance gains ($ ext{IoU}$ improvement of $≈6.97 ext%$ on average). The model offers a practical, scalable solution for infrastructure inspection, with implications for broader multi-class segmentation in imbalanced real-world datasets and potential extensions to temporal, unsupervised, and physics-informed approaches.
Abstract
Imbalanced datasets are a significant challenge in real-world scenarios. They lead to models that underperform on underrepresented classes, which is a critical issue in infrastructure inspection. This paper introduces the Enhanced Feature Pyramid Network (E-FPN), a deep learning model for the semantic segmentation of culverts and sewer pipes within imbalanced datasets. The E-FPN incorporates architectural innovations like sparsely connected blocks and depth-wise separable convolutions to improve feature extraction and handle object variations. To address dataset imbalance, the model employs strategies like class decomposition and data augmentation. Experimental results on the culvert-sewer defects dataset and a benchmark aerial semantic segmentation drone dataset show that the E-FPN outperforms state-of-the-art methods, achieving an average Intersection over Union (IoU) improvement of 13.8% and 27.2%, respectively. Additionally, class decomposition and data augmentation together boost the model's performance by approximately 6.9% IoU. The proposed E-FPN presents a promising solution for enhancing object segmentation in challenging, multi-class real-world datasets, with potential applications extending beyond culvert-sewer defect detection.
