Table of Contents
Fetching ...

Inland Waterway Object Detection in Multi-environment: Dataset and Approach

Shanshan Wang, Haixiang Xu, Hui Feng, Xiaoqian Wang, Pei Song, Sijie Liu, Jianhua He

TL;DR

A scene-guided image enhancement module to improve water surface images based on environmental conditions adaptively and a parameter-limited dilated convolution enhances the representation of vessel features, while a multi-scale diluted residual fusion method integrates multi-scale features for better detection.

Abstract

The success of deep learning in intelligent ship visual perception relies heavily on rich image data. However, dedicated datasets for inland waterway vessels remain scarce, limiting the adaptability of visual perception systems in complex environments. Inland waterways, characterized by narrow channels, variable weather, and urban interference, pose significant challenges to object detection systems based on existing datasets. To address these issues, this paper introduces the Multi-environment Inland Waterway Vessel Dataset (MEIWVD), comprising 32,478 high-quality images from diverse scenarios, including sunny, rainy, foggy, and artificial lighting conditions. MEIWVD covers common vessel types in the Yangtze River Basin, emphasizing diversity, sample independence, environmental complexity, and multi-scale characteristics, making it a robust benchmark for vessel detection. Leveraging MEIWVD, this paper proposes a scene-guided image enhancement module to improve water surface images based on environmental conditions adaptively. Additionally, a parameter-limited dilated convolution enhances the representation of vessel features, while a multi-scale dilated residual fusion method integrates multi-scale features for better detection. Experiments show that MEIWVD provides a more rigorous benchmark for object detection algorithms, and the proposed methods significantly improve detector performance, especially in complex multi-environment scenarios.

Inland Waterway Object Detection in Multi-environment: Dataset and Approach

TL;DR

A scene-guided image enhancement module to improve water surface images based on environmental conditions adaptively and a parameter-limited dilated convolution enhances the representation of vessel features, while a multi-scale diluted residual fusion method integrates multi-scale features for better detection.

Abstract

The success of deep learning in intelligent ship visual perception relies heavily on rich image data. However, dedicated datasets for inland waterway vessels remain scarce, limiting the adaptability of visual perception systems in complex environments. Inland waterways, characterized by narrow channels, variable weather, and urban interference, pose significant challenges to object detection systems based on existing datasets. To address these issues, this paper introduces the Multi-environment Inland Waterway Vessel Dataset (MEIWVD), comprising 32,478 high-quality images from diverse scenarios, including sunny, rainy, foggy, and artificial lighting conditions. MEIWVD covers common vessel types in the Yangtze River Basin, emphasizing diversity, sample independence, environmental complexity, and multi-scale characteristics, making it a robust benchmark for vessel detection. Leveraging MEIWVD, this paper proposes a scene-guided image enhancement module to improve water surface images based on environmental conditions adaptively. Additionally, a parameter-limited dilated convolution enhances the representation of vessel features, while a multi-scale dilated residual fusion method integrates multi-scale features for better detection. Experiments show that MEIWVD provides a more rigorous benchmark for object detection algorithms, and the proposed methods significantly improve detector performance, especially in complex multi-environment scenarios.

Paper Structure

This paper contains 24 sections, 10 equations, 11 figures, 5 tables.

Figures (11)

  • Figure Fig.1: Representative samples illustrating diverse viewing angles, illumination conditions, meteorological variations, and scenarios from MEIWVD.
  • Figure Fig.2: Temporal distribution and percentage representation of image acquisition time points in the MEIWVD.
  • Figure Fig.3: Temporal distribution and percentage representation of image acquisition time points in the MEIWVD.
  • Figure Fig.4: Examples of multi-environment scenarios in the MEIWVD.
  • Figure Fig.5: Relative multi-scale distribution of surface objects in the SeaShips and MEIWVD.
  • ...and 6 more figures