Introducing VaDA: Novel Image Segmentation Model for Maritime Object Segmentation Using New Dataset
Yongjin Kim, Jinbum Park, Sanha Kang, Hanguen Kim
TL;DR
This work introduces VaDA, a maritime object segmentation model that uses Vertical and Detail Attention to improve performance across diverse sea conditions. It also proposes IFCP, a holistic metric for evaluating real-time suitability, and presents the OASIs dataset for benchmarking segmentation under day, adverse-weather, and night scenarios. VaDA achieves state-of-the-art results on OASIs, with an IFCP of $0.6422$ and an mIoU of $0.7993$, while remaining practical to deploy on edge devices. Together, these contributions advance robust, real-time maritime perception and give the community a standardized dataset and evaluation framework.
Abstract
The maritime shipping industry is evolving rapidly, driven by advances in computer vision and artificial intelligence (AI). Consequently, research on AI-based object recognition models for maritime transportation is growing steadily, supported by improvements in sensor technology and computing performance. However, object recognition in maritime environments faces challenges such as light reflection, interference, intense lighting, and varying weather conditions. Addressing these challenges requires high-performance deep learning algorithms tailored to maritime imagery and high-quality datasets specialized for maritime scenes. Existing AI recognition models and datasets have limited suitability for building autonomous navigation systems. Therefore, in this paper, we propose the Vertical and Detail Attention (VaDA) model for maritime object segmentation and a new model evaluation method, the Integrated Figure of Calculation Performance (IFCP), to verify a model's suitability for real-time use in such systems. Additionally, we introduce a benchmark maritime dataset, OASIs (Ocean AI Segmentation Initiatives), to standardize model performance evaluation across diverse maritime environments. The OASIs dataset and further details are available on our website: https://www.navlue.com/dataset
