Underwater Acoustic Target Recognition based on Smoothness-inducing Regularization and Spectrogram-based Data Augmentation
Ji Xu, Yuan Xie, Wenchao Wang
TL;DR
The paper tackles underwater acoustic target recognition under limited data, where traditional data augmentation risks biasing models toward non-true distributions. It introduces two strategies: smoothness-inducing regularization, which leverages simulated signals only in a KL-divergence based regularization term to smooth decision boundaries, and Local Masking and Replicating (LMR), a spectrogram-specific augmentation that creates mixed inputs to capture inter-class relationships. Implemented on a ResNet-18 with multi-head attention and evaluated across Shipsear, DeepShip, and a private DTIL dataset, the approach yields consistent improvements, with LMR providing substantial gains in data-scarce settings and the combination with regularization offering the best performance. Visualizations (confusion matrices and CAM) support improved generalization and reduced local bias, and robustness analyses show enhanced resilience to perturbations. Limitations include limited gains in data-rich regimes and potential negative effects of LMR on some tasks beyond recognition. The work provides a practical benchmark and demonstrates that carefully designed regularization and spectrogram-based augmentation can achieve robust underwater target recognition with limited data.
Abstract
Underwater acoustic target recognition is a challenging task owing to the intricate underwater environments and limited data availability. Insufficient data can hinder the ability of recognition systems to support complex modeling, thus impeding their advancement. To improve the generalization capacity of recognition models, techniques such as data augmentation have been employed to simulate underwater signals and diversify data distribution. However, the complexity of underwater environments can cause the simulated signals to deviate from real scenarios, resulting in biased models that are misguided by non-true data. In this study, we propose two strategies to enhance the generalization ability of models in the case of limited data while avoiding the risk of performance degradation. First, as an alternative to traditional data augmentation, we utilize smoothness-inducing regularization, which only incorporates simulated signals in the regularization term. Additionally, we propose a specialized spectrogram-based data augmentation strategy, namely local masking and replicating (LMR), to capture inter-class relationships. Our experiments and visualization analysis demonstrate the superiority of our proposed strategies.
