Object-Size-Driven Design of Convolutional Neural Networks: Virtual Axle Detection based on Raw Data
Henik Riedel, Robert Steven Lorenzen, Clemens Hübler
TL;DR
The paper tackles real-time axle detection for Bridge Weigh-In-Motion (BWIM) without dedicated axle detectors by introducing VADER, a U-Net–based FCN that operates on raw acceleration data, and the Maximum Receptive Field (MRF) rule to constrain hyperparameters based on the bridge's fundamental frequency. It demonstrates that using raw data yields dramatic gains in speed (≈65× faster) and memory (≈1% of the raw-input footprint of spectrograms) while maintaining high detection accuracy, achieving up to 99.9% axle detection with a mean spatial error around 3.69 cm in favorable sensor conditions. The work compares raw data to spectrogram-based inputs across stratified and DGPS labeling scenarios, showing that raw-input models generalize better and are more robust to sensor degradation. Beyond axle detection, the MRF rule provides a theoretically grounded approach to hyperparameter tuning that could apply to other unstructured data problems, potentially reducing the need for extensive hyperparameter searches in diverse domains.
Abstract
As infrastructure ages, the need for efficient monitoring methods becomes increasingly critical. Bridge Weigh-In-Motion (BWIM) systems are crucial for cost-effective determination of loads and, consequently, the residual service life of road and railway infrastructure. However, conventional BWIM systems require additional sensors for axle detection, which must be installed in potentially inaccessible locations or places that interfere with bridge operation. This study presents a novel approach for real-time detection of train axles using sensors arbitrarily placed on bridges, providing an alternative to dedicated axle detectors. The developed Virtual Axle Detector with Enhanced Receptive Field (VADER) has been validated on a single-track railway bridge using only acceleration measurements, detecting 99.9% of axles with a spatial error of 3.69cm. Using raw data as input outperformed the state-of-the-art spectrogram-based method in both speed and memory usage by 99%, thereby making real-time application feasible for the first time. Additionally, we introduce the Maximum Receptive Field (MRF) rule, a novel approach to optimise hyperparameters of Convolutional Neural Networks (CNNs) based on the size of objects. In this context, the object size relates to the fundamental frequency of a bridge. The MRF rule effectively narrows the hyperparameter search space, overcoming the need for extensive hyperparameter tuning. Since the MRF rule can theoretically be applied to all unstructured data, it could have implications for a wide range of deep learning problems, from earthquake prediction to object recognition.
