Efficient Visual Fault Detection for Freight Train via Neural Architecture Search with Data Volume Robustness
Yang Zhang, Mingying Li, Huilin Pan, Moyun Liu, Yang Zhou
TL;DR
The paper tackles visual fault detection for freight trains by designing NAS FTI-FDet, an efficient NAS framework that searches for a task-specific, multi-scale detection head. It introduces a scale-aware search space and a representation-sharing scheme to handle large receptive-field variations while reducing memory and search time, and it demonstrates data-volume robustness by achieving competitive accuracy with reduced datasets. Empirically, the method attains 46.8 mAP on Bottom View and 47.9 mAP on Side View, outperforming several hand-crafted and NAS-based baselines, with linear reductions in search cost as data volume decreases. The work suggests practical benefits for industrial settings with limited data and resources, and discusses extensions to illumination robustness and backbone-aware NAS.
Abstract
Deep learning-based fault detection methods have achieved significant success. In visual fault detection of freight trains, there exists a large characteristic difference between inter-class components (scale variance) but intra-class on the contrary, which entails scale-awareness for detectors. Moreover, the design of task-specific networks heavily relies on human expertise. As a consequence, neural architecture search (NAS) that automates the model design process gains considerable attention because of its promising performance. However, NAS is computationally intensive due to the large search space and huge data volume. In this work, we propose an efficient NAS-based framework for visual fault detection of freight trains to search for the task-specific detection head with capacities of multi-scale representation. First, we design a scale-aware search space for discovering an effective receptive field in the head. Second, we explore the robustness of data volume to reduce search costs based on the specifically designed search space, and a novel sharing strategy is proposed to reduce memory and further improve search efficiency. Extensive experimental results demonstrate the effectiveness of our method with data volume robustness, which achieves 46.8 and 47.9 mAP on the Bottom View and Side View datasets, respectively. Our framework outperforms the state-of-the-art approaches and linearly decreases the search costs with reduced data volumes.
