Robust Collision Detection for Robots with Variable Stiffness Actuation by Using MAD-CNN: Modularized-Attention-Dilated Convolutional Neural Network
Zhenwei Niu, Lyes Saad Saoud, Irfan Hussain
TL;DR
MAD-CNN tackles collision detection for robots with variable stiffness actuators (VSAs) by integrating a dual inductive bias—modularized joint networks and dilated convolution—with a self-attention module to achieve data-efficient, robust performance. Trained on only four minutes of collision data at the highest stiffness, the model delivers 100% detection across stiffness levels with a mean delay around $12.0503$ ms, and benefits further from a 15 ms continuous filter to reduce false positives. Ablation results show each component (modularization, dilation, attention) contributes to improved accuracy and speed, while time-windowed inputs and per-joint processing enhance data efficiency in imbalanced collision data regimes. The approach outperforms state-of-the-art methods under limited training data, offering a practical, real-time safety solution for pHRI with VSAs and enabling safer human-robot collaboration in industrial and daily tasks.
Abstract
Ensuring safety is paramount in the field of collaborative robotics to mitigate the risks of human injury and environmental damage. Apart from collision avoidance, it is crucial for robots to rapidly detect and respond to unexpected collisions. While several learning-based collision detection methods have been introduced as alternatives to purely model-based detection techniques, there is currently a lack of such methods designed for collaborative robots equipped with variable stiffness actuators. Moreover, there is potential for further enhancing the network's robustness and improving the efficiency of data training. In this paper, we propose a new network, the Modularized Attention-Dilated Convolutional Neural Network (MAD-CNN), for collision detection in robots equipped with variable stiffness actuators. Our model incorporates a dual inductive bias mechanism and an attention module to enhance data efficiency and improve robustness. In particular, MAD-CNN is trained using only a four-minute collision dataset focusing on the highest level of joint stiffness. Despite limited training data, MAD-CNN robustly detects all collisions with minimal detection delay across various stiffness conditions. Moreover, it exhibits a higher level of collision sensitivity, which is beneficial for effectively handling false positives, which is a common issue in learning-based methods. Experimental results demonstrate that the proposed MAD-CNN model outperforms existing state-of-the-art models in terms of collision sensitivity and robustness.
