Transfer-LMR: Heavy-Tail Driving Behavior Recognition in Diverse Traffic Scenarios
Chirag Parikh, Ravi Shankar Mishra, Rohan Chandra, Ravi Kiran Sarvadevabhatla
TL;DR
Transfer-LMR tackles heavy-tail driving behavior recognition in low-resource dashcam datasets by a two-stage training pipeline that first learns general video features and then refines tail-class representations with Long-Tailed Mixed Reconstruction (LMR). It introduces boundary-constrained data augmentation to preserve peripheral context, temporal feature aggregation, and LMR-based feature refinement to diversify tail-class features without sacrificing overall mean average precision. Evaluated on METEOR and HDD, it delivers substantial gains in average class accuracy across multiple backbones, notably improving tail-class predictions such as CT, WL, and DT, and achieves state-of-the-art results with I3D pre-trained on Kinetics-400. The method demonstrates backbone-agnostic applicability and practical inference speed, making it a robust solution for robust planning and safety in diverse traffic scenarios.
Abstract
Recognizing driving behaviors is important for downstream tasks such as reasoning, planning, and navigation. Existing video recognition approaches work well for common behaviors (e.g. "drive straight", "brake", "turn left/right"). However, the performance is sub-par for underrepresented/rare behaviors typically found in tail of the behavior class distribution. To address this shortcoming, we propose Transfer-LMR, a modular training routine for improving the recognition performance across all driving behavior classes. We extensively evaluate our approach on METEOR and HDD datasets that contain rich yet heavy-tailed distribution of driving behaviors and span diverse traffic scenarios. The experimental results demonstrate the efficacy of our approach, especially for recognizing underrepresented/rare driving behaviors.
