Knowledge Transfer for Collaborative Misbehavior Detection in Untrusted Vehicular Environments
Roshan Sedar, Charalampos Kalalas, Paolo Dini, Francisco Vazquez-Gallego, Jesus Alonso-Zarate, Luis Alonso
TL;DR
The paper tackles robust misbehavior detection in untrusted vehicular networks by introducing a DRL-based detector at RSUs augmented with selective transfer learning. It defines a semantic relatedness trust mechanism to identify trustworthy source RSUs and uses instance-level knowledge transfer to accelerate learning while avoiding negative transfer from adversaries. Across VeReMi-based experiments and three scenario types, the approach achieves faster convergence and high detection performance, including for unseen and partially observable attacks, outperforming tabula rasa baselines. The work demonstrates significant improvements in robustness, generalization, and data-efficiency, with practical implications for scalable edge security in V2X systems.
Abstract
Vehicular mobility underscores the need for collaborative misbehavior detection at the vehicular edge. However, locally trained misbehavior detection models are susceptible to adversarial attacks that aim to deliberately influence learning outcomes. In this paper, we introduce a deep reinforcement learning-based approach that employs transfer learning for collaborative misbehavior detection among roadside units (RSUs). In the presence of label-flipping and policy induction attacks, we perform selective knowledge transfer from trustworthy source RSUs to foster relevant expertise in misbehavior detection and avoid negative knowledge sharing from adversary-influenced RSUs. The performance of our proposed scheme is demonstrated with evaluations over a diverse set of misbehavior detection scenarios using an open-source dataset. Experimental results show that our approach significantly reduces the training time at the target RSU and achieves superior detection performance compared to the baseline scheme with tabula rasa learning. Enhanced robustness and generalizability can also be attained, by effectively detecting previously unseen and partially observable misbehavior attacks.
