Embodied Laser Attack: Leveraging Scene Priors to Achieve Agent-based Robust Non-contact Attacks
Yitong Sun, Yao Huang, Xingxing Wei
TL;DR
The paper addresses the limited robustness of physical adversarial attacks in dynamic, real-world settings, particularly non-contact laser attacks in traffic scenarios. It proposes Embodied Laser Attack (ELA), a Perception-Decision-Control framework that uses a Perspective Transformation Network (PTN) to infer the victim's view from the attacker's observations and a reinforcement learning agent to select laser parameters in real time. Key contributions include (1) a PTN that exploits traffic scene priors for fast, local perspective estimation, (2) an agent-based decision module trained with reinforcement learning to produce attack strategies instantly, and (3) comprehensive experiments in CARLA and in physical-world scenarios showing higher attack success rates and faster attacks than fixed or offline methods. The results highlight practical security implications for vision systems in traffic and provide a framework for evaluating the robustness and concealment of non-contact adversarial attacks.
Abstract
Physical adversarial attacks are widely used to expose potential risks in security-critical scenarios, especially dynamic ones, but this use has also revealed their vulnerability to environmental variations. Because physical adversarial attack methods are not robust, their performance is unstable. Although methods such as Expectation over Transformation (EOT) have improved the robustness of traditional contact attacks like adversarial patches, these attacks fall short in practicality and concealment within dynamic environments such as traffic scenarios. Non-contact laser attacks offer better adaptability, but their attributes span only a limited optimization space, which makes EOT less effective. This limitation calls for a new strategy to improve the robustness of such attacks. To address these issues, this paper introduces the Embodied Laser Attack (ELA), a novel framework that leverages the embodied intelligence paradigm of Perception-Decision-Control to dynamically tailor non-contact laser attacks. For the perception module, because simulating the victim's view by full-image transformation is difficult, ELA uses a local perspective transformation network that builds on the intrinsic prior knowledge of traffic scenes and enables effective and efficient estimation. For the decision and control module, ELA trains an attack agent with data-driven reinforcement learning and well-designed rewards instead of adopting time-consuming heuristic algorithms, so that the agent can instantaneously determine a valid attack strategy from the perceived information; a controllable laser emitter then carries out that strategy. Experimentally, we apply our framework to diverse traffic scenarios in both the digital and physical worlds, verifying the effectiveness of our method under dynamic successive scenes.
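To make the idea of a local perspective transformation concrete, the sketch below maps points on a traffic sign from the attacker's view to the victim's view with a classical planar homography. This is only an illustrative stand-in: the paper's perception module is a learned network, and nothing here reproduces it. The sketch assumes the sign is planar and that four corner correspondences are known; the function names and the corner coordinates are hypothetical.

```python
import numpy as np


def estimate_homography(src, dst):
    """Estimate a 3x3 homography H with dst ~ H @ src (direct linear transform).

    Assumption: src and dst hold at least four (x, y) correspondences of a
    planar object, e.g. sign corners seen by the attacker and by the victim.
    """
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        # Each correspondence contributes two linear constraints on H.
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    a = np.array(rows, dtype=float)
    # The solution is the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(a)
    h = vt[-1].reshape(3, 3)
    return h / h[2, 2]  # normalize; assumes h[2, 2] is nonzero


def warp_points(h, pts):
    """Apply homography h to an array-like of (x, y) points."""
    pts = np.asarray(pts, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homog @ h.T
    return mapped[:, :2] / mapped[:, 2:3]  # back to Cartesian coordinates


if __name__ == "__main__":
    # Hypothetical sign corners (pixels) in the attacker's and victim's images.
    attacker_corners = [(100, 80), (180, 85), (178, 165), (98, 160)]
    victim_corners = [(40, 30), (140, 28), (142, 130), (38, 128)]
    h = estimate_homography(attacker_corners, victim_corners)
    # Where a laser spot seen by the attacker would appear to the victim.
    print(warp_points(h, [(140, 120)]))
```

Restricting the transform to the sign region, rather than warping the full image, mirrors the motivation given in the abstract: only the local patch that the laser can affect needs to be estimated.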
