Assured Autonomy with Neuro-Symbolic Perception
R. Spencer Hallyburton, Miroslav Pajic
TL;DR
The paper addresses the lack of assured robustness in perception for cyber-physical systems by highlighting vulnerabilities of pattern-matching DNNs and introducing a neuro-symbolic perception framework, NeuSPaPer, that combines joint object detection with scene graph generation to enable reasoning over semantic relationships. It leverages foundation models for offline knowledge extraction and specialized SGG models for real-time inference, with per-sensor and cross-sensor integrity checks guided by physics-based knowledge. Feasibility studies on nuScenes and CARLA show that scene-graph-based integrity can detect previously stealthy attacks like frustum attacks, improving resilience in multi-sensor fusion. The contributions include a vulnerability analysis, a neuro-symbolic perception and integrity architecture, and an initial feasibility demonstration that motivates future full-stack neuro-symbolic perception research for trusted autonomy in CPS.
Abstract
Many state-of-the-art AI models deployed in cyber-physical systems (CPS), while highly accurate, are simply pattern-matchers.~With limited security guarantees, there are concerns for their reliability in safety-critical and contested domains. To advance assured AI, we advocate for a paradigm shift that imbues data-driven perception models with symbolic structure, inspired by a human's ability to reason over low-level features and high-level context. We propose a neuro-symbolic paradigm for perception (NeuSPaPer) and illustrate how joint object detection and scene graph generation (SGG) yields deep scene understanding.~Powered by foundation models for offline knowledge extraction and specialized SGG algorithms for real-time deployment, we design a framework leveraging structured relational graphs that ensures the integrity of situational awareness in autonomy. Using physics-based simulators and real-world datasets, we demonstrate how SGG bridges the gap between low-level sensor perception and high-level reasoning, establishing a foundation for resilient, context-aware AI and advancing trusted autonomy in CPS.
