A Neurosymbolic Framework for Interpretable Cognitive Attack Detection in Augmented Reality
Rongqian Chen, Allison Andreyev, Yanming Xiu, Joshua Chilukuri, Shunav Sen, Mahdi Imani, Bin Li, Maria Gorlatova, Gang Tan, Tian Lan
TL;DR
This work tackles AR/MR cognitive attacks that distort semantic scene understanding, introducing CADAR, a neuro-symbolic framework that combines vision–language outputs with a symbolic perception graph and particle-filter temporal reasoning for interpretable attack detection. By converting multimodal observations into a structured spatiotemporal graph and applying probabilistic reasoning to detect semantic anomalies, CADAR achieves robust performance across diverse scenes and attack types. The authors also release CADAR-50K, the first public AR cognitive-attack dataset, and demonstrate that CADAR substantially outperforms both vision–language model baselines and traditional video classifiers, with ablations highlighting the value of the estimation module and reference-frame history. Overall, CADAR demonstrates the benefits of integrating neural perceptual foundations with symbolic, probabilistic reasoning for safe and interpretable AR systems at scale.
Abstract
Augmented Reality (AR) enriches human perception by overlaying virtual elements onto the physical world. However, this tight coupling between virtual and real content makes AR vulnerable to cognitive attacks: manipulations that distort users' semantic understanding of the environment. Existing detection methods largely focus on visual inconsistencies at the pixel or image level, offering limited semantic reasoning or interpretability. To address these limitations, we introduce CADAR, a neuro-symbolic framework for cognitive attack detection in AR that integrates neural and symbolic reasoning. CADAR fuses multimodal vision-language representations from pre-trained models into a perception graph that captures objects, relations, and temporal contextual salience. Building on this structure, a particle-filter-based statistical reasoning module infers anomalies in semantic dynamics to reveal cognitive attacks. This combination provides both the adaptability of modern vision-language models and the interpretability of probabilistic symbolic reasoning. Preliminary experiments on an AR cognitive-attack dataset demonstrate consistent advantages over existing approaches, highlighting the potential of neuro-symbolic methods for robust and interpretable AR security.
