Having Second Thoughts? Let's hear it
Jung H. Lee, Sujith Vijayan
TL;DR
The paper tackles the robustness of deep learning vision models by introducing STCert, a two-stage, brain-inspired certification that leverages top-down information to improve reliability. It employs foundation segmentation models (Grounding DINO and SAM) to identify ROIs from an original prediction and generates second-thought classifications, comparing them to certify outputs. Across ImageNet subsets and multiple architectures, context-aware STCert reduces inter-category errors and can detect artificial and natural adversarial inputs, particularly when the original and second thoughts are produced by different classifiers. While STCert acts as an error detector (not a fix) and incurs computational costs, it demonstrates meaningful gains in safety-focused scenarios and reveals how context and high-level priors influence DL decision-making.
Abstract
Deep learning models loosely mimic bottom-up signal pathways from low-order sensory areas to high-order cognitive areas. After training, DL models can outperform humans on some domain-specific tasks, but their decision-making process has been known to be easily disrupted. Since the human brain consists of multiple functional areas highly connected to one another and relies on intricate interplays between bottom-up and top-down (from high-order to low-order areas) processing, we hypothesize that incorporating top-down signal processing may make DL models more robust. To address this hypothesis, we propose a certification process mimicking selective attention and test if it could make DL models more robust. Our empirical evaluations suggest that this newly proposed certification can improve DL models' accuracy and help us build safety measures to alleviate their vulnerabilities with both artificial and natural adversarial examples.
