SCI: A Metacognitive Control for Signal Dynamics
Vishal Joshua Meesala
TL;DR
This work reframes interpretability as a real-time, controllable state rather than a static property, proposing the Surgical Cognitive Interpreter (SCI), a closed-loop metacognitive layer that wraps a stochastic predictor. SCI uses a reliability-weighted, multi-scale signal representation P(t,s), a knowledge-guided interpreter ψ_Θ, and a Lyapunov-guided controller to minimize interpretive error ΔSP while safeguarding stability, including a bounded human-in-the-loop gain. Empirically, SCI demonstrates emergent metacognition by allocating substantially more compute to ambiguous inputs (up to ~3.6×–3.8×) and yields ΔSP as a usable safety signal with AUROC in the 0.63–0.86 range across MNIST, MIT-BIH, and bearings; it matches fixed-ensemble accuracy with lower average computation. The framework unifies signal processing, cognitive modeling, and control theory to deliver robust, auditable, and safer AI suitable for safety-critical settings, with clear paths for causal extensions and broader modality deployment.
Abstract
Modern deep learning systems are typically deployed as open-loop function approximators: they map inputs to outputs in a single pass, without regulating how much computation or explanatory effort is spent on a given case. In safety-critical settings, this is brittle: easy and ambiguous inputs receive identical processing, and uncertainty is only read off retrospectively from raw probabilities. We introduce the Surgical Cognitive Interpreter (SCI), a lightweight closed-loop metacognitive control layer that wraps an existing stochastic model and turns prediction into an iterative process. SCI monitors a scalar interpretive state SP(t), here instantiated as a normalized entropy-based confidence signal, and adaptively decides whether to stop, continue sampling, or abstain. The goal is not to improve accuracy per se, but to regulate interpretive error ΔSP and expose a safety signal that tracks when the underlying model is likely to fail. We instantiate SCI around Monte Carlo dropout classifiers in three domains: vision (MNIST digits), medical time series (MIT-BIH arrhythmia), and industrial condition monitoring (rolling-element bearings). In all cases, the controller allocates more inference steps to misclassified inputs than to correct ones (up to about 3-4x on MNIST and bearings, and 1.4x on MIT-BIH). The resulting ΔSP acts as a usable safety signal for detecting misclassifications (AUROC 0.63 on MNIST, 0.70 on MIT-BIH, 0.86 on bearings). Code and reproducibility: https://github.com/vishal-1344/sci
