Visual Categorization Across Minds and Models: Cognitive Analysis of Human Labeling and Neuro-Symbolic Integration

Chethana Prasad Kabgere

TL;DR

The paper investigates how humans and AI systems categorize ambiguous, low-resolution images, contrasting symbolic, analogical, and embodied human strategies with the feature-based processing of a CNN. Using a ResNet-18 baseline and Grad-CAM visualizations on CIFAR-10 stimuli, the study finds that human performance is robust and driven by shape-based prototypes and contextual grounding, whereas the model relies on texture-driven features and offers limited interpretability. The findings highlight parallels and gaps across Marr’s levels, bounded rationality, and PDP-inspired representations, and argue for neuro-symbolic architectures that fuse structured reasoning with sub-symbolic perception to improve interpretability and robustness. The work contributes to the understanding of cognitive alignment in AI and proposes pathways toward interpretable, context-aware systems that more closely mimic human visual reasoning and decision-making.

Abstract

Understanding how humans and AI systems interpret ambiguous visual stimuli offers critical insight into the nature of perception, reasoning, and decision-making. This paper examines image labeling performance across human participants and deep neural networks, focusing on low-resolution, perceptually degraded stimuli. Drawing from computational cognitive science, cognitive architectures, and connectionist-symbolic hybrid models, we contrast human strategies such as analogical reasoning, shape-based recognition, and confidence modulation with AI's feature-based processing. Grounded in Marr's tri-level hypothesis, Simon's bounded rationality, and Thagard's frameworks of representation and emotion, we analyze participant responses in relation to Grad-CAM visualizations of model attention. Human behavior is further interpreted through cognitive principles modeled in ACT-R and Soar, revealing layered and heuristic decision strategies under uncertainty. Our findings highlight key parallels and divergences between biological and artificial systems in representation, inference, and confidence calibration. The analysis motivates future neuro-symbolic architectures that unify structured symbolic reasoning with connectionist representations. Such architectures, informed by principles of embodiment, explainability, and cognitive alignment, offer a path toward AI systems that are not only performant but also interpretable and cognitively grounded.
