Human-Like Geometric Abstraction in Large Pre-trained Neural Networks
Declan Campbell, Sreejan Kumar, Tyler Giallanza, Thomas L. Griffiths, Jonathan D. Cohen
TL;DR
This work investigates whether large pre-trained neural networks can exhibit human-like geometric abstraction, challenging the view that symbolic primitives are required. It tests three core biases—geometric complexity sensitivity, regularity sensitivity, and parts/relations decomposition—across three tasks and models (DINOv2, CLIP, ResNet-50). The results show that self-supervised transformers (DINOv2, CLIP) encode geometric complexity and reproduce the regularity bias more closely to humans than ResNet, though performance on the Geoclidean parts/relations task remains below human levels. The findings suggest geometric abstractions can emerge from scale and distributional learning, offering a connectionist alternative to symbol-based theories and guiding future architectural and training innovations to enhance relational reasoning in AI.
Abstract
Humans possess a remarkable capacity to recognize and manipulate abstract structure, which is especially apparent in the domain of geometry. Recent research in cognitive science suggests neural networks do not share this capacity, concluding that human geometric abilities come from discrete symbolic structure in human mental representations. However, progress in artificial intelligence (AI) suggests that neural networks begin to demonstrate more human-like reasoning after scaling up standard architectures in both model size and amount of training data. In this study, we revisit empirical results in cognitive science on geometric visual processing and identify three key biases in geometric visual processing: a sensitivity towards complexity, regularity, and the perception of parts and relations. We test tasks from the literature that probe these biases in humans and find that large pre-trained neural network models used in AI demonstrate more human-like abstract geometric processing.
