Perceptual misalignment of texture representations in convolutional neural networks

Ludovica de Paolis, Fabio Anselmi, Alessio Ansuini, Eugenio Piasini

Abstract

Mathematical modeling of visual textures traces back to Julesz's intuition that texture perception in humans is based on local correlations between image features. An influential approach for texture analysis and generation generalizes this notion to linear correlations between the nonlinear features computed by convolutional neural networks (CNNs), compiled into Gram matrices. Given that CNNs are often used as models for the visual system, it is natural to ask whether such "texture representations" spontaneously align with the textures' perceptual content, and in particular whether those CNNs that are regarded as better models for the visual system also possess more human-like texture representations. Here we compare the perceptual content captured by feature correlations computed for a diverse pool of CNNs, and we compare it to the models' perceptual alignment with the mammalian visual system as measured by Brain-Score. Surprisingly, we find that there is no connection between conventional measures of CNN quality as a model of the visual system and its alignment with human texture perception. We conclude that texture perception involves mechanisms that are distinct from those that are commonly modeled using approaches based on CNNs trained on object recognition, possibly depending on the integration of contextual information.
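The texture representation described here (linear correlations between a layer's nonlinear feature maps, collected into a Gram matrix) can be sketched in a few lines. This is a minimal NumPy illustration following the usual convention from Gatys-style texture synthesis; the function name and the normalization by the number of spatial positions are our own choices for the sketch, not the authors' implementation.

```python
import numpy as np

def gram_matrix(features):
    """Gram matrix of a convolutional layer's feature maps.

    features: array of shape (C, H, W), i.e. C feature maps over an
    H x W spatial grid. Returns a (C, C) matrix whose (i, j) entry is
    the inner product of flattened maps i and j, normalized by the
    number of spatial positions.
    """
    c, h, w = features.shape
    f = features.reshape(c, h * w)   # flatten the spatial dimensions
    return f @ f.T / (h * w)         # feature-feature correlations

# Toy example: 3 feature maps on a 4x4 grid.
rng = np.random.default_rng(0)
feats = rng.standard_normal((3, 4, 4))
g = gram_matrix(feats)
print(g.shape)  # (3, 3), symmetric by construction
```

Because the spatial dimensions are summed out, the Gram matrix discards where features occur and keeps only how strongly they co-occur, which is what makes it a natural texture statistic.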

Paper Structure

This paper contains 21 sections, 20 equations, 4 figures, and 7 tables.

Figures (4)

  • Figure 1: Representational Dissimilarity Matrices (5640x5640) obtained with Representational Similarity Analysis from the 5 layers analyzed for VGG-19. Each entry corresponds to the distance between the Gram matrix representation of one pair of images in DTD. The distance between representations is computed using cosine similarity.
  • Figure 2: MI values across CNN layers identified by index (1-5). Each line corresponds to one of the 13 models as color-coded in the legend.
  • Figure 3: Correlation plots with Pearson's ρ. Each subplot shows the Brain-Score average vision, neural vision, behavior vision, V1, V2, V4, and IT scores, respectively, vs. the layer with the highest MI for each of the 13 CNNs. Each color identifies a model as reported in the legend.
  • Figure 4: Textures generated with the Gatys algorithm applied to a subsample of CNNs (rows). The 4 original images (top row) belong to 4 different classes in DTD (columns: blotchy; striped; matted; scaly). The last column shows one of the images originally used by Gatys et al., which we synthesized as a reference. Textures generated by the whole pool of 13 models can be found in the Appendix (Supplementary Figures \ref{fig:synthesized_all_1} and \ref{fig:synthesized_all_2}).
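The representational dissimilarity matrices of Figure 1 can be sketched as follows: given one flattened Gram-matrix representation per image, each RDM entry is one minus the cosine similarity of a pair of representations. This is an illustrative reconstruction assuming cosine distance as the dissimilarity; `cosine_rdm` is a hypothetical helper, not code from the paper.

```python
import numpy as np

def cosine_rdm(vectors):
    """Representational dissimilarity matrix from cosine similarity.

    vectors: (N, D) array, one flattened Gram-matrix representation
    per image. Returns an (N, N) matrix whose (i, j) entry is
    1 - cosine_similarity(vectors[i], vectors[j]), so identical
    directions give 0 and orthogonal ones give 1.
    """
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    unit = vectors / norms           # project onto the unit sphere
    sim = unit @ unit.T              # pairwise cosine similarities
    return 1.0 - sim

# Toy example with three 2-D "representations".
reps = np.array([[1.0, 0.0],
                 [0.0, 1.0],
                 [1.0, 1.0]])
rdm = cosine_rdm(reps)
```

For a dataset the size of DTD this yields the 5640 x 5640 matrices shown in Figure 1, one per analyzed layer.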