For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives
Lia Morra, Antonio Santangelo, Pietro Basci, Luca Piano, Fabio Garcea, Fabrizio Lamberti, Massimo Leone
TL;DR
The paper introduces FRESCO, a framework that bridges visual semiotics and computer vision to analyze large-scale facial image archives from social media. It operationalizes the semiotics triad of plastic, figurative, and enunciation into quantitative traits derived from state-of-the-art CV models, and defines the FRESCO-Score as an interpretable, multi-level similarity metric. The approach is validated on public datasets (FFHQ-in-the-wild and MIAP OpenImages) to assess both the accuracy of extracted quantities and the usefulness of the score for content-based retrieval and sociocultural interpretation. The authors discuss limitations, such as model dependencies and out-of-distribution concerns, and outline future directions including integration with external knowledge and broader image archives to enhance robustness and applicability.
Abstract
Social networks are creating a digital world in which the cognitive, emotional, and pragmatic value of the imagery of human faces and bodies is arguably changing. However, researchers in the digital humanities are often ill-equipped to study these phenomena at scale. This work presents FRESCO (Face Representation in E-Societies through Computational Observation), a framework designed to explore the socio-cultural implications of images on social media platforms at scale. FRESCO deconstructs images into numerical and categorical variables using state-of-the-art computer vision techniques, aligning with the principles of visual semiotics. The framework analyzes images across three levels: the plastic level, encompassing fundamental visual features like lines and colors; the figurative level, representing specific entities or concepts; and the enunciation level, which focuses particularly on constructing the point of view of the spectator and observer. These levels are analyzed to discern deeper narrative layers within the imagery. Experimental validation confirms the reliability and utility of FRESCO, and we assess its consistency and precision across two public datasets. Subsequently, we introduce the FRESCO score, a metric derived from the framework's output that serves as a reliable measure of similarity in image content.
