CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation
Chen Wei, Jiachen Zou, Dietmar Heinke, Quanying Liu
TL;DR
CoCoG-2 addresses how to controllably generate visual stimuli in the space of human concepts to study how concept representations influence behavior. It extends the prior CoCoG framework by introducing training-free guidance that decomposes generation into prior distributions and likelihood constraints, enabling flexible manipulation of concepts, semantics, and judgments without retraining. The method combines a concept encoder/decoder with CLIP embeddings in a two-stage diffusion process and supports multiple guidance strategies (concept, smoothness, semantics, judgment, uncertainty, pixel) and improvements (adaptive gradient scheduling, resampling). Experiments demonstrate diverse generation, smooth concept transitions, robust image editing, behavioral manipulation of similarity judgments, and information-maximizing designs for individual preferences, validating causal links between concepts and behavior. The approach offers a versatile toolkit for cognitive science experiments and AI-driven stimulus design.
Abstract
Humans interpret complex visual stimuli using abstract concepts that facilitate decision-making tasks such as food selection and risk avoidance. Similarity judgment tasks are effective for exploring these concepts. However, methods for controllable image generation in concept space are underdeveloped. In this study, we present a novel framework called CoCoG-2, which integrates generated visual stimuli into similarity judgment tasks. CoCoG-2 utilizes a training-free guidance algorithm to enhance generation flexibility. CoCoG-2 framework is versatile for creating experimental stimuli based on human concepts, supporting various strategies for guiding visual stimuli generation, and demonstrating how these stimuli can validate various experimental hypotheses. CoCoG-2 will advance our understanding of the causal relationship between concept representations and behaviors by generating visual stimuli. The code is available at \url{https://github.com/ncclab-sustech/CoCoG-2}.
