CREward: A Type-Specific Creativity Reward Model
Jiyeon Han, Ali Mahdavi-Amiri, Hao Zhang, Haedong Jeong
TL;DR
CREward formalizes creativity along three interpretable axes—geometry, material, and texture—and uses a human LVLM-aligned framework to produce a type-specific reward for guiding and evaluating creative image generation. By building CREBench, correlating human judgments with open and closed LVLMs, and training a lightweight reward head with LVLM-derived labels, the approach achieves strong alignment with human perception across all creativity types. CREward enables practical applications including type-aware creativity assessment, sampling for design inspiration, and controllable diffusion-generation via LoRA sliders, with Grad-CAM providing explainable highlights of creativity-relevant regions. While promising, the authors note limitations such as bias toward novelty and entanglement among creativity types, suggesting future work on disentanglement and joint value assessment to complement novelty-focused signals. Overall, the work offers a scalable, interpretable framework for evaluating and steering creativity in generative visual systems with real-world design implications.
Abstract
Creativity is a complex phenomenon. When it comes to representing and assessing creativity, treating it as a single undifferentiated quantity would appear naive and underwhelming. In this work, we learn the \emph{first type-specific creativity reward model}, coined CREward, which spans three creativity ``axes," geometry, material, and texture, to allow us to view creativity through the lens of the image formation pipeline. To build our reward model, we first conduct a human benchmark evaluation to capture human perception of creativity for each type across various creative images. We then analyze the correlation between human judgments and predictions by large vision-language models (LVLMs), confirming that LVLMs exhibit strong alignment with human perception. Building on this observation, we collect LVLM-generated labels to train our CREward model that is applicable to both evaluation and generation of creative images. We explore three applications of CREward: creativity assessment, explainable creativity, and creative sample acquisition for both human design inspiration and guiding creative generation through low-rank adaptation.
