CREA: A Collaborative Multi-Agent Framework for Creative Image Editing and Generation
Kavana Venkatesh, Connor Dunlop, Pinar Yanardag
TL;DR
CREA presents a collaborative multi-agent framework for creative image editing and generation that mirrors human creative workflows. It decomposes the process into specialized roles—Creative Director, Prompt Architect, Generative Executor, Art Critic, and Refinement Strategist—guided by six creativity principles and evaluated via a Creativity Index. Through iterative planning, generation/editing, evaluation, and self-enhancement, CREA achieves greater diversity, semantic alignment, and artistic transformation than state-of-the-art baselines. The approach enables user-guided co-creation and has potential for extending to video and personalized workflows, highlighting the viability of autonomous, creative AI systems.
Abstract
Creativity in AI imagery remains a fundamental challenge, requiring not only the generation of visually compelling content but also the capacity to add novel, expressive, and artistically rich transformations to images. Unlike conventional editing tasks that rely on direct prompt-based modifications, creative image editing requires an autonomous, iterative approach that balances originality, coherence, and artistic intent. To address this, we introduce CREA, a novel multi-agent collaborative framework that mimics the human creative process. Our framework leverages a team of specialized AI agents who dynamically collaborate to conceptualize, generate, critique, and enhance images. Through extensive qualitative and quantitative evaluations, we demonstrate that CREA significantly outperforms state-of-the-art methods in diversity, semantic alignment, and creative transformation. To the best of our knowledge, this is the first work to introduce the task of creative editing.
