Revision Matters: Generative Design Guided by Revision Edits

Tao Li; Chin-Yi Cheng; Amber Xie; Gang Li; Yang Li

Revision Matters: Generative Design Guided by Revision Edits

Tao Li, Chin-Yi Cheng, Amber Xie, Gang Li, Yang Li

TL;DR

This paper curates an expert dataset that traces how human designers iteratively edit and improve a layout generation with a prompted language goal, and explores various supervised fine-tuning task setups on top of a Gemini multimodal backbone, a large multimodal model.

Abstract

Layout design, such as user interface or graphical layout in general, is fundamentally an iterative revision process. Through revising a design repeatedly, the designer converges on an ideal layout. In this paper, we investigate how revision edits from human designer can benefit a multimodal generative model. To do so, we curate an expert dataset that traces how human designers iteratively edit and improve a layout generation with a prompted language goal. Based on such data, we explore various supervised fine-tuning task setups on top of a Gemini multimodal backbone, a large multimodal model. Our results show that human revision plays a critical role in iterative layout refinement. While being noisy, expert revision edits lead our model to a surprisingly strong design FID score ~10 which is close to human performance (~6). In contrast, self-revisions that fully rely on model's own judgement, lead to an echo chamber that prevents iterative improvement, and sometimes leads to generative degradation. Fortunately, we found that providing human guidance plays at early stage plays a critical role in final generation. In such human-in-the-loop scenario, our work paves the way for iterative design revision based on pre-trained large multimodal models.

Revision Matters: Generative Design Guided by Revision Edits

TL;DR

Abstract

Paper Structure (29 sections, 4 equations, 5 figures, 7 tables)

This paper contains 29 sections, 4 equations, 5 figures, 7 tables.

Introduction
Related Works
Learning from Human Feedback
Layout Generation
Our Takes
Task Formulation
Discussions
Rare+ Dataset
Noisy and lengthy edits
Modeling
Considerations
Direct model
Hop model
Single revision model
Multi-revision model
...and 14 more sections

Figures (5)

Figure 1: Examples of state (i.e., image-code pair) in Rare+ dataset. Refer to Appx. \ref{['sec:color_legend']} for color legends.
Figure 2: Modeling overview
Figure 3: Qualitative analysis of model generations vs ground truths. See Appx. \ref{['sec:color_legend']} for color legends.
Figure 4: Visualization Colors
Figure 5: Visualization Colors

Revision Matters: Generative Design Guided by Revision Edits

TL;DR

Abstract

Revision Matters: Generative Design Guided by Revision Edits

Authors

TL;DR

Abstract

Table of Contents

Figures (5)