UIClip: A Data-driven Model for Assessing User Interface Design
Jason Wu, Yi-Hao Peng, Amanda Li, Amanda Swearngin, Jeffrey P. Bigham, Jeffrey Nichols
TL;DR
UIClip addresses the challenge of objective, scalable UI design quality assessment by linking screenshots with natural-language descriptions through a CLIP-based framework. It introduces JitterWeb, a 2.3 million-example synthetic dataset of jittered UIs, and BetterApp, a 1.2K-rated human dataset, to train and validate design quality scoring and surface design suggestions. Across extensive benchmarks, UIClip outperforms large vision-language baselines in design quality, design suggestion accuracy, and design relevance, while maintaining a compact model size. The work demonstrates practical applications in UI code generation, design guidance, and example retrieval, and commits to releasing training data and code to foster further research in data-driven UI design evaluation.
Abstract
User interface (UI) design is a difficult yet important task for ensuring the usability, accessibility, and aesthetic qualities of applications. In our paper, we develop a machine-learned model, UIClip, for assessing the design quality and visual relevance of a UI given its screenshot and natural language description. To train UIClip, we used a combination of automated crawling, synthetic augmentation, and human ratings to construct a large-scale dataset of UIs, collated by description and ranked by design quality. Through training on the dataset, UIClip implicitly learns properties of good and bad designs by i) assigning a numerical score that represents a UI design's relevance and quality and ii) providing design suggestions. In an evaluation that compared the outputs of UIClip and other baselines to UIs rated by 12 human designers, we found that UIClip achieved the highest agreement with ground-truth rankings. Finally, we present three example applications that demonstrate how UIClip can facilitate downstream applications that rely on instantaneous assessment of UI design quality: i) UI code generation, ii) UI design tips generation, and iii) quality-aware UI example search.
