Generative Design through Quality-Diversity Data Synthesis and Language Models
Adam Gaier, James Stoddart, Lorenzo Villaggi, Shyam Sudhakaran
TL;DR
Problem: Generative design in engineering requires diverse, high-quality data and strict constraint adherence. Approach: Synthesize diverse, high-performing data via Quality-Diversity (MAP-Elites), fine-tune a language model on that data to generate high-level designs, and refine them into constraint-satisfying layouts with Wave Function Collapse (WFC). Findings: Datasets built with QD substantially improve fidelity to natural-language prompts and overall validity compared with randomly sampled data, enabling reliable, text-guided architectural layouts via TileGPT. Significance: This data-efficient, interactive framework merges evolutionary data generation with symbolic constraint solving to make generative design more accessible and controllable in architecture and related engineering domains.
Abstract
Two fundamental challenges face generative models in engineering applications: the acquisition of high-performing, diverse datasets, and the adherence to precise constraints in generated designs. We propose a novel approach combining optimization, constraint satisfaction, and language models to tackle these challenges in architectural design. Our method uses Quality-Diversity (QD) to generate a diverse, high-performing dataset. We then fine-tune a language model with this dataset to generate high-level designs. These designs are then refined into detailed, constraint-compliant layouts using the Wave Function Collapse algorithm. Our system demonstrates reliable adherence to textual guidance, enabling the generation of layouts with targeted architectural and performance features. Crucially, our results indicate that data synthesized through the evolutionary search of QD not only improves overall model performance but is essential for the model's ability to closely adhere to textual guidance. This improvement underscores the pivotal role evolutionary computation can play in creating the datasets key to training generative models for design. Web article at https://tilegpt.github.io
