Machine Apophenia: The Kaleidoscopic Generation of Architectural Images

Alexey Tikhonov; Dmitry Sinyavin

Machine Apophenia: The Kaleidoscopic Generation of Architectural Images

Alexey Tikhonov, Dmitry Sinyavin

TL;DR

Addresses how AI can generate architecturally meaningful imagery from random inputs and how audiences perceive AI-driven design. Introduces a kaleidoscope-inspired, machine apophenia framework and an iterative pipeline that seeds keyphrases from $50$ images, expands to $408$ terms, and refines images from $512\times512$ to $1024\times1024$ using an ensemble of CLIP-interrogation, GPT-3, BLIP2, Stable Diffusion v1.4, and SDXL. Key findings include ablation studies showing each component improves aesthetic and technical metrics, plus an observational study linking more engaging keyphrases to higher engagement (up to about $63$ percent). The work outlines implications for AI creativity and the distribution of AI-generated architectural content across social channels, highlighting both opportunities and evaluation challenges.

Abstract

This study investigates the application of generative artificial intelligence in architectural design. We present a novel methodology that combines multiple neural networks to create an unsupervised and unmoderated stream of unique architectural images. Our approach is grounded in the conceptual framework called machine apophenia. We hypothesize that neural networks, trained on diverse human-generated data, internalize aesthetic preferences and tend to produce coherent designs even from random inputs. The methodology involves an iterative process of image generation, description, and refinement, resulting in captioned architectural postcards automatically shared on several social media platforms. Evaluation and ablation studies show the improvement both in technical and aesthetic metrics of resulting images on each step.

Machine Apophenia: The Kaleidoscopic Generation of Architectural Images

TL;DR

images, expands to

terms, and refines images from

using an ensemble of CLIP-interrogation, GPT-3, BLIP2, Stable Diffusion v1.4, and SDXL. Key findings include ablation studies showing each component improves aesthetic and technical metrics, plus an observational study linking more engaging keyphrases to higher engagement (up to about

percent). The work outlines implications for AI creativity and the distribution of AI-generated architectural content across social channels, highlighting both opportunities and evaluation challenges.

Abstract

Paper Structure (8 sections, 4 figures, 2 tables)

This paper contains 8 sections, 4 figures, 2 tables.

Introduction
Framework
Approach
Evaluation
Observational Study
Discussion
Acknowledgments.
Appendix A.

Figures (4)

Figure 1: An overall flowchart of the process.
Figure 3: Examples of images corresponding to queries with high concentrations of high-engaging keyphrases.
Figure 4: Examples of images corresponding to queries with high concentrations of low-engaging keyphrases.
Figure 5: Appearance in different publication channels.

Machine Apophenia: The Kaleidoscopic Generation of Architectural Images

TL;DR

Abstract

Machine Apophenia: The Kaleidoscopic Generation of Architectural Images

Authors

TL;DR

Abstract

Table of Contents

Figures (4)