A Markovian Framing of WaveFunctionCollapse for Procedurally Generating Aesthetically Complex Environments
Franklin Yiu, Mohan Lu, Nina Li, Kevin Joseph, Tianxu Zhang, Julian Togelius, Timothy Merino, Sam Earle
TL;DR
This work tackles procedural content generation by reformulating WaveFunctionCollapse (WFC) as a Markov Decision Process (MDP), enabling objective-driven generation to run alongside WFC's constraint propagation. By decoupling constraint satisfaction from optimization, the authors demonstrate that evolving an action sequence within the WFC-MDP yields faster and more reliable convergence than traditional joint optimization approaches, across binary and biome-rich domains, with performance degrading as path-length targets grow. The study carefully analyzes representation choices (1D vs 2D action encodings) and demonstrates the framework's flexibility by extending to additional biomes, while acknowledging limitations in exploration for the hardest tasks and suggesting future RL or novelty-search approaches. Practically, this decoupled, constraint-aware formulation provides a scalable, ML-friendly blueprint for creating aesthetically complex environments that meet designer-specified objectives, with potential for interactive and scalable game-map generation on consumer hardware.
Abstract
Procedural content generation often requires satisfying both designer-specified objectives and adjacency constraints implicitly imposed by the underlying tile set. To address the challenges of jointly optimizing both constraints and objectives, we reformulate WaveFunctionCollapse (WFC) as a Markov Decision Process (MDP), enabling external optimization algorithms to focus exclusively on objective maximization while leveraging WFC's propagation mechanism to enforce constraint satisfaction. We empirically compare optimizing this MDP to traditional evolutionary approaches that jointly optimize global metrics and local tile placement. Across multiple domains with various difficulties, we find that joint optimization not only struggles as task complexity increases, but consistently underperforms relative to optimization over the WFC-MDP, underscoring the advantages of decoupling local constraint satisfaction from global objective optimization.
