Neural Cellular Automata: From Cells to Pixels
Ehsan Pajouheshgar, Yitao Xu, Ali Abbasi, Alexander Mordvintsev, Wenzel Jakob, Sabine Süsstrunk
TL;DR
The paper addresses the resolution bottleneck of Neural Cellular Automata by decoupling dynamics from appearance: NCAs run on a coarse grid while a lightweight Local Pattern Producing Network (LPPN) renders high-resolution outputs as a neural field conditioned on local cell states and coordinates. End-to-end training uses task-specific losses for texture synthesis and morphology growth, including patch-based multi-scale OT texture loss, PBR cross-map alignment via pseudo targets, auto-correlation regularization, and LPIPS perceptual terms. The hybrid NCA+LPPN achieves real-time, high-resolution outputs across 2D/3D grids and meshes (e.g., $96$–$128^2$ NCA grids render to $768$–$1024^2$ or higher resolutions) while preserving the self-organizing properties of NCAs such as robustness and regeneration. An interactive web demo in the browser demonstrates practical deployment, highlighting the approach’s potential for scalable, generative self-organization in graphics and texture synthesis. The work lays groundwork for extending to full 3D assets, higher resolutions, and tighter integration between rendering and learning objectives, offering a pathway to efficient, controllable, high-fidelity self-organizing systems.
Abstract
Neural Cellular Automata (NCAs) are bio-inspired dynamical systems in which identical cells iteratively apply a learned local update rule to self-organize into complex patterns, exhibiting regeneration, robustness, and spontaneous dynamics. Despite their success in texture synthesis and morphogenesis, NCAs remain largely confined to low-resolution outputs. This limitation stems from (1) training time and memory requirements that grow quadratically with grid size, (2) the strictly local propagation of information that impedes long-range cell communication, and (3) the heavy compute demands of real-time inference at high resolution. In this work, we overcome this limitation by pairing an NCA that evolves on a coarse grid with a lightweight implicit decoder that maps cell states and local coordinates to appearance attributes, enabling the same model to render outputs at arbitrary resolution. Moreover, because both the decoder and NCA updates are local, inference remains highly parallelizable. To supervise high-resolution outputs efficiently, we introduce task-specific losses for morphogenesis (growth from a seed) and texture synthesis with minimal additional memory and computation overhead. Our experiments across 2D/3D grids and mesh domains demonstrate that our hybrid models produce high-resolution outputs in real-time, and preserve the characteristic self-organizing behavior of NCAs.
