Volumetric Temporal Texture Synthesis for Smoke Stylization using Neural Cellular Automata
Dongqing Wang, Ehsan Pajouheshgar, Yitao Xu, Tong Zhang, Sabine Süsstrunk
TL;DR
This work tackles the challenge of stylizing 3D volumetric smoke with spatiotemporal coherence and efficiency. It introduces Volumetric Neural Cellular Automata (VNCA), a lightweight, trainable 3D texture generator that is trained on a single density frame yet can stylize the entire sequence in real time by leveraging self‑emerging motion and an Eulerian rendering pipeline. VNCA combines a perception‑based update rule with density and velocity priors and a differentiable volume renderer, supervised by a deep style loss and flow‑guided motion terms, achieving real‑time, multi‑view coherent stylization and generalization to unseen data. The approach speeds up training by over an order of magnitude compared to prior volumetric NST methods and can be extended to mesh texturing, offering a practical tool for artists and graphics pipelines. The results demonstrate strong appearance match to reference styles, temporally smooth transitions, and plausible motion consistent with the input smoke dynamics.
Abstract
Artistic stylization of 3D volumetric smoke data is still a challenge in computer graphics due to the difficulty of ensuring spatiotemporal consistency given a reference style image, and that within reasonable time and computational resources. In this work, we introduce Volumetric Neural Cellular Automata (VNCA), a novel model for efficient volumetric style transfer that synthesizes, in real-time, multi-view consistent stylizing features on the target smoke with temporally coherent transitions between stylized simulation frames. VNCA synthesizes a 3D texture volume with color and density stylization and dynamically aligns this volume with the intricate motion patterns of the smoke simulation under the Eulerian framework. Our approach replaces the explicit fluid advection modeling and the inter-frame smoothing terms with the self-emerging motion of the underlying cellular automaton, thus reducing the training time by over an order of magnitude. Beyond smoke simulations, we demonstrate the versatility of our approach by showcasing its applicability to mesh stylization.
