PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?
Martin Spitznagel, Jan Vaillant, Janis Keuper
TL;DR
PhysicsGen investigates whether modern generative image models can learn complex physical relations from input-output image pairs. The authors release a 300k-image-pair benchmark across three physical tasks and assess speedups versus ground-truth PDE-based simulations using architectures including GANs, U-Net, VAEs, and diffusion models. They find notable runtime speedups (up to $2\times 10^4$) for simple 0th- and 1st-order dynamics but observe substantial gaps in accuracy for higher-order terms, underscoring the need for physics-informed losses. The work provides a scalable benchmark and dataset to guide development of neural-enhanced physical simulations.
Abstract
The image-to-image translation abilities of generative learning models have recently made significant progress in the estimation of complex (steered) mappings between image distributions. While appearance based tasks like image in-painting or style transfer have been studied at length, we propose to investigate the potential of generative models in the context of physical simulations. Providing a dataset of 300k image-pairs and baseline evaluations for three different physical simulation tasks, we propose a benchmark to investigate the following research questions: i) are generative models able to learn complex physical relations from input-output image pairs? ii) what speedups can be achieved by replacing differential equation based simulations? While baseline evaluations of different current models show the potential for high speedups (ii), these results also show strong limitations toward the physical correctness (i). This underlines the need for new methods to enforce physical correctness. Data, baseline models and evaluation code http://www.physics-gen.org.
