Accelerating Black Hole Image Generation via Latent Space Diffusion Models
Ao Liu, Xudong Zhang, Cuihong Wen, Wentao Liu, Jieci Wang
TL;DR
The paper tackles the computational bottleneck in horizon-scale black hole imaging posed by General Relativistic Ray Tracing, introducing a latent-space diffusion approach trained on PCA-compressed GRRT data. By projecting 65,536-dimensional images into a 256-dimensional latent space and conditioning a diffusion model on physical parameters, the method achieves real-time-like image generation with a fixed PCA decoder, while maintaining physical fidelity. The core innovations include integrating a self-attention mechanism into the parameter-conditioning pathway and operating diffusion entirely in the latent space, resulting in substantial improvements in both image quality and parameter estimation over pixel-space baselines. This approach enables rapid parameter exploration and robust inference for next-generation black hole imaging, with potential extensions to other accretion-flow models and polarization information.
Abstract
Interpreting horizon-scale black hole images currently relies on computationally intensive General Relativistic Ray Tracing (GRRT) simulations, which pose a significant bottleneck for rapid parameter exploration and high-precision tests of strong-field gravity. We demonstrate that physically accurate black hole images, synthesized from magnetized accretion flows, inherently reside on a low-dimensional manifold-encoding the essential features of spacetime geometry, plasma distribution, and relativistic emission. Leveraging this structure, we introduce a physics-conditioned diffusion model that operates in a compact latent space to generate high-fidelity black hole imagery directly from physical parameters. The model accurately reproduces critical observational signatures from full GRRT simulations-such as shadow diameter, photon-ring structure, and relativistic brightness asymmetry-while achieving over a fourfold reduction in computational expense. Compared with the previous generation of denoising diffusion models, the proposed approach achieves significant improvements in image quality, reconstruction fidelity, and parameter estimation accuracy, while reducing the average inference time per black hole image from 5.25 seconds to 1.15 seconds. Our work establishes diffusion-based latent models as efficient and scalable substitutes for traditional radiative transfer solvers, offering a practical framework toward real-time modeling and inference for next-generation black hole imaging.
