Neural Scene Baking for Permutation Invariant Transparency Rendering with Real-time Global Illumination
Ziyang Zhang, Edgar Simo-Serra
TL;DR
This work tackles real-time rendering of scenes with transparent objects and global illumination by introducing GlassNet, a neural renderer that separates opaque and transparent G-buffers and uses a permutation-invariant blending function to achieve order-independent transparency. GlassNet comprises a scene encoder, a permutation-invariant transparency buffer blender, a final blending network, and a rendering network, all trained end-to-end to predict indirect lighting alongside direct lighting. The approach delivers real-time performance (e.g., 256×256 at 63 FPS; 512×512 at 32 FPS) while preserving complex transparency shading and textures, and it improves memory efficiency through a symmetric, accumulation-based blending of transparency buffers. Limitations include challenges with strong refraction and participating media, with potential future work in path prediction and hybrid neural rendering techniques to broaden applicability and fidelity.
Abstract
Neural rendering provides a fundamentally new way to render photorealistic images. Similar to traditional light-baking methods, neural rendering utilizes neural networks to bake representations of scenes, materials, and lights into latent vectors learned from path-traced ground truth. However, existing neural rendering algorithms typically rely on G-buffers to provide the position, normal, and texture information of a scene. Because a G-buffer stores only the nearest surface at each pixel, transparent surfaces occlude the geometry behind them, leading to distortions and loss of detail in the rendered images. To address this limitation, we propose a novel neural rendering pipeline that accurately renders the scene behind transparent surfaces with global illumination in variable scenes. Our method separates the G-buffers of opaque and transparent objects, retaining G-buffer information behind transparent objects. Additionally, to render the transparent objects independently of their order, we design a new permutation-invariant neural blending function. We integrate our algorithm into an efficient custom renderer to achieve real-time performance. Our results show that our method renders photorealistic images with variable scenes and viewpoints, accurately capturing complex transparent structures along with global illumination. Our renderer achieves real-time performance ($256\times 256$ at 63 FPS and $512\times 512$ at 32 FPS) on scenes with multiple variable transparent objects.
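To make the notion of permutation invariance concrete, the following is a minimal sketch of a symmetric, accumulation-based blend over transparent G-buffer layers. It is an illustration under stated assumptions, not the GlassNet architecture: the function name `blend_transparent_layers`, the weight matrices `w_embed` and `w_out`, the layer and channel counts, and the choice of sum pooling with ReLU and tanh nonlinearities are all hypothetical. The only property it is meant to show is that a shared per-layer embedding followed by a sum over layers yields the same result for any layer ordering.

```python
import numpy as np


def blend_transparent_layers(layers, w_embed, w_out):
    """Blend K transparent G-buffer layers without depending on their order.

    layers:  (K, H, W, C) array, one G-buffer per transparent layer (any order).
    w_embed: (C, D) weights shared by every layer (hypothetical).
    w_out:   (D, D_out) weights applied after accumulation (hypothetical).
    """
    # Embed every layer with the same weights, so no layer is treated specially.
    embedded = np.maximum(layers @ w_embed, 0.0)  # (K, H, W, D)
    # Summation is symmetric in its arguments: reordering the layers
    # leaves the accumulated feature unchanged (up to rounding).
    pooled = embedded.sum(axis=0)  # (H, W, D)
    # Map the accumulated feature to the blended per-pixel output.
    return np.tanh(pooled @ w_out)  # (H, W, D_out)


# Toy data: 3 transparent layers, a 4x4 image, 8 G-buffer channels (all assumed).
rng = np.random.default_rng(0)
layers = rng.normal(size=(3, 4, 4, 8))
w_embed = rng.normal(size=(8, 16))
w_out = rng.normal(size=(16, 5))

blended = blend_transparent_layers(layers, w_embed, w_out)
shuffled = blend_transparent_layers(layers[[2, 0, 1]], w_embed, w_out)

# Compare the two orderings within floating-point tolerance.
print(np.allclose(blended, shuffled))
```

Because the accumulated feature has a fixed size regardless of the number of layers, a blend of this form also needs no per-pixel sorting and no storage that grows with the layer count, which is consistent with the memory-efficiency motivation stated in the TL;DR.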
