From Cluster to Desktop: A Cache-Accelerated INR framework for Interactive Visualization of Tera-Scale Data
Daniel Zavorotny, Qi Wu, David Bauer, Kwan-Liu Ma
TL;DR
The paper addresses the challenge of visualizing tera-scale scientific data with implicit neural representations by introducing a cache-accelerated INR rendering framework that combines compressed INRs with a scalable MRPD GPU cache. The approach adds a saliency-based priority scheduler and asynchronous brick loading to minimize per-frame INR inferences, enabling interactive visualization on consumer hardware. Key contributions include a full pipeline integrating INR compression with MRPD caching, a bricks-based cache architecture with priority ranking and miss handling, and extensive evaluation demonstrating around a fivefold speedup for ray marching and substantial scalability on large datasets. This work broadens the practical use of INRs for high-performance scientific visualization and lays groundwork for integrating neural representations into extreme-scale visualization workflows.
Abstract
Machine learning has enabled the use of implicit neural representations (INRs) to efficiently compress and reconstruct massive scientific datasets. However, despite advances in fast INR rendering algorithms, INR-based rendering remains computationally expensive, as computing data values from an INR is significantly slower than reading them from GPU memory. This bottleneck currently restricts interactive INR visualization to professional workstations. To address this challenge, we introduce an INR rendering framework accelerated by a scalable, multi-resolution GPU cache capable of efficiently representing tera-scale datasets. By minimizing redundant data queries and prioritizing novel volume regions, our method reduces the number of INR computations per frame, achieving an average 5x speedup over the state-of-the-art INR rendering method while still maintaining high visualization quality. Coupled with existing hardware-accelerated INR compressors, our framework enables scientists to generate and compress massive datasets in situ on high-performance computing platforms and then interactively explore them on consumer-grade hardware post hoc.
