Distributed Stochastic Optimization of a Neural Representation Network for Time-Space Tomography Reconstruction
K. Aditya Mohan, Massimiliano Ferrucci, Chuck Divin, Garrett A. Stevenson, Hyojin Kim
TL;DR
This work introduces Distributed Implicit Neural Representation (DINR) for 4D CT reconstruction, addressing the ill-posedness and limited-angle artifacts that arise when reconstructing rapidly changing scenes. DINR learns a continuous 4D representation of the object by optimizing a neural network that maps space-time coordinates to the local linear attenuation coefficient, using a forward model that samples a subset of coordinates along rays to compute projections. A distributed stochastic training scheme enables training on large HPC clusters, achieving high-fidelity reconstructions at terabyte-scale data sizes and substantially lower per-GPU memory requirements than voxel-grid INRs. Across experimental and simulated datasets, DINR outperforms state-of-the-art methods in PSNR/SSIM and visual fidelity, demonstrating strong scalability and the potential for fast, high-resolution 4D CT in dynamic in-situ experiments.
Abstract
4D time-space reconstruction of dynamic events or deforming objects using X-ray computed tomography (CT) is an important inverse problem in non-destructive evaluation. Conventional back-projection based reconstruction methods assume that the object remains static for the duration of several tens or hundreds of X-ray projection measurement images (reconstruction of consecutive limited-angle CT scans). However, this is an unrealistic assumption for many in-situ experiments that causes spurious artifacts and inaccurate morphological reconstructions of the object. To solve this problem, we propose to perform a 4D time-space reconstruction using a distributed implicit neural representation (DINR) network that is trained using a novel distributed stochastic training algorithm. Our DINR network learns to reconstruct the object at its output by iterative optimization of its network parameters such that the measured projection images best match the output of the CT forward measurement model. We use a forward measurement model that is a function of the DINR outputs at a sparsely sampled set of continuous valued 4D object coordinates. Unlike previous neural representation architectures that forward and back propagate through dense voxel grids that sample the object's entire time-space coordinates, we only propagate through the DINR at a small subset of object coordinates in each iteration resulting in an order-of-magnitude reduction in memory and compute for training. DINR leverages distributed computation across several compute nodes and GPUs to produce high-fidelity 4D time-space reconstructions. We use both simulated parallel-beam and experimental cone-beam X-ray CT datasets to demonstrate the superior performance of our approach.
