Light Field Compression Based on Implicit Neural Representation
Henan Wang, Hanxin Zhu, Zhibo Chen
TL;DR
The paper tackles the challenge of compressing high-dimensional light field data by introducing an SAI-wise implicit neural representation that overfits to store the 4-D light field in the network parameters. After overfitting, the authors apply pruning, 8-bit quantization, and adaptive entropy coding to produce a compact bitstream, enabling flexible decoding of arbitrary viewpoints (ROI) and improved perceptual quality. Empirical results on EPFL light-field data show competitive rate-distortion performance and superior perceptual quality at higher bitrates, with significantly faster decoding compared to pixel-wise INRs like SIREN. This approach offers a practical, ROI-friendly compression framework that leverages continuous INR representations for 4-D data while benefiting from standard model-compression techniques.
Abstract
Light field, as a new data representation format in multimedia, has the ability to capture both intensity and direction of light rays. However, the additional angular information also brings a large volume of data. Classical coding methods are not effective to describe the relationship between different views, leading to redundancy left. To address this problem, we propose a novel light field compression scheme based on implicit neural representation to reduce redundancies between views. We store the information of a light field image implicitly in an neural network and adopt model compression methods to further compress the implicit representation. Extensive experiments have demonstrated the effectiveness of our proposed method, which achieves comparable rate-distortion performance as well as superior perceptual quality over traditional methods.
