Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays
Zhenxing Dong, Jidong Jia, Yan Li, Yuye Ling
TL;DR
The paper tackles the memory bottleneck in deep-learning CGH for ultra-high-definition holograms by introducing a divide-conquer-and-merge strategy that splits inputs into $r^2$ sub-images, processes them with phase-generator/phase-encoder branches, and merges them via upsampling to form full-resolution holograms. Central to the approach is a lightweight holographic SR network (LFMN) with Local Feature Modulation and Enhanced Convolutional Channel Mixer, designed to preserve quality while reducing memory usage; the method is extensible through a recursive pyramid form for even larger scales. Empirical results show substantial training memory reductions (e.g., 64.3% for HoloNet and 12.9% for CCNNs) and inference speedups (up to 3× and 2×), enabling 8K holograms in simulations and verified by full-color optical experiments. The framework demonstrates significant practical impact by enabling high-definition holographic displays on consumer GPUs and provides a path toward 16K+ holograms through further architectural integration and memory-aware design.
Abstract
Recently, deep learning-based computer-generated holography (CGH) has demonstrated tremendous potential in three-dimensional (3D) displays and yielded impressive display quality. However, most existing deep learning-based CGH techniques can only generate holograms of 1080p resolution, which is far from the ultra-high resolution (16K+) required for practical virtual reality (VR) and augmented reality (AR) applications to support a wide field of view and large eye box. One of the major obstacles in current CGH frameworks lies in the limited memory available on consumer-grade GPUs which could not facilitate the generation of higher-definition holograms. To overcome the aforementioned challenge, we proposed a divide-conquer-and-merge strategy to address the memory and computational capacity scarcity in ultra-high-definition CGH generation. This algorithm empowers existing CGH frameworks to synthesize higher-definition holograms at a faster speed while maintaining high-fidelity image display quality. Both simulations and experiments were conducted to demonstrate the capabilities of the proposed framework. By integrating our strategy into HoloNet and CCNNs, we achieved significant reductions in GPU memory usage during the training period by 64.3\% and 12.9\%, respectively. Furthermore, we observed substantial speed improvements in hologram generation, with an acceleration of up to 3$\times$ and 2 $\times$, respectively. Particularly, we successfully trained and inferred 8K definition holograms on an NVIDIA GeForce RTX 3090 GPU for the first time in simulations. Furthermore, we conducted full-color optical experiments to verify the effectiveness of our method. We believe our strategy can provide a novel approach for memory- and time-efficient holographic displays.
