Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency, and Accuracy
Boyuan Tian, Yihan Pang, Muhammad Huzaifa, Shenlong Wang, Sarita Adve
TL;DR
This work tackles the energy efficiency challenges in XR scene reconstruction by addressing the trilemma of energy, latency, and accuracy in TSDF Fusion. It introduces three optimization classes—algorithm, execution, and data—that collectively define a large design space of 72 configurations and reveal a Pareto frontier achievable only through co-optimization and consideration of downstream consumer needs. Empirical results on an embedded platform show energy reductions up to $60\times$, with latency and accuracy trade-offs depending on constraints, and a ScanNet case achieving around $25\times$ energy savings with modest latency impact and negligible quality loss. The findings highlight the necessity of system-wide optimization for XR perception tasks and provide actionable guidance for deploying energy-efficient designs on mobile XR devices and beyond.
Abstract
Extended Reality (XR) enables immersive experiences through untethered headsets but suffers from stringent battery and resource constraints. Energy-efficient design is crucial to ensure both longevity and high performance in XR devices. However, latency and accuracy are often prioritized over energy, leading to a gap in achieving energy efficiency. This paper examines scene reconstruction, a key building block for immersive XR experiences, and demonstrates how energy efficiency can be achieved by navigating the trilemma of energy, latency, and accuracy. We explore three classes of energy-oriented optimizations, covering the algorithm, execution, and data, that reveal a broad design space through configurable parameters. Our resulting 72 designs expose a wide range of latency and energy trade-offs, with a smaller range of accuracy loss. We identify a Pareto-optimal curve and show that the designs on the curve are achievable only through synergistic co-optimization of all three optimization classes and by considering the latency and accuracy needs of downstream scene reconstruction consumers. Our analysis covering various use cases and measurements on an embedded class system shows that, relative to the baseline, our designs offer energy benefits of up to 60X with potential latency range of 4X slowdown to 2X speedup. Detailed exploration of a use case across representative data sequences from ScanNet showed about 25X energy savings with 1.5X latency reduction and negligible reconstruction quality loss.
