PromptMobile: Efficient Promptus for Low Bandwidth Mobile Video Streaming
Liming Liu, Jiangkai Wu, Haoyang Wang, Peiheng Wang, Zongming Guo, Xinggong Zhang
TL;DR
The paper tackles the challenge of real-time, low-bandwidth mobile video streaming using diffusion-based Promptus, which traditionally requires desktop-grade compute. It introduces PromptMobile, an on-device acceleration framework that combines a two-stage generation pathway, fine-grained inter-frame caching, and system-level optimizations to reach ambitious mobile performance. The key contributions include a $8.1\times$ reduction in generation cost, a $16.6\%$ reduction from inter-frame caching, and a $13.6\times$ speedup over the original Promptus, while delivering an average LPIPS improvement of $0.016$ at 280 kbps and reducing 60% of severely distorted frames compared to VQGAN. The approach demonstrates practical impact for bandwidth-constrained, mobile video streaming and showcases the viability of optimized on-device diffusion pipelines with hardware-aware acceleration.
Abstract
Traditional video compression algorithms exhibit significant quality degradation at extremely low bitrates. Promptus emerges as a new paradigm for video streaming, substantially cutting down the bandwidth essential for video streaming. However, Promptus is computationally intensive and can not run in real-time on mobile devices. This paper presents PromptMobile, an efficient acceleration framework tailored for on-device Promptus. Specifically, we propose (1) a two-stage efficient generation framework to reduce computational cost by 8.1x, (2) a fine-grained inter-frame caching to reduce redundant computations by 16.6%, (3) system-level optimizations to further enhance efficiency. The evaluations demonstrate that compared with the original Promptus, PromptMobile achieves a 13.6x increase in image generation speed. Compared with other streaming methods, PromptMobile achives an average LPIPS improvement of 0.016 (compared with H.265), reducing 60% of severely distorted frames (compared to VQGAN).
