HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks
Burak Ercan, Onur Eker, Canberk Saglam, Aykut Erdem, Erkut Erdem
TL;DR
HyperE2VID addresses the challenge of reconstructing high-quality intensity videos from sparse event streams by using hypernetworks to generate per-pixel adaptive filters. A context fusion module guides the dynamic filters with information from event voxel grids and previously reconstructed frames, and a curriculum learning strategy stabilizes training. Empirical results show HyperE2VID outperforms state-of-the-art E2VID+-based methods with fewer parameters and faster inference, across multiple datasets and scenarios including high frame rates and motionless periods. An extensive ablation study clarifies the roles of context information, dynamic convolutions, and hypernetworks, while a simple post-processing step can further reduce textureless-region artifacts. This combination yields robust, efficient event-based video reconstruction with practical applicability to real-world fast-motion and low-light conditions.
Abstract
Event-based cameras are becoming increasingly popular for their ability to capture high-speed motion with low latency and high dynamic range. However, generating videos from events remains challenging due to the highly sparse and varying nature of event data. To address this, in this study, we propose HyperE2VID, a dynamic neural network architecture for event-based video reconstruction. Our approach uses hypernetworks to generate per-pixel adaptive filters guided by a context fusion module that combines information from event voxel grids and previously reconstructed intensity images. We also employ a curriculum learning strategy to train the network more robustly. Our comprehensive experimental evaluations across various benchmark datasets reveal that HyperE2VID not only surpasses current state-of-the-art methods in terms of reconstruction quality but also achieves this with fewer parameters, reduced computational requirements, and accelerated inference times.
