Event2Audio: Event-Based Optical Vibration Sensing
Mingxuan Cai, Dekel Galor, Amit Pal Singh Kohli, Jacob L. Yates, Laura Waller
TL;DR
Event2Audio introduces an active, event-based vibrometry pipeline that converts laser-speckle motion, captured by a high-temporal-resolution event camera, into an audible waveform. By defocusing a speckle pattern and processing asynchronous events with fast or offline optical-flow, the method achieves real-time or near real-time audio reconstruction that outperforms prior approaches in low-frequency capture, multi-source separation, and robustness to environmental distortions, including echoes and underwater conditions. The approach uses simple, compact optics and avoids complex multi-camera setups, enabling practical deployment and broad applicability in speech recovery, noise-robust demixing, and underwater sensing. Overall, the work demonstrates that event-based sensing can significantly accelerate and improve optical vibrometry for passive-to-active translation of imperceptible vibrations into high-fidelity audio.
Abstract
Small vibrations observed in video can unveil information beyond what is visual, such as sound and material properties. It is possible to passively record these vibrations when they are visually perceptible, or actively amplify their visual contribution with a laser beam when they are not perceptible. In this paper, we improve upon the active sensing approach by leveraging event-based cameras, which are designed to efficiently capture fast motion. We demonstrate our method experimentally by recovering audio from vibrations, even for multiple simultaneous sources, and in the presence of environmental distortions. Our approach matches the state-of-the-art reconstruction quality at much faster speeds, approaching real-time processing.
