ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data
Carmen Martin-Turrero, Maxence Bouvier, Manuel Breitenstein, Pietro Zanuttigh, Vincent Parret
TL;DR
This work tackles the challenge of extracting dense, real-time insights from sparse, asynchronous event-based data. It introduces a hybrid architecture that first builds a PointNet-based LERT embedding to create tokens from local event patches, then extends this with an asynchronous ALERT module that updates tokens on the fly using time encoding and a memory-decay mechanism. The end-to-end trainable (A)LERT-Transformer enables both high-accuracy synchronous inference and ultra-low-latency asynchronous inference, demonstrated on gesture recognition and binary classification with favorable latency-accuracy trade-offs. The approach preserves event-data sparsity while exploiting standard ML tooling, making it well-suited for edge applications requiring flexible sampling rates and energy-efficient real-time processing.
Abstract
We seek to enable classic processing of continuous ultra-sparse spatiotemporal data generated by event-based sensors with dense machine learning models. We propose a novel hybrid pipeline composed of asynchronous sensing and synchronous processing that combines several ideas: (1) an embedding based on PointNet models -- the ALERT module -- that can continuously integrate new and dismiss old events thanks to a leakage mechanism, (2) a flexible readout of the embedded data that allows to feed any downstream model with always up-to-date features at any sampling rate, (3) exploiting the input sparsity in a patch-based approach inspired by Vision Transformer to optimize the efficiency of the method. These embeddings are then processed by a transformer model trained for object and gesture recognition. Using this approach, we achieve performances at the state-of-the-art with a lower latency than competitors. We also demonstrate that our asynchronous model can operate at any desired sampling rate.
