EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation

Futian Wang; Fan Zhang; Xiao Wang; Mengqi Wang; Dexing Huang; Jin Tang

EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation

Futian Wang, Fan Zhang, Xiao Wang, Mengqi Wang, Dexing Huang, Jin Tang

TL;DR

This work addresses the challenge of sparse, asynchronous event streams by modeling event tokens as nodes in a hypergraph guided by RGB context to perform spatio-temporal completion. The EvRainDrop framework employs two-stage hypergraph propagation—dynamic self-completion and cross-modal enhancement—followed by temporal self-attention to fuse information across time. It demonstrates state-of-the-art performance on both single-label (HAR) and multi-label (PAR) tasks across four datasets, validating the effectiveness of high-order, multimodal hypergraph completion for event-based perception. The proposed approach offers a principled path to mitigate spatial sparsity while leveraging rich temporal dynamics, with practical implications for robust RGB-Event fusion in real-world scenarios.

Abstract

Event cameras produce asynchronous event streams that are spatially sparse yet temporally dense. Mainstream event representation learning algorithms typically use event frames, voxels, or tensors as input. Although these approaches have achieved notable progress, they struggle to address the undersampling problem caused by spatial sparsity. In this paper, we propose a novel hypergraph-guided spatio-temporal event stream completion mechanism, which connects event tokens across different times and spatial locations via hypergraphs and leverages contextual information message passing to complete these sparse events. The proposed method can flexibly incorporate RGB tokens as nodes in the hypergraph within this completion framework, enabling multi-modal hypergraph-based information completion. Subsequently, we aggregate hypergraph node information across different time steps through self-attention, enabling effective learning and fusion of multi-modal features. Extensive experiments on both single- and multi-label event classification tasks fully validated the effectiveness of our proposed framework. The source code of this paper will be released on https://github.com/Event-AHU/EvRainDrop.

EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation

TL;DR

Abstract

EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)