Wandering around: A bioinspired approach to visual attention through object motion sensitivity

Giulia D Angelo; Victoria Clerico; Chiara Bartolozzi; Matej Hoffmann; P. Michael Furlong; Alexander Hadjiivanov

Wandering around: A bioinspired approach to visual attention through object motion sensitivity

Giulia D Angelo, Victoria Clerico, Chiara Bartolozzi, Matej Hoffmann, P. Michael Furlong, Alexander Hadjiivanov

TL;DR

The paper addresses real-time, low-power visual perception in dynamic environments by proposing an end-to-end bioinspired attention system that leverages event-based sensing and neuromorphic computation. It combines a Spiking Object Motion Sensitivity (sOMS) module with a Spiking Neural Network proto-object model to produce a saliency map, which guides a Spiking Attention Control to perform saccades toward salient objects, with fixational eye movements to reveal the next focal point. The approach is learning-free and hardware-oriented, demonstrated on the Speck neuromorphic platform with a Pan-Tilt Unit, achieving mean IoU 82.2% and SSIM 96% on EVIMO, and object detection accuracies around 89% in office and low-light scenarios, all with a real-time ~0.12 s response. This work demonstrates robust motion segmentation and attention in diverse conditions, offering a foundation for fully neuromorphic, real-time robotic perception without large training datasets and with potential for end-to-end hardware deployment.

Abstract

Active vision enables dynamic visual perception, offering an alternative to static feedforward architectures in computer vision, which rely on large datasets and high computational resources. Biological selective attention mechanisms allow agents to focus on salient Regions of Interest (ROIs), reducing computational demand while maintaining real-time responsiveness. Event-based cameras, inspired by the mammalian retina, enhance this capability by capturing asynchronous scene changes enabling efficient low-latency processing. To distinguish moving objects while the event-based camera is in motion the agent requires an object motion segmentation mechanism to accurately detect targets and center them in the visual field (fovea). Integrating event-based sensors with neuromorphic algorithms represents a paradigm shift, using Spiking Neural Networks to parallelize computation and adapt to dynamic environments. This work presents a Spiking Convolutional Neural Network bioinspired attention system for selective attention through object motion sensitivity. The system generates events via fixational eye movements using a Dynamic Vision Sensor integrated into the Speck neuromorphic hardware, mounted on a Pan-Tilt unit, to identify the ROI and saccade toward it. The system, characterized using ideal gratings and benchmarked against the Event Camera Motion Segmentation Dataset, reaches a mean IoU of 82.2% and a mean SSIM of 96% in multi-object motion segmentation. The detection of salient objects reaches 88.8% accuracy in office scenarios and 89.8% in low-light conditions on the Event-Assisted Low-Light Video Object Segmentation Dataset. A real-time demonstrator shows the system's 0.12 s response to dynamic scenes. Its learning-free design ensures robustness across perceptual scenes, making it a reliable foundation for real-time robotic applications serving as a basis for more complex architectures.

Wandering around: A bioinspired approach to visual attention through object motion sensitivity

TL;DR

Abstract

Wandering around: A bioinspired approach to visual attention through object motion sensitivity

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)