Tracking objects that change in appearance with phase synchrony
Sabine Muzellec, Drew Linsley, Alekh K. Ashok, Ennio Mingolla, Girik Malik, Rufin VanRullen, Thomas Serre
TL;DR
The study tackles object tracking when appearance changes—such as color, shape, or position—over time. It introduces a complex-valued recurrent neural network (CV-RNN) that uses neural synchrony via phase information to bind object features to their locations, allowing attention to be independent of appearance. Through the FeatureTracker benchmark, CV-RNN approaches human performance and often outperforms other deep networks, providing a computational proof-of-concept that phase synchronization can support tracking of appearance-morphing objects. The work suggests concrete neural mechanisms and predictions for neuroscience, and makes data and code publicly available to spur further investigation into aligning machine vision with human-like tracking strategies.
Abstract
Objects we encounter often change appearance as we interact with them. Changes in illumination (shadows), object pose, or the movement of non-rigid objects can drastically alter available image features. How do biological visual systems track objects as they change? One plausible mechanism involves attentional mechanisms for reasoning about the locations of objects independently of their appearances -- a capability that prominent neuroscience theories have associated with computing through neural synchrony. Here, we describe a novel deep learning circuit that can learn to precisely control attention to features separately from their location in the world through neural synchrony: the complex-valued recurrent neural network (CV-RNN). Next, we compare object tracking in humans, the CV-RNN, and other deep neural networks (DNNs), using FeatureTracker: a large-scale challenge that asks observers to track objects as their locations and appearances change in precisely controlled ways. While humans effortlessly solved FeatureTracker, state-of-the-art DNNs did not. In contrast, our CV-RNN behaved similarly to humans on the challenge, providing a computational proof-of-concept for the role of phase synchronization as a neural substrate for tracking appearance-morphing objects as they move about.
