A Reservoir-based Model for Human-like Perception of Complex Rhythm Pattern
Zhongju Yuan, Geraint Wiggins, Dick Botteldooren
TL;DR
To address how humans perceive and predict complex rhythms, the paper proposes a biologically inspired four-layer reservoir that implements predictive coding across Tatums, Tactus, Higher Cognition, and Motor layers. The core dynamics follow $h_{t+1}=(1-α)h_t+α f(W_{in} x_t+W h_t+ξ_t)$ with outputs $y_{hat}_t = W_{out} h_t$, enabling predictions ahead of the input rhythm. The Tatum Layer extracts the smallest time unit, the Tactus Layer encodes meter, and the Higher Cognition layer stores long-term rhythm memory to modulate Motor outputs, including damping when missing beats are predicted (with a forward-shifted target of 200 ms). Results show high synchronization accuracy across diverse rhythms and tempos, and beta-band-like motor activity aligns with human EEG studies, supporting the model's biological plausibility and utility for understanding rhythmic cognition.
Abstract
Rhythm is a fundamental aspect of human behaviour, present from infancy and deeply embedded in cultural practices. Rhythm anticipation is a spontaneous cognitive process that typically occurs before the onset of actual beats. While most research in both neuroscience and artificial intelligence has focused on metronome-based rhythm tasks, studies investigating the perception of complex musical rhythm patterns remain limited. To address this gap, we propose a hierarchical oscillator-based model to better understand the perception of complex musical rhythms in biological systems. The model consists of two types of coupled neurons that generate oscillations, with different layers tuned to respond to distinct perception levels. We evaluate the model using several representative rhythm patterns spanning the upper, middle, and lower bounds of human musical perception. Our findings demonstrate that, while maintaining a high degree of synchronization accuracy, the model exhibits human-like rhythmic behaviours. Additionally, the beta band neuronal activity in the model mirrors patterns observed in the human brain, further validating the biological plausibility of the approach.
