Attention Mamba: Time Series Modeling with Adaptive Pooling Acceleration and Receptive Field Enhancements
Sijie Xiong, Shuqing Liu, Cheng Tang, Fumiya Okubo, Haoling Xiong, Atsushi Shimada
TL;DR
Attention Mamba tackles the limitations of attention and receptive field constraints in time series forecasting by integrating an Adaptive Pooling block that accelerates attention while injecting global information and a bidirectional Mamba block to enhance long-short dependencies. The framework yields improved nonlinear dependency extraction and expanded receptive fields, leading to state-of-the-art performance across diverse real-world datasets. Extensive experiments and ranking analyses demonstrate robust advantages over Transformer-based, Linear-based, and other Mamba variants, with a favorable accuracy-to-complexity trade-off. The work offers practical guidance for efficient, nonlinear, long-range dependency modeling in time series and provides code on GitHub.
Abstract
"This work has been submitted to the lEEE for possible publication. Copyright may be transferred without noticeafter which this version may no longer be accessible." Time series modeling serves as the cornerstone of real-world applications, such as weather forecasting and transportation management. Recently, Mamba has become a promising model that combines near-linear computational complexity with high prediction accuracy in time series modeling, while facing challenges such as insufficient modeling of nonlinear dependencies in attention and restricted receptive fields caused by convolutions. To overcome these limitations, this paper introduces an innovative framework, Attention Mamba, featuring a novel Adaptive Pooling block that accelerates attention computation and incorporates global information, effectively overcoming the constraints of limited receptive fields. Furthermore, Attention Mamba integrates a bidirectional Mamba block, efficiently capturing long-short features and transforming inputs into the Value representations for attention mechanisms. Extensive experiments conducted on diverse datasets underscore the effectiveness of Attention Mamba in extracting nonlinear dependencies and enhancing receptive fields, establishing superior performance among leading counterparts. Our codes will be available on GitHub.
