TimeKAN: KAN-based Frequency Decomposition Learning Architecture for Long-term Time Series Forecasting
Songtao Huang, Zhen Zhao, Can Li, Lei Bai
TL;DR
TimeKAN tackles long-term time series forecasting by explicitly decoupling multi-frequency components through a Decomposition-Learning-Mixing pipeline. It combines Cascaded Frequency Decomposition (CFD) to obtain per-frequency representations with a novel Multi-order KAN (M-KAN) that uses Chebyshev polynomial-based univariate functions of increasing order, and Frequency Mixing to reconstruct the original format before forecasting with a linear head. The approach leverages hierarchical preprocessing via moving averages and FFT-based frequency handling to model frequency-specific temporal patterns with lightweight parameters, achieving state-of-the-art results on several real-world datasets while remaining computationally efficient. Empirically, TimeKAN shows strong performance across multiple horizons, with ablations confirming the value of frequency upsampling, multi-order KANs, and depthwise convolution, and a notable improvement when longer look-back windows provide richer frequency information.
Abstract
Real-world time series often have multiple frequency components that are intertwined with each other, making accurate time series forecasting challenging. Decomposing the mixed frequency components into multiple single frequency components is a natural choice. However, the information density of patterns varies across different frequencies, and employing a uniform modeling approach for different frequency components can lead to inaccurate characterization. To address this challenges, inspired by the flexibility of the recent Kolmogorov-Arnold Network (KAN), we propose a KAN-based Frequency Decomposition Learning architecture (TimeKAN) to address the complex forecasting challenges caused by multiple frequency mixtures. Specifically, TimeKAN mainly consists of three components: Cascaded Frequency Decomposition (CFD) blocks, Multi-order KAN Representation Learning (M-KAN) blocks and Frequency Mixing blocks. CFD blocks adopt a bottom-up cascading approach to obtain series representations for each frequency band. Benefiting from the high flexibility of KAN, we design a novel M-KAN block to learn and represent specific temporal patterns within each frequency band. Finally, Frequency Mixing blocks is used to recombine the frequency bands into the original format. Extensive experimental results across multiple real-world time series datasets demonstrate that TimeKAN achieves state-of-the-art performance as an extremely lightweight architecture. Code is available at https://github.com/huangst21/TimeKAN.
