Higher Order Transformers: Enhancing Stock Movement Prediction On Multimodal Time-Series Data
Soroush Omranpour, Guillaume Rabusseau, Reihaneh Rabbany
TL;DR
The paper tackles stock movement prediction by introducing Higher Order Transformers that extend self-attention to higher-order, tensor-valued inputs, enabling rich cross-variable and temporal interactions. It couples a low-rank Kronecker decomposition with kernelized linear attention to achieve scalable computation, and an encoder-decoder multimodal architecture to fuse price data with tweet signals. Empirical results on the Stocknet dataset show competitive performance, with notable gains from multimodal integration and the proposed attention mechanisms, while ablations affirm the value of joint stock/time attention and kernel attention. The work demonstrates practical potential for improved market predictions and lays groundwork for future checks on profitability across additional datasets and real-world scenarios.
Abstract
In this paper, we tackle the challenge of predicting stock movements in financial markets by introducing Higher Order Transformers, a novel architecture designed for processing multivariate time-series data. We extend the self-attention mechanism and the transformer architecture to a higher order, effectively capturing complex market dynamics across time and variables. To manage computational complexity, we propose a low-rank approximation of the potentially large attention tensor using tensor decomposition and employ kernel attention, reducing complexity to linear with respect to the data size. Additionally, we present an encoder-decoder model that integrates technical and fundamental analysis, utilizing multimodal signals from historical prices and related tweets. Our experiments on the Stocknet dataset demonstrate the effectiveness of our method, highlighting its potential for enhancing stock movement prediction in financial markets.
