Transformer Encoder and Multi-features Time2Vec for Financial Prediction
Nguyen Kim Hai Bui, Nguyen Duy Chien, Péter Kovács, Gergő Bognár
TL;DR
The paper addresses financial time-series forecasting, where short-term fluctuations and long-term dependencies complicate predictions. It introduces TT2VFin, a neural architecture that fuses Time2Vec temporal encoding with a Transformer Encoder and employs correlation-based feature selection and GMNN feature aggregation to exploit inter-stock relationships. The study shows Time2Vec encoding outperforms traditional positional encoding and that correlation-driven multi-feature inputs improve accuracy, achieving competitive performance with fewer parameters than RNN/LSTM baselines. The approach demonstrates strong generalization and potential applicability to other financial and time-series forecasting problems, including intra-day data.
Abstract
Financial prediction is a complex and challenging task of time series analysis and signal processing, expected to model both short-term fluctuations and long-term temporal dependencies. Transformers have remarkable success mostly in natural language processing using attention mechanism, which also influenced the time series community. The ability to capture both short and long-range dependencies helps to understand the financial market and to recognize price patterns, leading to successful applications of Transformers in stock prediction. Although, the previous research predominantly focuses on individual features and singular predictions, that limits the model's ability to understand broader market trends. In reality, within sectors such as finance and technology, companies belonging to the same industry often exhibit correlated stock price movements. In this paper, we develop a novel neural network architecture by integrating Time2Vec with the Encoder of the Transformer model. Based on the study of different markets, we propose a novel correlation feature selection method. Through a comprehensive fine-tuning of multiple hyperparameters, we conduct a comparative analysis of our results against benchmark models. We conclude that our method outperforms other state-of-the-art encoding methods such as positional encoding, and we also conclude that selecting correlation features enhance the accuracy of predicting multiple stock prices.
