Boosting Bitcoin Minute Trend Prediction Using the Separation Index
Zeinab Shahsafdari, Ahmad Kalhor
TL;DR
Bitcoin minute-trend forecasting is challenging due to extreme volatility. The authors propose Separation Index (SI), a normalized distance-based separability metric, to rank and selectively retain observations that maximize class separation before training: $SI(Data) = \sum_{i=1}^{m} \delta(l_i, l_{i^*})$, with $i^* = \arg_{q\neq i} \min ||x_i - x_q||^2$. Leveraging minute-level data from Freqtrade on KuCoin (2018–2022) and Bayes-optimized technical indicators plus lag features, they train a BiLSTM for direction and a CNN for magnitude, combined via a voting classifier. The method yields strong test performance, with a maximum of $83.70\%$ accuracy (average $80.23\%$) and shows that SI-guided data selection can closely align with or exceed results obtained from larger, less carefully curated feature sets. The approach demonstrates a practical, data-efficient path toward high-frequency Bitcoin forecasting without relying on fundamental or social-sentiment data, offering meaningful implications for HFT decision-support systems.
Abstract
Predicting the trend of Bitcoin, a highly volatile cryptocurrency, remains a challenging task. Accurate forecasting holds immense potential for investors and market participants dealing with High Frequency Trading systems. The purpose of this study is to demonstrate the significance of using a systematic approach toward selecting informative observations for enhancing Bitcoin minute trend prediction. While a multitude of data collection methods exist, a crucial barrier remains: efficiently selecting the most informative data for building powerful prediction models. This study tackles this challenge head-on by introducing the Separation Index, a groundbreaking tool for fast and effective data (feature) subset selection. The Separation Index operates by measuring the improvement in class separability (i.e. upward vs. downward trends) with each added feature set. This innovative metric guides the creation of a highly informative dataset, maximizing the model's ability to differentiate between price movements. Our research demonstrates the effectiveness of this approach, achieving unprecedented accuracy in minute-scale Bitcoin trend prediction, surpassing the performance of previous studies. This significant advancement paves the way for a new era of data-driven decision-making in the dynamic world of cryptocurrency markets.
