Application of Machine Learning in Stock Market Forecasting: A Case Study of Disney Stock
Dengxin Huang
TL;DR
This work addresses predicting Disney stock returns by combining classic asset-pricing factors with data-driven feature engineering in a 750×16 dataset. It systematically compares Fama French 3-factor modeling, linear regression, random forest, and gradient boosting, after rigorous data cleaning, preprocessing, and clustering-based feature construction. The key finding is that linear regression using common factors and a clustering-derived idiosyncratic feature achieves the highest accuracy ($95.23\%$), outperforming nonparametric ensemble methods. The study demonstrates the practical value of integrating established financial factors with machine learning features for stock return prediction and suggests expanding the analysis to more stocks and longer periods to enhance generalizability.
Abstract
This document presents a stock market analysis conducted on a dataset consisting of 750 instances and 16 attributes donated in 2014-10-23. The analysis includes an exploratory data analysis (EDA) section, feature engineering, data preparation, model selection, and insights from the analysis. The Fama French 3-factor model is also utilized in the analysis. The results of the analysis are presented, with linear regression being the best-performing model.
