Anime Popularity Prediction Before Huge Investments: a Multimodal Approach Using Deep Learning
Jesús Armenta-Segura, Grigori Sidorov
TL;DR
One of the most comprehensive free datasets for predicting anime popularity using only features accessible before huge investments using only features accessible before huge investments is introduced, relying solely on freely available internet data and adhering to rigorous standards based on real-life experiences.
Abstract
In the japanese anime industry, predicting whether an upcoming product will be popular is crucial. This paper presents a dataset and methods on predicting anime popularity using a multimodal textimage dataset constructed exclusively from freely available internet sources. The dataset was built following rigorous standards based on real-life investment experiences. A deep neural network architecture leveraging GPT-2 and ResNet-50 to embed the data was employed to investigate the correlation between the multimodal text-image input and a popularity score, discovering relevant strengths and weaknesses in the dataset. To measure the accuracy of the model, mean squared error (MSE) was used, obtaining a best result of 0.011 when considering all inputs and the full version of the deep neural network, compared to the benchmark MSE 0.412 obtained with traditional TF-IDF and PILtotensor vectorizations. This is the first proposal to address such task with multimodal datasets, revealing the substantial benefit of incorporating image information, even when a relatively small model (ResNet-50) was used to embed them.
