Learning Transactions Representations for Information Management in Banks: Mastering Local, Global, and External Knowledge
Alexandra Bazarova, Maria Kovaleva, Ilya Kuleshov, Evgenia Romanenkova, Alexander Stepikin, Alexandr Yugay, Dzhambulat Mollaev, Ivan Kireev, Andrey Savchenko, Alexey Zaytsev
TL;DR
This study tackles the problem of learning transaction representations for banks that balance global (history-wide) and local (momentary) properties. It benchmarks eight unsupervised approaches spanning generative and contrastive SSL, temporal point processes, and an external-context enrichment technique that aggregates information from other clients. The findings show generative approaches excel at local tasks while contrastive methods perform well on global classification, with external-context aggregation providing substantial local gains (up to $20\%$). A comprehensive evaluation across open datasets (Churn, Default, HSBC, Age) and a real private dataset demonstrates the approach's practical potential for scalable, versatile information management in banking.
Abstract
In today's world, banks use artificial intelligence to optimize diverse business processes, aiming to improve customer experience. Most of the customer-related tasks can be categorized into two groups: 1) local ones, which focus on a client's current state, such as transaction forecasting, and 2) global ones, which consider the general customer behaviour, e.g., predicting successful loan repayment. Unfortunately, maintaining separate models for each task is costly. Therefore, to better facilitate information management, we compared eight state-of-the-art unsupervised methods on 11 tasks in search for a one-size-fits-all solution. Contrastive self-supervised learning methods were demonstrated to excel at global problems, while generative techniques were superior at local tasks. We also introduced a novel approach, which enriches the client's representation by incorporating external information gathered from other clients. Our method outperforms classical models, boosting accuracy by up to 20\%.
