Large Investment Model
Jian Guo, Heung-Yeung Shum
TL;DR
The paper introduces the Large Investment Model (LIM) to overcome diminishing returns and rising costs in quantitative investment by coupling an upstream foundation model trained on diverse, multi-exchange data with a downstream task model fine-tuned for specific strategies. It emphasizes end-to-end modeling and universal modeling to enable direct strategy generation and cross-task transfer, supported by a patch-based, Transformer-inspired upstream architecture and a data-aligned downstream workflow. The authors detail LIM's system architecture, upstream/downstream design, and automated strategy generation, and they present numerical experiments on cross-instrument commodity futures prediction that leverage insights from stock markets. They also discuss practical challenges (e.g., retraining costs, data selection, integration of alternative data) and outline future research directions, including risk modeling, world simulation, multi-granularity backbones, and agent-enabled scalability. Overall, LIM is positioned as a scalable, knowledge-accelerating paradigm for quantitative finance that aims to unify data augmentation, transfer learning, and automated strategy development across markets and speeds.
Abstract
Traditional quantitative investment research is encountering diminishing returns alongside rising labor and time costs. To overcome these challenges, we introduce the Large Investment Model (LIM), a novel research paradigm designed to enhance both performance and efficiency at scale. LIM employs end-to-end learning and universal modeling to create an upstream foundation model capable of autonomously learning comprehensive signal patterns from diverse financial data spanning multiple exchanges, instruments, and frequencies. These "global patterns" are subsequently transferred to downstream strategy modeling, optimizing performance for specific tasks. We detail the system architecture design of LIM, address the technical challenges inherent in this approach, and outline potential directions for future research. The advantages of LIM are demonstrated through a series of numerical experiments on cross-instrument prediction for commodity futures trading, leveraging insights from stock markets.
