Breaking the Gradient Barrier: Unveiling Large Language Models for Strategic Classification
Xinpeng Lv, Yunxin Mao, Haoxuan Li, Ke Liang, Jinxuan Yang, Wanrong Huang, Haoang Chi, Huan Chen, Long Lan, Yuanlong Chen, Wenjing Yang, Haotian Wang
TL;DR
GLIM presents a gradient-free approach to strategic classification by embedding the bi-level Stackelberg optimization into pre-trained LLMs through in-context learning. The authors show, both theoretically and empirically, that ICL can implicitly simulate both the inner-stage strategic manipulation and the outer-stage decision-rule optimization without fine-tuning, effectively matching gradient-based updates in a forward pass. Across six datasets spanning finance and internet domains, GLIM demonstrates robustness, scalability, and competitive accuracy under strategic manipulation, while highlighting practical considerations like prompt costs and API usage. This work bridges strategic ML and LLMs, offering a retraining-free, scalable pathway for large-scale SC with interpretable attention-based insights.
Abstract
Strategic classification~(SC) explores how individuals or entities modify their features strategically to achieve favorable classification outcomes. However, existing SC methods, which are largely based on linear models or shallow neural networks, face significant limitations in terms of scalability and capacity when applied to real-world datasets with significantly increasing scale, especially in financial services and the internet sector. In this paper, we investigate how to leverage large language models to design a more scalable and efficient SC framework, especially in the case of growing individuals engaged with decision-making processes. Specifically, we introduce GLIM, a gradient-free SC method grounded in in-context learning. During the feed-forward process of self-attention, GLIM implicitly simulates the typical bi-level optimization process of SC, including both the feature manipulation and decision rule optimization. Without fine-tuning the LLMs, our proposed GLIM enjoys the advantage of cost-effective adaptation in dynamic strategic environments. Theoretically, we prove GLIM can support pre-trained LLMs to adapt to a broad range of strategic manipulations. We validate our approach through experiments with a collection of pre-trained LLMs on real-world and synthetic datasets in financial and internet domains, demonstrating that our GLIM exhibits both robustness and efficiency, and offering an effective solution for large-scale SC tasks.
