Theoretical Bounds for Stable In-Context Learning

Tongxi Wang; Zhuoyang Xia

Theoretical Bounds for Stable In-Context Learning

Tongxi Wang, Zhuoyang Xia

TL;DR

This paper designs a two-stage observable estimator that requires no prior knowledge and returns a concrete prompt length with a prescribed failure probability, and derives a non-asymptotic sufficient sample-size requirement (a lower bound on $K$) under sub-Gaussian representations, which induces a conservative upper bound on the unknown stability threshold.

Abstract

In-context learning (ICL) is a pivotal capability for the practical deployment of large-scale language models, yet its stability heavily depends on the number of examples provided in the prompt. Existing methods lack computable theoretical guidance to determine the minimal number of examples required. Heuristic rules commonly used in practice are often overly conservative and non-verifiable, readily leading to either instability from insufficient examples or inefficiency from redundant ones. This paper proposes that ICL stability can be characterized via a spectral-coverage proxy: the smallest eigenvalue of a regularized empirical second-moment matrix of demonstration representations, turning prompt-length selection into a computable estimation problem. We derive a non-asymptotic sufficient sample-size requirement (a lower bound on $K$) under sub-Gaussian representations, which in turn induces a conservative upper bound on the unknown stability threshold. We design a two-stage observable estimator that requires no prior knowledge and returns a concrete prompt length with a prescribed failure probability. Experiments show that the resulting estimates consistently upper-bound empirical knee-points, and a lightweight calibration further tightens the gap to about $1.03$--$1.20\times$, providing verifiable guidance for practical ICL prompt design.

Theoretical Bounds for Stable In-Context Learning

TL;DR

Abstract

Theoretical Bounds for Stable In-Context Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (16)