Table of Contents
Fetching ...

A note on the impossibility of conditional PAC-efficient reasoning in large language models

Hao Zeng

TL;DR

The paper addresses whether conditional PAC-efficient reasoning is possible for large language-model routing. It formalizes a router-based composite predictor that switches between an expensive expert and a cheaper fast model and defines the pointwise risk. The main result shows that, under distribution-free assumptions and non-atomic input spaces, achieving conditional PAC efficiency forces the routing to defer to the expert with probability at least 1-α for almost every input, yielding no practical speedups. This extends known marginal PAC guarantees and aligns with impossibility results in conformal prediction, guiding practitioners to prioritize marginal guarantees or adopt distributional assumptions for conditional notions.

Abstract

We prove an impossibility result for conditional Probably Approximately Correct (PAC)-efficient reasoning in large language models. While recent work has established marginal PAC efficiency guarantees for composite models that switch between expensive expert models and cheaper fast models, we show that conditional (pointwise) guarantees are impossible in the distribution-free setting. Specifically, for non-atomic input spaces, any algorithm achieving conditional PAC efficiency must be trivial in the sense that it defers to the expert model with probability at least $1-α$ for almost every input.

A note on the impossibility of conditional PAC-efficient reasoning in large language models

TL;DR

The paper addresses whether conditional PAC-efficient reasoning is possible for large language-model routing. It formalizes a router-based composite predictor that switches between an expensive expert and a cheaper fast model and defines the pointwise risk. The main result shows that, under distribution-free assumptions and non-atomic input spaces, achieving conditional PAC efficiency forces the routing to defer to the expert with probability at least 1-α for almost every input, yielding no practical speedups. This extends known marginal PAC guarantees and aligns with impossibility results in conformal prediction, guiding practitioners to prioritize marginal guarantees or adopt distributional assumptions for conditional notions.

Abstract

We prove an impossibility result for conditional Probably Approximately Correct (PAC)-efficient reasoning in large language models. While recent work has established marginal PAC efficiency guarantees for composite models that switch between expensive expert models and cheaper fast models, we show that conditional (pointwise) guarantees are impossible in the distribution-free setting. Specifically, for non-atomic input spaces, any algorithm achieving conditional PAC efficiency must be trivial in the sense that it defers to the expert model with probability at least for almost every input.

Paper Structure

This paper contains 7 sections, 2 theorems, 9 equations.

Key Result

Theorem 3

Let $\mathcal{X}$ be a non-atomic complete separable metric space. Assume the fast model $\tilde{f}$ has non-trivial loss, i.e., there exists $E \subset \mathcal{X}$ with $P_X(E) > 0$ such that $\ell(\tilde{f}(x), f(x)) > \epsilon$ for all $x \in E$. Then an algorithm $\mathcal{A}$ is $(\epsilon, \a

Theorems & Definitions (6)

  • Definition 1: Marginal PAC efficiency zeng2025pac
  • Theorem 3: Impossibility
  • Remark 4
  • proof : Proof of Theorem \ref{['thm:impossibility']}
  • Lemma 5: Finite-sample indistinguishability
  • proof