Selective Risk Certification for LLM Outputs via Information-Lift Statistics: PAC-Bayes, Robustness, and Skeleton Design
Sanjeda Akter, Ibne Farabi Shihab, Anuj Sharma
TL;DR
The paper introduces information-lift certificates for LLM outputs, leveraging heavy-tail aware sub-gamma PAC-Bayes bounds to provide anytime-valid, sequence-level uncertainty certification that compares model outputs to a skeleton baseline. By defining a clipped information lift and aggregating it across autoregressive sequences, the authors enable formal abstention decisions with finite-sample guarantees and a robust skeleton design via Variational Skeleton Design (VSD). Empirical results across eight diverse datasets show substantial gains in coverage at fixed risk (77.0% at 2% risk) and exceptional blocking of critical errors (96% on a biomedical challenge set) compared with entropy-based baselines, while maintaining practical runtime overhead and adapting to top-k API constraints. The approach emphasizes robustness to distributional shifts and skeleton misspecification, though it remains frequency-based rather than severity-aware, highlighting avenues for future work in harm-aware and severity-weighted certification.
Abstract
Large language models often produce confident but incorrect outputs, creating a critical need for reliable uncertainty quantification with formal abstention guarantees. We introduce information-lift certificates that compare model probabilities to a skeleton baseline, accumulating evidence through sub-gamma PAC-Bayes bounds that remain valid under heavy-tailed distributions where standard concentration inequalities fail. On eight diverse datasets, our method achieves 77.0\% coverage at 2\% risk, outperforming recent baselines by 10.0 percentage points on average. In high-stakes scenarios, we block 96\% of critical errors compared to 18-31\% for entropy-based methods. While our frequency-based certification does not guarantee severity-weighted safety and depends on skeleton quality, performance degrades gracefully under distributional shifts, making the approach practical for real-world deployment.
