An Overview of Large Language Models for Statisticians

Wenlong Ji; Weizhe Yuan; Emily Getzen; Kyunghyun Cho; Michael I. Jordan; Song Mei; Jason E Weston; Weijie J. Su; Jing Xu; Linjun Zhang

An Overview of Large Language Models for Statisticians

Wenlong Ji, Weizhe Yuan, Emily Getzen, Kyunghyun Cho, Michael I. Jordan, Song Mei, Jason E Weston, Weijie J. Su, Jing Xu, Linjun Zhang

TL;DR

The paper surveys how statisticians can contribute to large language models by integrating statistical rigor into training, evaluation, and deployment to improve trustworthiness. It covers the full LLM lifecycle—from representation learning and Transformer architectures to prompting, fine-tuning, alignment, and self-alignment—with a strong emphasis on uncertainty quantification, privacy, fairness, watermarking, and interpretability. It then articulates how LLMs can empower statistical analysis, data collection, cleaning, and medical research, while discussing the reciprocal role of statisticians in shaping safe and effective AI systems. By offering a structured synthesis and roadmap, the paper advocates closer collaboration between statistics and AI to advance both theoretical foundations and practical societal applications of LLMs.

Abstract

Large Language Models (LLMs) have emerged as transformative tools in artificial intelligence (AI), exhibiting remarkable capabilities across diverse tasks such as text generation, reasoning, and decision-making. While their success has primarily been driven by advances in computational power and deep learning architectures, emerging problems -- in areas such as uncertainty quantification, decision-making, causal inference, and distribution shift -- require a deeper engagement with the field of statistics. This paper explores potential areas where statisticians can make important contributions to the development of LLMs, particularly those that aim to engender trustworthiness and transparency for human users. Thus, we focus on issues such as uncertainty quantification, interpretability, fairness, privacy, watermarking and model adaptation. We also consider possible roles for LLMs in statistical analysis. By bridging AI and statistics, we aim to foster a deeper collaboration that advances both the theoretical foundations and practical applications of LLMs, ultimately shaping their role in addressing complex societal challenges.

An Overview of Large Language Models for Statisticians

TL;DR

Abstract

An Overview of Large Language Models for Statisticians

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)