Variance reduction in output from generative AI

Yu Xie; Yueqi Xie

Variance reduction in output from generative AI

Yu Xie, Yueqi Xie

TL;DR

It is demonstrated that generative AI models are inherently prone to the phenomenon of regression toward the mean, whereby variance in output tends to shrink relative to that in real-world distributions.

Abstract

Generative AI models, such as ChatGPT, will increasingly replace humans in producing output for a variety of important tasks. While much prior work has mostly focused on the improvement in the average performance of generative AI models relative to humans' performance, much less attention has been paid to the significant reduction of variance in output produced by generative AI models. In this Perspective, we demonstrate that generative AI models are inherently prone to the phenomenon of "regression toward the mean" whereby variance in output tends to shrink relative to that in real-world distributions. We discuss potential social implications of this phenomenon across three levels-societal, group, and individual-and two dimensions-material and non-material. Finally, we discuss interventions to mitigate negative effects, considering the roles of both service providers and users. Overall, this Perspective aims to raise awareness of the importance of output variance in generative AI and to foster collaborative efforts to meet the challenges posed by the reduction of variance in output generated by AI models.

Variance reduction in output from generative AI

TL;DR

Abstract

Variance reduction in output from generative AI

TL;DR

Abstract

Paper Structure

Figures (1)