Uncertainty in Natural Language Processing: Sources, Quantification, and Applications
Mengting Hu, Zhen Zhang, Shiwan Zhao, Minlie Huang, Bingzhe Wu
TL;DR
This survey addresses uncertainty in NLP by first classifying its sources into input, system, and output, then detailing three main estimation approaches (calibration-based, sampling-based, distribution-based) and a set of evaluation metrics. It then surveys applications in data filtering, active learning, OOD detection, selective prediction, and efficiency/performance improvements, followed by a discussion of challenges posed by high-dimensional language spaces, variable-length generation, and ethical considerations. The authors argue for a holistic framework that combines theory, methods, and practical guidance to improve reliability and trustworthiness of NLP systems, especially for safety-critical applications. The work also highlights future directions for scalable uncertainty estimation in large pretrained language models and the need for clear uncertainty expression in natural language.
Abstract
As a core field of artificial intelligence, natural language processing (NLP) has achieved remarkable success via deep neural networks. Many NLP tasks are now addressed in a unified manner, with different tasks linked to one another by a shared paradigm. However, neural networks are black boxes that rely on probability computation, so mistakes are inevitable. Estimating the reliability and trustworthiness (in other words, the uncertainty) of neural networks has therefore become a key research direction, one that plays a crucial role in reducing models' risks and supporting better decisions. In this survey, we provide a comprehensive review of uncertainty-relevant work in the NLP field. Considering the characteristics of the data and paradigms, we first categorize the sources of uncertainty in natural language into three types: input, system, and output. Then, we systematically review uncertainty quantification approaches and their main applications. Finally, we discuss the challenges of uncertainty estimation in NLP and outline potential future directions, taking into account recent trends in the field. Although a few surveys on uncertainty estimation exist, our work is the first to review uncertainty from the NLP perspective.
