Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey
Ruiyao Xu, Kaize Ding
TL;DR
This survey addresses anomaly and out-of-distribution detection in the era of Large Language Models (LLMs) by introducing a two-fold taxonomy: using LLMs for detection (prompting-based and contrasting-based) and using LLMs for generation (augmentation and explanations). It systematically reviews methods across modalities, discusses PEFT and prompt-tuning strategies to adapt LLMs, and highlights the emergence of multimodal LLMs in this domain. The authors catalog datasets, compare performance across settings, and identify challenges such as explainability, hallucination, and efficiency, while outlining future directions like integrating domain knowledge and robust multimodal probing. The work underscores the potential of LLMs to provide zero-shot or few-shot detection capabilities, enhanced interpretability, and scalable data augmentation for robust anomaly and OOD detection in real-world deployments.
Abstract
Detecting anomalies or out-of-distribution (OOD) samples is critical for maintaining the reliability and trustworthiness of machine learning systems. Recently, Large Language Models (LLMs) have demonstrated their effectiveness not only in natural language processing but also in broader applications due to their advanced comprehension and generative capabilities. The integration of LLMs into anomaly and OOD detection marks a significant shift from the traditional paradigm in the field. This survey focuses on the problem of anomaly and OOD detection under the context of LLMs. We propose a new taxonomy to categorize existing approaches into two classes based on the role played by LLMs. Following our proposed taxonomy, we further discuss the related work under each of the categories and finally discuss potential challenges and directions for future research in this field. We also provide an up-to-date reading list of relevant papers.
