LogiCode: an LLM-Driven Framework for Logical Anomaly Detection
Yiheng Zhang, Yunkang Cao, Xiaohao Xu, Weiming Shen
TL;DR
LogiCode addresses the need to detect high-level logical anomalies in industrial settings, beyond traditional structural defect detection. It uses LLMs to translate normal-rule knowledge into executable Python code that analyzes images and provides explanations for detected anomalies. The approach introduces LOCO-Annotations and LogiBench datasets for evaluating detection accuracy, code-generation reliability, and reasoning quality, reporting strong binary accuracy and reasoning performance, with code-generation still improving. The work suggests that LLM-driven reasoning and programmable APIs can enhance interpretability and efficiency in industrial quality control, with potential broad impact across manufacturing contexts.
Abstract
This paper presents LogiCode, a novel framework that leverages Large Language Models (LLMs) for identifying logical anomalies in industrial settings, moving beyond traditional focus on structural inconsistencies. By harnessing LLMs for logical reasoning, LogiCode autonomously generates Python codes to pinpoint anomalies such as incorrect component quantities or missing elements, marking a significant leap forward in anomaly detection technologies. A custom dataset "LOCO-Annotations" and a benchmark "LogiBench" are introduced to evaluate the LogiCode's performance across various metrics including binary classification accuracy, code generation success rate, and precision in reasoning. Findings demonstrate LogiCode's enhanced interpretability, significantly improving the accuracy of logical anomaly detection and offering detailed explanations for identified anomalies. This represents a notable shift towards more intelligent, LLM-driven approaches in industrial anomaly detection, promising substantial impacts on industry-specific applications.
