LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models

Peng Cai; Reza Ryan; Nickson M. Karie

LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models

Peng Cai, Reza Ryan, Nickson M. Karie

TL;DR

LLMLogAnalyzer addresses the challenge of analyzing massive, diverse logs by integrating clustering-based parsing (Drain) with retrieval-augmented generation (RAG) in a modular four-stage framework (indexing, parsing, query, generation). A two-level router directs queries to appropriate search tools (keyword, event, semantic) and combines results with LLM-based generation to produce context-aware, cited answers. Across four Loghub-2.0 datasets and seven analysis tasks, the system outperforms baselines such as ChatGPT, ChatPDF, and NotebookLM, with the 70B variant showing the strongest performance and robustness (lower variability). The approach demonstrates a scalable, accessible path for non-technical users to perform sophisticated log analysis using open-source models and on-premises deployments, with clear avenues for enterprise-scale extension and benchmarking against domain-specific solutions.

Abstract

System logs are a cornerstone of cybersecurity, supporting proactive breach prevention and post-incident investigations. However, analyzing vast amounts of diverse log data remains significantly challenging, as high costs, lack of in-house expertise, and time constraints make even basic analysis difficult for many organizations. This study introduces LLMLogAnalyzer, a clustering-based log analysis chatbot that leverages Large Language Models (LLMs) and Machine Learning (ML) algorithms to simplify and streamline log analysis processes. This innovative approach addresses key LLM limitations, including context window constraints and poor structured text handling capabilities, enabling more effective summarization, pattern extraction, and anomaly detection tasks. LLMLogAnalyzer is evaluated across four distinct domain logs and various tasks. Results demonstrate significant performance improvements over state-of-the-art LLM-based chatbots, including ChatGPT, ChatPDF, and NotebookLM, with consistent gains ranging from 39% to 68% across different tasks. The system also exhibits strong robustness, achieving a 93% reduction in interquartile range (IQR) when using ROUGE-1 scores, indicating significantly lower result variability. The framework's effectiveness stems from its modular architecture comprising a router, log recognizer, log parser, and search tools. This design enhances LLM capabilities for structured text analysis while improving accuracy and robustness, making it a valuable resource for both cybersecurity experts and non-technical users.

LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models

TL;DR

Abstract

LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)