LLMTemporalComparator: A Tool for Analysing Differences in Temporal Adaptations of Large Language Models

Reinhard Friedrich Fritsch, Adam Jatowt

TL;DR

A novel system is proposed that systematically compares the outputs of two LLM versions on user-defined queries to identify differences in vocabulary, information presentation, and underlying themes.

Abstract

This study addresses the challenges of analyzing temporal discrepancies in large language models (LLMs) trained on data from different time periods. To facilitate the automatic exploration of these differences, we propose a novel system that systematically compares the outputs of two LLM versions based on user-defined queries. The system first generates a hierarchical topic structure rooted in a user-specified keyword, allowing for an organized comparison of topical categories. It then evaluates the text generated by both LLMs to identify differences in vocabulary, information presentation, and underlying themes. This fully automated approach not only streamlines the identification of shifts in public opinion and cultural norms but also enhances our understanding of the adaptability and robustness of machine learning applications in response to temporal changes. By fostering research in continual model adaptation and comparative summarization, this work contributes to the development of more transparent machine learning models capable of capturing the nuances of evolving societal contexts.
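The two-stage workflow described in the abstract (build a topic hierarchy, then compare what each model version writes per topic) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `TopicNode` class, the Jaccard word-overlap score, and the two stub functions standing in for LLM versions are hypothetical and are not taken from the paper, which does not specify its similarity measure in this excerpt.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class TopicNode:
    """One node of the topic hierarchy; children are its subcategories."""
    name: str
    children: List["TopicNode"] = field(default_factory=list)
    similarity: Optional[float] = None  # filled in by compare_tree


def tokenize(text: str) -> set:
    """Lowercased word set; a deliberately crude proxy for vocabulary."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(a: str, b: str) -> float:
    """Word-set overlap in [0, 1]; 1.0 means identical vocabularies.

    Assumption: Jaccard is used only as a stand-in similarity score.
    """
    ta, tb = tokenize(a), tokenize(b)
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def compare_tree(node: TopicNode,
                 older: Callable[[str], str],
                 newer: Callable[[str], str]) -> None:
    """Score each topic by comparing the two models' text about it."""
    node.similarity = jaccard(older(node.name), newer(node.name))
    for child in node.children:
        compare_tree(child, older, newer)


# Hypothetical stand-ins for two LLM versions trained on different periods.
def older_model(topic: str) -> str:
    return f"{topic} is about forums and blogs"


def newer_model(topic: str) -> str:
    return f"{topic} is about short videos and blogs"


# Topic names borrowed from the figure captions; the tree itself is hand-built.
root = TopicNode("Internet", [TopicNode("Social Media")])
compare_tree(root, older_model, newer_model)
```

In a real setup the stub functions would be replaced by calls to the two model versions, and the per-node scores could drive a colouring of the tree such as the one the figure list mentions.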

Paper Structure

This paper contains 8 sections, 7 figures, 1 table.

Figures (7)

  • Figure 1: Node "Social Media" is automatically divided into five subcategories.
  • Figure 2: System workflow.
  • Figure 3: Category chain used to generate subcategories for root topic "Internet" and child topic "Social Media".
  • Figure 4: Tree structure with similarity score colouring.
  • Figure 5: Treemap with similarity score colouring.
  • ...and 2 more figures