Table of Contents
Fetching ...

Auditing large language models: a three-layered approach

Jakob Mökander, Jonas Schuett, Hannah Rose Kirk, Luciano Floridi

TL;DR

The paper tackles the governance challenge posed by large language models (LLMs) by proposing a novel three-layer auditing framework that coordinates governance audits of providers, model audits of pre-trained LLMs, and continuous application audits of downstream uses. It argues that auditing must blend risk and compliance perspectives, require external independent review, and foster collaboration between providers and auditors, while emphasizing ex-ante and ex-post monitoring to address emergent, adaptable capabilities. Seven design claims ground the blueprint, highlighting the limits of application-only or purely technical audits and the need to link outputs across levels for meaningful assurance. The work also discusses limitations—normative metric operationalisation, institutional ecosystems, and non-technical risk factors—while outlining avenues for methodological development, institutional experimentation, and policy collaboration to realize robust, multi-stakeholder LLM governance.

Abstract

Large language models (LLMs) represent a major advance in artificial intelligence (AI) research. However, the widespread use of LLMs is also coupled with significant ethical and social challenges. Previous research has pointed towards auditing as a promising governance mechanism to help ensure that AI systems are designed and deployed in ways that are ethical, legal, and technically robust. However, existing auditing procedures fail to address the governance challenges posed by LLMs, which display emergent capabilities and are adaptable to a wide range of downstream tasks. In this article, we address that gap by outlining a novel blueprint for how to audit LLMs. Specifically, we propose a three-layered approach, whereby governance audits (of technology providers that design and disseminate LLMs), model audits (of LLMs after pre-training but prior to their release), and application audits (of applications based on LLMs) complement and inform each other. We show how audits, when conducted in a structured and coordinated manner on all three levels, can be a feasible and effective mechanism for identifying and managing some of the ethical and social risks posed by LLMs. However, it is important to remain realistic about what auditing can reasonably be expected to achieve. Therefore, we discuss the limitations not only of our three-layered approach but also of the prospect of auditing LLMs at all. Ultimately, this article seeks to expand the methodological toolkit available to technology providers and policymakers who wish to analyse and evaluate LLMs from technical, ethical, and legal perspectives.

Auditing large language models: a three-layered approach

TL;DR

The paper tackles the governance challenge posed by large language models (LLMs) by proposing a novel three-layer auditing framework that coordinates governance audits of providers, model audits of pre-trained LLMs, and continuous application audits of downstream uses. It argues that auditing must blend risk and compliance perspectives, require external independent review, and foster collaboration between providers and auditors, while emphasizing ex-ante and ex-post monitoring to address emergent, adaptable capabilities. Seven design claims ground the blueprint, highlighting the limits of application-only or purely technical audits and the need to link outputs across levels for meaningful assurance. The work also discusses limitations—normative metric operationalisation, institutional ecosystems, and non-technical risk factors—while outlining avenues for methodological development, institutional experimentation, and policy collaboration to realize robust, multi-stakeholder LLM governance.

Abstract

Large language models (LLMs) represent a major advance in artificial intelligence (AI) research. However, the widespread use of LLMs is also coupled with significant ethical and social challenges. Previous research has pointed towards auditing as a promising governance mechanism to help ensure that AI systems are designed and deployed in ways that are ethical, legal, and technically robust. However, existing auditing procedures fail to address the governance challenges posed by LLMs, which display emergent capabilities and are adaptable to a wide range of downstream tasks. In this article, we address that gap by outlining a novel blueprint for how to audit LLMs. Specifically, we propose a three-layered approach, whereby governance audits (of technology providers that design and disseminate LLMs), model audits (of LLMs after pre-training but prior to their release), and application audits (of applications based on LLMs) complement and inform each other. We show how audits, when conducted in a structured and coordinated manner on all three levels, can be a feasible and effective mechanism for identifying and managing some of the ethical and social risks posed by LLMs. However, it is important to remain realistic about what auditing can reasonably be expected to achieve. Therefore, we discuss the limitations not only of our three-layered approach but also of the prospect of auditing LLMs at all. Ultimately, this article seeks to expand the methodological toolkit available to technology providers and policymakers who wish to analyse and evaluate LLMs from technical, ethical, and legal perspectives.
Paper Structure (25 sections, 2 figures)

This paper contains 25 sections, 2 figures.

Figures (2)

  • Figure 1: Blueprint for how to audit LLMs: A three-layered approach.
  • Figure 2: Outputs from audits on one level become inputs for audits on other levels.