Predictable Artificial Intelligence

Lexin Zhou; Pablo A. Moreno-Casares; Fernando Martínez-Plumed; John Burden; Ryan Burnell; Lucy Cheke; Cèsar Ferri; Alexandru Marcoci; Behzad Mehrbakhsh; Yael Moros-Daval; Seán Ó hÉigeartaigh; Danaja Rutar; Wout Schellaert; Konstantinos Voudouris; José Hernández-Orallo

Predictable Artificial Intelligence

Lexin Zhou, Pablo A. Moreno-Casares, Fernando Martínez-Plumed, John Burden, Ryan Burnell, Lucy Cheke, Cèsar Ferri, Alexandru Marcoci, Behzad Mehrbakhsh, Yael Moros-Daval, Seán Ó hÉigeartaigh, Danaja Rutar, Wout Schellaert, Konstantinos Voudouris, José Hernández-Orallo

TL;DR

It is argued that achieving predictability is crucial for fostering trust, liability, control, alignment and safety of AI ecosystems, and thus should be prioritised over performance.

Abstract

We introduce the fundamental ideas and challenges of Predictable AI, a nascent research area that explores the ways in which we can anticipate key validity indicators (e.g., performance, safety) of present and future AI ecosystems. We argue that achieving predictability is crucial for fostering trust, liability, control, alignment and safety of AI ecosystems, and thus should be prioritised over performance. We formally characterise predictability, explore its most relevant components, illustrate what can be predicted, describe alternative candidates for predictors, as well as the trade-offs between maximising validity and predictability. To illustrate these concepts, we bring an array of illustrative examples covering diverse ecosystem configurations. Predictable AI is related to other areas of technical and non-technical AI research, but have distinctive questions, hypotheses, techniques and challenges. This paper aims to elucidate them, calls for identifying paths towards a landscape of predictably valid AI systems and outlines the potential impact of this emergent field.

Predictable Artificial Intelligence

TL;DR

It is argued that achieving predictability is crucial for fostering trust, liability, control, alignment and safety of AI ecosystems, and thus should be prioritised over performance.

Abstract

Paper Structure (11 sections, 4 equations, 3 figures, 5 tables)

This paper contains 11 sections, 4 equations, 3 figures, 5 tables.

What is Predictable AI?
The centrality and importance of Predictable AI
AI ecosystems and predictability
Predictability
What can be predicted and where to operate?
Framing predictability
Who predicts and how?
Scenarios
The trade-offs
Challenges and opportunities
Impact and vision

Figures (3)

Figure 1: A driving scenario where six AI systems (A,B,C,D,E,F) control self-driving cars, with all systems having the same expected validity (62.5%); the grids of colour green, yellow and red represent fully valid, partially valid, and invalid cases, respectively. The distributions of validity for the six systems, $p_A, p_B, p_C, p_D, p_E, p_F$, differ across windingness and fogginess. Which one is the most predictable and hence better? Illustration adapted from burnell2022not.
Figure 2: Performance of a selection of GPT and LLaMA models over human difficulty on the ReliabilityBench benchmark zhou2024llmrel. The values are split by correct, avoidant and incorrect results. The $x$-axis is split into 30 equal-sized bins, whose ranges must be taken as indicative of different distributions of perceived human difficulty across benchmarks.
Figure 3: Scaling laws of neural models kaplan2020scaling. The test loss is predictable from the compute used during training, the training dataset size and the number of parameters of the model.

Predictable Artificial Intelligence

TL;DR

Abstract

Predictable Artificial Intelligence

Authors

TL;DR

Abstract

Table of Contents

Figures (3)