'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

Samuel Judson; Matthew Elacqua; Filip Cano; Timos Antonopoulos; Bettina Könighofer; Scott J. Shapiro; Ruzica Piskac

'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

Samuel Judson, Matthew Elacqua, Filip Cano, Timos Antonopoulos, Bettina Könighofer, Scott J. Shapiro, Ruzica Piskac

TL;DR

It is shown that under minimal assumptions automated reasoning can rigorously interrogate algorithmic behaviors as in the adversarial process of legal fact finding.

Abstract

Principled accountability in the aftermath of harms is essential to the trustworthy design and governance of algorithmic decision making. Legal theory offers a paramount method for assessing culpability: putting the agent 'on the stand' to subject their actions and intentions to cross-examination. We show that under minimal assumptions automated reasoning can rigorously interrogate algorithmic behaviors as in the adversarial process of legal fact finding. We model accountability processes, such as trials or review boards, as Counterfactual-Guided Logic Exploration and Abstraction Refinement (CLEAR) loops. We use the formal methods of symbolic execution and satisfiability modulo theories (SMT) solving to discharge queries about agent behavior in factual and counterfactual scenarios, as adaptively formulated by a human investigator. In order to do so, for a decision algorithm $\mathcal{A}$ we use symbolic execution to represent its logic as a statement $Π$ in the decidable theory $\texttt{QF_FPBV}$. We implement our framework and demonstrate its utility on an illustrative car crash scenario.

'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

TL;DR

It is shown that under minimal assumptions automated reasoning can rigorously interrogate algorithmic behaviors as in the adversarial process of legal fact finding.

Abstract

we use symbolic execution to represent its logic as a statement

in the decidable theory

. We implement our framework and demonstrate its utility on an illustrative car crash scenario.

Paper Structure (43 sections, 2 theorems, 17 equations, 9 figures, 2 tables, 1 algorithm)

This paper contains 43 sections, 2 theorems, 17 equations, 9 figures, 2 tables, 1 algorithm.

Introduction
Contribution
Outline.
Related Work
Motivation
Legal Accountability for ADMs
Technical Background
Programs and Traces
SMT-based Program Analysis
SMT Solving.
Symbolic Execution.
Counterfactual Reasoning
Formalism.
Formal Reasoning for Accountability
Counterfactual-Guided Logic Exploration
...and 28 more sections

Key Result

Theorem 5.5

Let $q_i = ((\varphi, \psi), \beta)$ be a factual query, and let $(\tau^f, \, t, \, \varphi, \, \psi)$ be a corresponding factual scenario. Then is valid iff $\tau^f \models \varphi(\hat{E}) \land \psi(\hat{S}) \to_{\mathcal{A}, t, \ell} \beta(\hat{D})$.

Figures (9)

Figure 1: A broadside car crash rendered in the $\textsf{soid}$ GUI.
Figure 2: Longitudinal Proper Response Guard (see Def. 4.1 of shalev2017formal) written in C. Although not used by the simulated self-driving cars of our case study, this sort of decision functionality is representative of what $\textsf{soid}$ is designed to investigate.
Figure 3: Counterfactual-guided Abstraction Refinement of the code in Figure \ref{['fig:long-pr-example']}. A factual query $q_1$ and a pair of counterfactual queries, $q_2, \, q_3$, all with the same $\beta$, produce the diagrammed symbolic traces and the resultant abstraction $\hat{A}$ at bottom right. Concrete states are rendered with a typewriter font, while symbolic states are rendered in Garamond.
Figure 4: Still of the $\textsf{soid}$ GUI (with a small section cut out for brevity). At top left is the critical moment from the program logs as chosen by the investigator. At bottom right are the counterfactual conditions the investigator has specified.
Figure 5: Our decision tree example. At top, the relevant decision subtree for a misclassification based on health data, with the incorrect path taken in red -- and the correct branch missed in blue -- as the unit conversion bug leads to a significantly smaller BMI input than is correct. At bottom, the (otherwise correct) decision tree inference logic in C.
...and 4 more figures

Theorems & Definitions (10)

Definition 3.1
Definition 3.2
Definition 5.1
Definition 5.2
Definition 5.3
Definition 5.4
Theorem 5.5
proof
Theorem 5.6
proof

'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

TL;DR

Abstract

'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (10)