Goodness-of-fit and utility estimation: what's possible and what's not

Yujian Chen; Joshua Lanier; John K. -H. Quah

Goodness-of-fit and utility estimation: what's possible and what's not

Yujian Chen, Joshua Lanier, John K. -H. Quah

TL;DR

The paper addresses whether goodness-of-fit indices for testing utility-maximization can be both continuous and accurate. It formalizes five loss-function–based indices (Afriat, Varian, Swaps, nonlinear LS, Houtman-Maks) and proves an impossibility result: for well-behaved utilities, no index can be both continuous and accurate while yielding a minimizable, accurate loss function. It introduces essential accuracy, a robust welfare criterion, and a framework for bounded rationality (consideration-sets), plus an empirical demonstration using scanner data to compare welfare implications under soda reductions. The findings clarify fundamental limitations of standard fit measures, propose a robust welfare analysis approach when best-fitting utility functions do not exist, and characterize utility classes that admit nicer properties. Collectively, the work informs both theoretical understanding and practical welfare analysis in empirical revealed-preference settings.

Abstract

A goodness-of-fit index measures the consistency of consumption data with a given model of utility-maximization. We show that for the class of well-behaved (i.e., continuous and increasing) utility functions there is no goodness-of-fit index that is continuous and accurate, where the latter means that a perfect score is obtained if and only if a dataset can be rationalized by a well-behaved utility function. While many standard goodness-of-fit indices are inaccurate we show that these indices are (in a sense we make precise) essentially accurate. Goodness-of-fit indices are typically generated by loss functions and we find that standard loss functions usually do not yield a best-fitting utility function when they are minimized. Nonetheless, welfare comparisons can be made by working out a robust preference relation from the data.

Goodness-of-fit and utility estimation: what's possible and what's not

TL;DR

Abstract

Paper Structure (30 sections, 19 theorems, 31 equations, 4 figures, 2 tables)

This paper contains 30 sections, 19 theorems, 31 equations, 4 figures, 2 tables.

Introduction
Utility maximization and goodness-of-fit indices
Goodness-of-fit indices
Essential accuracy
Accuracy versus continuity
Loss Functions and Goodness-of-fit Indices
The Afriat Index.
The Varian Index.
The Swaps Index.
Non-Linear Least Squares.
The Houtman-Maks Index.
Loss functions to estimate utility
Popular Loss Functions
Welfare when there are no best-fitting utility functions
Loss Functions and Consideration Sets
...and 15 more sections

Key Result

Theorem 1

A purchase dataset $D = (\boldsymbol{q}^t,\boldsymbol{p}^t)_{t \leq T}$ can be rationalized by a well-behaved utility function if and only if $D$ satisfies GARP.

Figures (4)

Figure 1: The dataset on the left, referred to as $D_1$ in Example \ref{['example:bad-news']}, satisfies GARP. The sequence $\boldsymbol{q}_n = (4 - \tfrac{1}{3n}, 4 + \tfrac{2}{3n})$ tends to the location where the two budget lines cross (i.e. where the arrow is pointing in the figure to the left). The dataset which contains this limiting bundle, depicted on the right, does not satisfy GARP.
Figure 2: The GARP-violating dataset $\widebar D$. Depicted are various binary relations over the bundles $\boldsymbol{q}$, $\boldsymbol{q}'$, and $\tilde{\boldsymbol{q}}$.
Figure 3: Weak and strong compensation levels for a 25% reduction in soda consumption using the Varian index
Figure 4: Weak and strong compensation levels for a 25% reduction in soda consumption using the Varian Index

Theorems & Definitions (42)

Theorem 1: Afriat
Definition 1
Example 1
Proposition 1
Definition 2
Proposition 2
Proposition 3
Proposition 4
Example 1: continued
Proposition 5
...and 32 more

Goodness-of-fit and utility estimation: what's possible and what's not

TL;DR

Abstract

Goodness-of-fit and utility estimation: what's possible and what's not

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (42)