Objective Model Prior Probabilities in Variable Selection

James Berger; Gonzalo García-Donato; Elías Moreno; Luis Pericchi

Objective Model Prior Probabilities in Variable Selection

James Berger, Gonzalo García-Donato, Elías Moreno, Luis Pericchi

Abstract

For many years it was routine to use equal model prior probabilities in Bayesian model uncertainty analysis. At least twenty years ago it became clear that this was problematic, leading to support of much too large models in the increasingly huge model spaces being considered in genomics and other fields. A popular replacement was to adopt a suggestion of Harold Jeffreys for the variable selection problem in which a total of $k$ possible variables are being considered for inclusion in the model: give the collection of all models containing $d$ variables ($d = 0, . . . , k$) prior probability $1/(k + 1)$ and then divide this prior probability equally among the models in the collection. Many other choices of model prior probabilities that impose severe parsimony have also been introduced. We begin by reviewing the problems with using equal model prior probabilities and then discuss some serious problems with the Jeffreys choice. Finally, we introduce and study a number of objective alternative choices of model prior probabilities, from both numerical and theoretical perspectives.

Objective Model Prior Probabilities in Variable Selection

Abstract

possible variables are being considered for inclusion in the model: give the collection of all models containing

variables (

) prior probability

and then divide this prior probability equally among the models in the collection. Many other choices of model prior probabilities that impose severe parsimony have also been introduced. We begin by reviewing the problems with using equal model prior probabilities and then discuss some serious problems with the Jeffreys choice. Finally, we introduce and study a number of objective alternative choices of model prior probabilities, from both numerical and theoretical perspectives.

Paper Structure (23 sections, 1 theorem, 33 equations, 2 figures, 1 table)

This paper contains 23 sections, 1 theorem, 33 equations, 2 figures, 1 table.

Introduction and motivation
Variable selection in the General linear model
More parsimonious prior probabilities
Harmonic prior probabilities
CMG prior probabilities
Half-$p$ Jeffreys prior probabilities
Half-$k$ Jeffreys prior probabilities
Beta and hierarchical Beta prior probabilities
Overly parsimonious choices from the perspective of objectivity
Numerical comparisons of the prior probabilities
Graphs of the priors
A discriminating example
Philosophical and theoretical comparisons
Philosophical issues
When $k=1$
...and 8 more sections

Key Result

Lemma 1

Using (Bgamma), consider $B_{\gamma}(SSE)$ as a function of $SSE$ for fixed $n, k_0$, $SSE_0$ and $|\gamma|$. If $SSE_1\le SSE_2$ then $B_{\gamma}(SSE_1)\ge B_{\gamma}(SSE_2)$.

Figures (2)

Figure 1: For $k=49$, Half-$p$ (black); Jeffreys (red), Uniform (Green), Half-$k$ (blue); CGM (Cyan); Hierarchical (magenta); Beta(1,2) (gray); Harmonic (yellow).
Figure 2: For the obesity dataset, $k=47$, the log of ratio of the posterior probability of the most probable model of each dimension to the null model for different priors. Half-$p$ (black); Jeffreys (red), Uniform (Green), Half-$k$ (blue); CGM (Cyan); Hierarchical (magenta); Beta(1,2) (gray); Harmonic (yellow).

Theorems & Definitions (2)

Lemma 1
proof

Objective Model Prior Probabilities in Variable Selection

Abstract

Objective Model Prior Probabilities in Variable Selection

Authors

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (2)