An inferential measure of dependence between two systems using Bayesian model comparison
Guillaume Marrelec, Alain Giron
TL;DR
The paper presents a principled Bayesian framework to quantify dependence between two systems by comparing a joint independence model $H_0$ with a dependent model $H_1$, and defining the dependence measure $B(X,Y|D) = P(H_1|D)$ (or a strictly increasing function of it). It shows how $B$ relates to classical dependence notions such as mutual information and the log-likelihood ratio, and derives asymptotic behavior under known distributions, unknown-parameter likelihoods, nested models, and copula-based dependence; it also analyzes misspecification effects. Through extensive simulations and a real-life EEG application, the authors demonstrate that the log-posterior odds $\mathfrak{B}_{\mathrm{logr}}$ increases with sample size and true dependence strength, decreases under independence, and behaves consistently with an inferential measure of dependence. The framework provides an interpretable, model-based, data-driven measure of dependence that complements existing metrics by quantifying the evidence for dependence given a specified dependence model, while highlighting the importance of model choice and priors. Overall, the work bridges Bayesian model comparison with information-theoretic dependence concepts and offers a versatile toolkit for assessing dependence in diverse data settings.
Abstract
We propose to quantify dependence between two systems $X$ and $Y$ in a dataset $D$ based on the Bayesian comparison of two models: one, $H_0$, of statistical independence and another one, $H_1$, of dependence. In this framework, dependence between $X$ and $Y$ in $D$, denoted $B(X,Y|D)$, is quantified as $P(H_1|D)$, the posterior probability for the model of dependence given $D$, or any strictly increasing function thereof. It is therefore a measure of the evidence for dependence between $X$ and $Y$ as modeled by $H_1$ and observed in $D$. We review several statistical models and reconsider standard results in the light of $B(X,Y|D)$ as a measure of dependence. Using simulations, we focus on two specific issues: the effect of noise and the behavior of $B(X,Y|D)$ when $H_1$ has a parameter coding for the intensity of dependence. We then derive some general properties of $B(X,Y|D)$, showing that it quantifies the information contained in $D$ in favor of $H_1$ versus $H_0$. While some of these properties are typical of what is expected from a valid measure of dependence, others are novel and naturally appear as desired features for specific measures of dependence, which we call inferential. We finally put these results in perspective; in particular, we discuss the consequences of using the Bayesian framework as well as the similarities and differences between $B(X,Y|D)$ and mutual information.
