An Information Geometric Approach to Fairness With Equalized Odds Constraint
Amirreza Zamani, Ayfer Özgür, Mikael Skoglund
TL;DR
The paper tackles fair representation learning under equalized odds when the sensitive attribute S and task T are not directly observed. It introduces a local χ^2 constraint on P_{S|Y} and uses information-geometric, second-order approximations to replace mutual information terms, reducing the design to a quadratic program with a compression constraint. By analyzing the resulting matrices W^{T;Y} and W^{X;Y}, it derives singular-value–based lower bounds that yield a simple, low-complexity design (P2) and demonstrates near-optimality in certain regimes via a numerical example. The work offers a practical framework for balancing utility, fairness, and compression in privacy-aware settings, with potential for generalization to other fairness criteria and risk models.
Abstract
We study the statistical design of a fair mechanism that attains equalized odds, where an agent uses some useful data (database) $X$ to solve a task $T$. Since both $X$ and $T$ are correlated with some latent sensitive attribute $S$, the agent designs a representation $Y$ that satisfies an equalized odds, that is, such that $I(Y;S|T) =0$. In contrast to our previous work, we assume here that the agent has no direct access to $S$ and $T$; hence, the Markov chains $S - X - Y$ and $T - X - Y$ hold. Furthermore, we impose a geometric structure on the conditional distribution $P_{S|Y}$, allowing $Y$ and $S$ to have a small correlation, bounded by a threshold. When the threshold is small, concepts from information geometry allow us to approximate mutual information and reformulate the fair mechanism design problem as a quadratic program with closed-form solutions under certain constraints. For other cases, we derive simple, low-complexity lower bounds based on the maximum singular value and vector of a matrix. Finally, we compare our designs with the optimal solution in a numerical example.
