Omitted Labels Induce Nontransitive Paradoxes in Causality
Bijan Mazaheri, Siddharth Jain, Matthew Cook, Jehoshua Bruck
TL;DR
This work analyzes how omitting label contexts in data can produce irrecoverable causal effects and lead to nontransitive, paradoxical conclusions. By connecting omitted-label causality to the Condorcet paradox, it formalizes a correspondence between networks of contexts and aggregations of rankings (AR) and soft rankings (ASR) within the linear ordering polytope. The core result shows ARs and ASRs occupy the same convex edge-weight space, enabling a unified view of consistency across partial studies and decision fusion across contexts. The framework offers bounds and diagnostics for inconsistencies when fusing conclusions from different label contexts, with implications for rare-disease studies and model-based decision fusion.
Abstract
We explore "omitted label contexts," in which training data is limited to a subset of the possible labels. This setting is standard among specialized human experts or specific, focused studies. By studying Simpson's paradox, we observe that ``correct'' adjustments sometimes require non-exchangeable treatment and control groups. A generalization of Simpson's paradox leads us to study networks of conclusions drawn from different contexts, within which a paradox of nontransitivity arises. We prove that the space of possible nontransitive structures in these networks exactly corresponds to structures that form from aggregating ranked-choice votes.
