Density Ratio-based Proxy Causal Learning Without Density Ratios
Bariscan Bozkurt, Ben Deaner, Dimitri Meunier, Liyuan Xu, Arthur Gretton
TL;DR
This work introduces a density-ratio-free proxy causal learning method that leverages a treatment bridge function within an RKHS to identify and estimate dose-response and conditional dose-response curves under hidden confounding. By formulating the problem with kernel mean embeddings and a three-stage regression (including a third-stage regression for ATT), the authors derive closed-form estimators for $f_{ATE}$ and $f_{ATT}$ and prove non-asymptotic consistency under standard RKHS and completeness assumptions. The approach avoids explicit density ratio estimation, enabling effective handling of continuous and high-dimensional treatments, with strong theoretical guarantees and extensive empirical validation on synthetic and real data. Overall, the paper advances PCL by providing a practical, scalable, and theoretically sound density-ratio-free framework for causal effect estimation in the presence of unobserved confounding.
Abstract
We address the setting of Proxy Causal Learning (PCL), which has the goal of estimating causal effects from observed data in the presence of hidden confounding. Proxy methods accomplish this task using two proxy variables related to the latent confounder: a treatment proxy (related to the treatment) and an outcome proxy (related to the outcome). Two approaches have been proposed to perform causal effect estimation given proxy variables; however only one of these has found mainstream acceptance, since the other was understood to require density ratio estimation - a challenging task in high dimensions. In the present work, we propose a practical and effective implementation of the second approach, which bypasses explicit density ratio estimation and is suitable for continuous and high-dimensional treatments. We employ kernel ridge regression to derive estimators, resulting in simple closed-form solutions for dose-response and conditional dose-response curves, along with consistency guarantees. Our methods empirically demonstrate superior or comparable performance to existing frameworks on synthetic and real-world datasets.
