Domain Generalisation via Imprecise Learning
Anurag Singh, Siu Lun Chau, Shahine Bouabid, Krikamol Muandet
TL;DR
This work tackles the challenge of out-of-distribution generalisation under deployment uncertainty by introducing Imprecise Domain Generalisation (IDG), which lets a learner optimize against a continuum of generalisation strategies without committing to a single notion during training. The framework uses an imprecise risk optimisation (IRO) approach and a deployment-time operator framework, enabling risk aversion to be tuned via CVaR level $\lambda\in[0,1]$ and aggregated through a credal set ${\mathbb K}(Z)$. Core theoretical contributions include the introduction of Continuous-Pareto (C-Pareto) optimality for augmented hypotheses ${\mathcal H}_{\Lambda}$, a gradient-based IRO algorithm with a high-probability excess risk bound of order $O(n^{-1/2}+m^{-1/2})$, and results showing that scalarising across $\Lambda$ preserves Bayes-optimality on the support. Empirically, IDG demonstrates robust generalisation across synthetic domains, CMNIST with domain-specific risk, and real-world Bike Rentals, achieving low maximal regret and competitive performance across operator preferences. Overall, the approach decouples learning from deployment-time generalisation choice, offering a principled way to handle generalisation uncertainty in practical OOD settings.
Abstract
Out-of-distribution (OOD) generalisation is challenging because it involves not only learning from empirical data, but also deciding among various notions of generalisation, e.g., optimising the average-case risk, worst-case risk, or interpolations thereof. While this choice should in principle be made by the model operator like medical doctors, this information might not always be available at training time. The institutional separation between machine learners and model operators leads to arbitrary commitments to specific generalisation strategies by machine learners due to these deployment uncertainties. We introduce the Imprecise Domain Generalisation framework to mitigate this, featuring an imprecise risk optimisation that allows learners to stay imprecise by optimising against a continuous spectrum of generalisation strategies during training, and a model framework that allows operators to specify their generalisation preference at deployment. Supported by both theoretical and empirical evidence, our work showcases the benefits of integrating imprecision into domain generalisation.
