2-Cats: 2D Copula Approximating Transforms
Flavio Figueiredo, José Geraldo Fernandes, Jackson Silva, Renato M. Assunção
TL;DR
The paper tackles learning flexible 2D copulas without relying on fixed parametric families by introducing 2-Cats, a neural network that constructs a Copula transform H_{}{{\theta}}(u,v) = G(z(t_v(u)), z(t_u(v))) using monotone, integrally-constructed transforms to satisfy Copula properties. It enforces Copula desiderata via a Lagrangian framework to approximate boundary conditions (H_{}(u,1) \approx u and H_{}(1,v) \approx v) and augments training with Sobolev losses that align not only the Copula output but also its first and second derivatives, enabling derivative-aware sampling and Vine-like constructions. The approach is validated on synthetic and real datasets, showing competitive or superior negative log-likelihoods compared to baselines, and ablation studies highlight the value of each loss component and the Lagrangian term. The work advances Copula modeling by combining flexible neural transforms with principled constraint enforcement and derivative-informed training, with potential extensions to higher dimensions via Pair Copula Constructions. Overall, 2-Cats demonstrates strong practical impact for modeling multivariate dependencies without restrictive Copula families.
Abstract
Copulas are powerful statistical tools for capturing dependencies across data dimensions. Applying Copulas involves estimating independent marginals, a straightforward task, followed by the much more challenging task of determining a single copulating function, $C$, that links these marginals. For bivariate data, a copula takes the form of a two-increasing function $C: (u,v)\in \mathbb{I}^2 \rightarrow \mathbb{I}$, where $\mathbb{I} = [0, 1]$. This paper proposes 2-Cats, a Neural Network (NN) model that learns two-dimensional Copulas without relying on specific Copula families (e.g., Archimedean). Furthermore, via both theoretical properties of the model and a Lagrangian training approach, we show that 2-Cats meets the desiderata of Copula properties. Moreover, inspired by the literature on Physics-Informed Neural Networks and Sobolev Training, we further extend our training strategy to learn not only the output of a Copula but also its derivatives. Our proposed method exhibits superior performance compared to the state-of-the-art across various datasets while respecting (provably for most and approximately for a single other) properties of C.
