Hybrid Models with Deep and Invertible Features
Eric Nalisnick, Akihiro Matsukawa, Yee Whye Teh, Dilan Gorur, Balaji Lakshminarayanan
TL;DR
This work introduces the DIGLM, a neural hybrid that couples a deep invertible transform with a generalized linear model to jointly model p(x) and p(y|x) in a single forward pass. By leveraging invertible flows (RNVP/Glow) as feature extractors and sharing parameters with the predictive GLM, the model provides exact densities, enabling out-of-distribution detection and semi-supervised learning. Empirical results on regression and classification tasks show competitive predictive performance and strong uncertainty estimates, with notable improvements in negative log-likelihood when leveraging the generative component. The approach also bridges to Gaussian processes through a kernel defined by the latent representations, offering a practical probabilistic deep learning framework for downstream tasks that require density-based reasoning.
Abstract
We propose a neural hybrid model consisting of a linear model defined on a set of features computed by a deep, invertible transformation (i.e. a normalizing flow). An attractive property of our model is that both p(features), the density of the features, and p(targets | features), the predictive distribution, can be computed exactly in a single feed-forward pass. We show that our hybrid model, despite the invertibility constraints, achieves similar accuracy to purely predictive models. Moreover the generative component remains a good model of the input features despite the hybrid optimization objective. This offers additional capabilities such as detection of out-of-distribution inputs and enabling semi-supervised learning. The availability of the exact joint density p(targets, features) also allows us to compute many quantities readily, making our hybrid model a useful building block for downstream applications of probabilistic deep learning.
