Deep Distributional Learning with Non-crossing Quantile Network
Guohao Shen, Runpeng Dai, Guojun Wu, Shikai Luo, Chengchun Shi, Hongtu Zhu
TL;DR
This work introduces a deep distributional learning framework built on non-crossing quantile networks (NQ-Net), enforcing monotonicity of conditional quantiles via a Mean-Gaps architecture with a strictly positive activation on gaps. It establishes non-asymptotic, minimax-optimal convergence guarantees, including robustness to misspecification, Hölder-smooth targets, and a manifold-based reduction of dimensionality, along with distributional RL guarantees under heavy-tailed rewards and dependent data. The theory is complemented by extensive numerical studies showing strong performance relative to existing non-crossing quantile methods and a distributional RL evaluation on Atari games. The results demonstrate that NQ-Net achieves accurate, non-crossing quantile estimates and scalable distributional learning, with practical impact for causal inference, policy evaluation, and robust RL in complex environments.
Abstract
In this paper, we introduce a non-crossing quantile (NQ) network for conditional distribution learning. By leveraging non-negative activation functions, the NQ network ensures that the learned distributions remain monotonic, effectively addressing the issue of quantile crossing. Furthermore, the NQ network-based deep distributional learning framework is highly adaptable, applicable to a wide range of applications, from classical non-parametric quantile regression to more advanced tasks such as causal effect estimation and distributional reinforcement learning (RL). We also develop a comprehensive theoretical foundation for the deep NQ estimator and its application to distributional RL, providing an in-depth analysis that demonstrates its effectiveness across these domains. Our experimental results further highlight the robustness and versatility of the NQ network.
