Hamiltonian Monte Carlo on ReLU Neural Networks is Inefficient

Vu C. Dinh; Lam Si Tung Ho; Cuong V. Nguyen

Hamiltonian Monte Carlo on ReLU Neural Networks is Inefficient

Vu C. Dinh, Lam Si Tung Ho, Cuong V. Nguyen

TL;DR

Due to the non-differentiability of activation functions in the ReLU family, leapfrog HMC for networks with these activation functions has a large local error rate, which leads to a higher rejection rate of the proposals, making the method inefficient.

Abstract

We analyze the error rates of the Hamiltonian Monte Carlo algorithm with leapfrog integrator for Bayesian neural network inference. We show that due to the non-differentiability of activation functions in the ReLU family, leapfrog HMC for networks with these activation functions has a large local error rate of $Ω(ε)$ rather than the classical error rate of $O(ε^3)$. This leads to a higher rejection rate of the proposals, making the method inefficient. We then verify our theoretical findings through empirical simulations as well as experiments on a real-world dataset that highlight the inefficiency of HMC inference on ReLU-based neural networks compared to analytical networks.

Hamiltonian Monte Carlo on ReLU Neural Networks is Inefficient

TL;DR

Abstract

Hamiltonian Monte Carlo on ReLU Neural Networks is Inefficient

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (5)