Table of Contents
Fetching ...

On Minimal Depth in Neural Networks

Juan L. Valerdi

TL;DR

This study investigates two topics on ReLU neural network expressivity and their connection with a conjecture related to the minimum depth required for representing any continuous piecewise linear (CPWL) function, as well as the exploration of polytope neural networks.

Abstract

A characterization of the representability of neural networks is relevant to comprehend their success in artificial intelligence. This study investigate two topics on ReLU neural network expressivity and their connection with a conjecture related to the minimum depth required for representing any continuous piecewise linear (CPWL) function. The topics are the minimal depth representation of the sum and max operations, as well as the exploration of polytope neural networks. For the sum operation, we establish a sufficient condition on the minimal depth of the operands to find the minimal depth of the operation. In contrast, regarding the max operation, a comprehensive set of examples is presented, demonstrating that no sufficient conditions, depending solely on the depth of the operands, would imply a minimal depth for the operation. The study also examine the minimal depth relationship between convex CPWL functions. On polytope neural networks, we investigate basic depth properties from Minkowski sums, convex hulls, number of vertices, faces, affine transformations, and indecomposable polytopes. More significant findings include depth characterization of polygons; identification of polytopes with an increasing number of vertices, exhibiting small depth and others with arbitrary large depth; and most notably, the minimal depth of simplices, which is strictly related to the minimal depth conjecture in ReLU networks.

On Minimal Depth in Neural Networks

TL;DR

This study investigates two topics on ReLU neural network expressivity and their connection with a conjecture related to the minimum depth required for representing any continuous piecewise linear (CPWL) function, as well as the exploration of polytope neural networks.

Abstract

A characterization of the representability of neural networks is relevant to comprehend their success in artificial intelligence. This study investigate two topics on ReLU neural network expressivity and their connection with a conjecture related to the minimum depth required for representing any continuous piecewise linear (CPWL) function. The topics are the minimal depth representation of the sum and max operations, as well as the exploration of polytope neural networks. For the sum operation, we establish a sufficient condition on the minimal depth of the operands to find the minimal depth of the operation. In contrast, regarding the max operation, a comprehensive set of examples is presented, demonstrating that no sufficient conditions, depending solely on the depth of the operands, would imply a minimal depth for the operation. The study also examine the minimal depth relationship between convex CPWL functions. On polytope neural networks, we investigate basic depth properties from Minkowski sums, convex hulls, number of vertices, faces, affine transformations, and indecomposable polytopes. More significant findings include depth characterization of polygons; identification of polytopes with an increasing number of vertices, exhibiting small depth and others with arbitrary large depth; and most notably, the minimal depth of simplices, which is strictly related to the minimal depth conjecture in ReLU networks.
Paper Structure (8 sections, 37 theorems, 35 equations)

This paper contains 8 sections, 37 theorems, 35 equations.

Key Result

Theorem 1

Any CPWL function can be expressed by a ReLU neural network with $\lceil \log_2(n + 1) \rceil$ hidden layers.

Theorems & Definitions (68)

  • Conjecture 1: Hertrich et al. hertrich2023towards
  • Theorem 1: Arora et al. arora2018understanding
  • Theorem 2: Wang and Sun wang2005generalization
  • Conjecture 2: Hertrich et al. hertrich2023towards
  • Theorem 3: Hertrich et al. hertrich2023towards
  • Conjecture 3: Hertrich et al. hertrich2023towards
  • Proposition 1
  • proof
  • Proposition 2
  • proof
  • ...and 58 more