Designing a Linearized Potential Function in Neural Network Optimization Using Csiszár Type of Tsallis Entropy

Keito Akiyama

TL;DR

This paper establishes a framework that utilizes a linearized potential function via the Csiszár type of Tsallis entropy, one of the generalized entropies, and shows that this framework enables us to derive an exponential convergence result.

Abstract

In recent years, learning for neural networks has come to be viewed as optimization in the space of probability measures. To obtain exponential convergence to the optimizer, a regularizing term based on Shannon entropy plays an important role. Even though the choice of entropy function heavily affects convergence results, there are almost no results on generalizing it, because of two technical difficulties: the lack of a sufficient condition for a generalized logarithmic Sobolev inequality, and the distributional dependence of the potential function within the gradient flow equation. In this paper, we establish a framework that utilizes a linearized potential function via the Csiszár type of Tsallis entropy, one of the generalized entropies. We also show that our new framework enables us to derive an exponential convergence result.
