Why High-rank Neural Networks Generalize?: An Algebraic Framework with RKHSs

Yuka Hashimoto; Sho Sonoda; Isao Ishikawa; Masahiro Ikeda

Why High-rank Neural Networks Generalize?: An Algebraic Framework with RKHSs

Yuka Hashimoto, Sho Sonoda, Isao Ishikawa, Masahiro Ikeda

TL;DR

This work derives a new Rademacher complexity bound for deep neural networks using Koopman operators, group representations, and reproducing kernel Hilbert spaces (RKHSs) to derive a bound for a wider range of realistic models.

Abstract

We derive a new Rademacher complexity bound for deep neural networks using Koopman operators, group representations, and reproducing kernel Hilbert spaces (RKHSs). The proposed bound describes why the models with high-rank weight matrices generalize well. Although there are existing bounds that attempt to describe this phenomenon, these existing bounds can be applied to limited types of models. We introduce an algebraic representation of neural networks and a kernel function to construct an RKHS to derive a bound for a wider range of realistic models. This work paves the way for the Koopman-based theory for Rademacher complexity bounds to be valid for more practical situations.

Why High-rank Neural Networks Generalize?: An Algebraic Framework with RKHSs

TL;DR

Abstract

Why High-rank Neural Networks Generalize?: An Algebraic Framework with RKHSs

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (27)