Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions

Chinmay Rane; Kanishka Tyagi; Michael Manry

Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions

Chinmay Rane, Kanishka Tyagi, Michael Manry

TL;DR

A novel Adaptive Activation algorithm is proposed, AdAct, exhibiting promising performance improvements in diverse CNN and multilayer perceptron configurations, thereby presenting compelling results to support its usage.

Abstract

Deep learning training training algorithms are a huge success in recent years in many fields including speech, text,image video etc. Deeper and deeper layers are proposed with huge success with resnet structures having around 152 layers. Shallow convolution neural networks(CNN's) are still an active research, where some phenomena are still unexplained. Activation functions used in the network are of utmost importance, as they provide non linearity to the networks. Relu's are the most commonly used activation function.We show a complex piece-wise linear(PWL) activation in the hidden layer. We show that these PWL activations work much better than relu activations in our networks for convolution neural networks and multilayer perceptrons. Result comparison in PyTorch for shallow and deep CNNs are given to further strengthen our case.

Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions

TL;DR

Abstract

Paper Structure (22 sections, 36 equations, 16 figures, 5 tables, 1 algorithm)

This paper contains 22 sections, 36 equations, 16 figures, 5 tables, 1 algorithm.

Introduction
Prior Work
Structure and Notation
Scaled conjugate gradient algorithm
Levenberg-Marquardt algorithm
Basic MOLF
Proposed Work
Exsisting Pieciewise Linear Activations
Mathematical Background
Piecewise Linear Activations
Experimental Results
Computational Burden
Approximation Datasets Results
Classifier Datasets Results
Shallow CNN results
...and 7 more sections

Figures (16)

Figure 1: Shallow CNN with Linear softmax cross-entropy classifier
Figure 2: Fixed Piecewise Linear activations
Figure 3: 2 ReLU curves
Figure 4: 4 ReLU curves
Figure 5: Approximate sigmoid using 2 ReLU curves
...and 11 more figures

Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions

TL;DR

Abstract

Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions

Authors

TL;DR

Abstract

Table of Contents

Figures (16)