Activation functions enabling the addition of neurons and layers without altering outcomes

Sergio López-Ureña

Activation functions enabling the addition of neurons and layers without altering outcomes

Sergio López-Ureña

TL;DR

This work tackles the problem of expanding neural networks without changing their outputs by introducing activation functions that are refinable and sum the identity, enabling two key operations: widening a layer via neuron subdivision and inserting a new layer between existing layers. The approach is grounded in subdivision theory, constructing activations from basic limit functions of convergent schemes, notably spline activations like $\sigma_{B^d}$ with refinability $A=d+1$, $ au=d/2$, and the identity-summing property on a suitable interval. It also extends to general subdivision-based activations, providing explicit frameworks and pseudocode (Appendix A) for practical implementation. The results offer a principled, parameter-efficient path to function-preserving architecture growth with potential benefits for multi-level training and structural learning, while outlining open questions about higher-order schemes and closed-form derivatives for backpropagation.

Abstract

In this work, we propose activation functions for neuronal networks that are refinable and sum the identity. This new class of activation functions allows the insertion of new layers between existing ones and/or the increase of neurons in a layer, both without altering the network outputs. Our approach is grounded in subdivision theory. The proposed activation functions are constructed from basic limit functions of convergent subdivision schemes. As a showcase of our results, we introduce a family of spline activation functions and provide comprehensive details for their practical implementation.

Activation functions enabling the addition of neurons and layers without altering outcomes

TL;DR

Abstract

Activation functions enabling the addition of neurons and layers without altering outcomes

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (22)