Table of Contents
Fetching ...

Sparse Hybrid Linear-Morphological Networks

Konstantinos Fotopoulos, Christos Garoufis, Petros Maragos

TL;DR

The paper tackles neural network efficiency by leveraging pruning-friendly morphological layers within a hybrid linear-morphological architecture. It replaces activations with sparse morphological layers inserted between linear blocks, and evaluates several heads including ReLU, Maxout, dense morphologies, and a sparsely initialized variant. The results show competitive accuracy across MTAT and CIFAR-10, with the sparse morphological design yielding the best pruning performance and faster early convergence, while inducing sparsity in the linear layers themselves. This approach offers practical gains in model compression and deployment efficiency, particularly for edge systems.

Abstract

We investigate hybrid linear-morphological networks. Recent studies highlight the inherent affinity of morphological layers to pruning, but also their difficulty in training. We propose a hybrid network structure, wherein morphological layers are inserted between the linear layers of the network, in place of activation functions. We experiment with the following morphological layers: 1) maxout pooling layers (as a special case of a morphological layer), 2) fully connected dense morphological layers, and 3) a novel, sparsely initialized variant of (2). We conduct experiments on the Magna-Tag-A-Tune (music auto-tagging) and CIFAR-10 (image classification) datasets, replacing the linear classification heads of state-of-the-art convolutional network architectures with our proposed network structure for the various morphological layers. We demonstrate that these networks induce sparsity to their linear layers, making them more prunable under L1 unstructured pruning. We also show that on MTAT our proposed sparsely initialized layer achieves slightly better performance than ReLU, maxout, and densely initialized max-plus layers, and exhibits faster initial convergence.

Sparse Hybrid Linear-Morphological Networks

TL;DR

The paper tackles neural network efficiency by leveraging pruning-friendly morphological layers within a hybrid linear-morphological architecture. It replaces activations with sparse morphological layers inserted between linear blocks, and evaluates several heads including ReLU, Maxout, dense morphologies, and a sparsely initialized variant. The results show competitive accuracy across MTAT and CIFAR-10, with the sparse morphological design yielding the best pruning performance and faster early convergence, while inducing sparsity in the linear layers themselves. This approach offers practical gains in model compression and deployment efficiency, particularly for edge systems.

Abstract

We investigate hybrid linear-morphological networks. Recent studies highlight the inherent affinity of morphological layers to pruning, but also their difficulty in training. We propose a hybrid network structure, wherein morphological layers are inserted between the linear layers of the network, in place of activation functions. We experiment with the following morphological layers: 1) maxout pooling layers (as a special case of a morphological layer), 2) fully connected dense morphological layers, and 3) a novel, sparsely initialized variant of (2). We conduct experiments on the Magna-Tag-A-Tune (music auto-tagging) and CIFAR-10 (image classification) datasets, replacing the linear classification heads of state-of-the-art convolutional network architectures with our proposed network structure for the various morphological layers. We demonstrate that these networks induce sparsity to their linear layers, making them more prunable under L1 unstructured pruning. We also show that on MTAT our proposed sparsely initialized layer achieves slightly better performance than ReLU, maxout, and densely initialized max-plus layers, and exhibits faster initial convergence.

Paper Structure

This paper contains 5 sections, 17 equations, 2 figures, 3 tables.

Figures (2)

  • Figure 1: Proposed network structure, prepended by a CNN backbone, for the case of MTAT and CIFAR; ReLU activations have been replaced by a sparse morphological layer. Each MP has on average 2 input weights (red)
  • Figure 2: Error plot of validation ROC-AUC and PR-AUC scores of different methods on MTAT for the first 25 epochs of training; 0.9 confidence interval, x-axis is in log-scale.