Table of Contents
Fetching ...

Toward Understanding Convolutional Neural Networks from Volterra Convolution Perspective

Tenghui Li, Guoxu Zhou, Yuning Qiu, Qibin Zhao

TL;DR

The paper tackles the challenge of interpreting complex CNN architectures by proposing a unified Volterra convolution framework, where networks are approximated by a finite-term Volterra series $\sum_{n=0}^{N} H_n * x^n$. It demonstrates that most CNNs can be represented or closely approximated within this formalism, enabling analysis via a smaller set of proxy kernels rather than the full, intricate architectures. It introduces two practical kernel-inference approaches—direct calculation for white-box networks and a hacking network for black-box scenarios—and validates them on simple two- and three-layer structures, showing close matches between true and proxy kernels. The work further explores how these proxies can be used for perturbation analysis and potential adversarial applications, including a MNIST demonstration, and suggests future directions for higher-order kernels and dynamic inputs with open-source code release.

Abstract

We make an attempt to understanding convolutional neural network by exploring the relationship between (deep) convolutional neural networks and Volterra convolutions. We propose a novel approach to explain and study the overall characteristics of neural networks without being disturbed by the horribly complex architectures. Specifically, we attempt to convert the basic structures of a convolutional neural network (CNN) and their combinations to the form of Volterra convolutions. The results show that most of convolutional neural networks can be approximated in the form of Volterra convolution, where the approximated proxy kernels preserve the characteristics of the original network. Analyzing these proxy kernels may give valuable insight about the original network. Base on this setup, we presented methods to approximating the order-zero and order-one proxy kernels, and verified the correctness and effectiveness of our results.

Toward Understanding Convolutional Neural Networks from Volterra Convolution Perspective

TL;DR

The paper tackles the challenge of interpreting complex CNN architectures by proposing a unified Volterra convolution framework, where networks are approximated by a finite-term Volterra series . It demonstrates that most CNNs can be represented or closely approximated within this formalism, enabling analysis via a smaller set of proxy kernels rather than the full, intricate architectures. It introduces two practical kernel-inference approaches—direct calculation for white-box networks and a hacking network for black-box scenarios—and validates them on simple two- and three-layer structures, showing close matches between true and proxy kernels. The work further explores how these proxies can be used for perturbation analysis and potential adversarial applications, including a MNIST demonstration, and suggests future directions for higher-order kernels and dynamic inputs with open-source code release.

Abstract

We make an attempt to understanding convolutional neural network by exploring the relationship between (deep) convolutional neural networks and Volterra convolutions. We propose a novel approach to explain and study the overall characteristics of neural networks without being disturbed by the horribly complex architectures. Specifically, we attempt to convert the basic structures of a convolutional neural network (CNN) and their combinations to the form of Volterra convolutions. The results show that most of convolutional neural networks can be approximated in the form of Volterra convolution, where the approximated proxy kernels preserve the characteristics of the original network. Analyzing these proxy kernels may give valuable insight about the original network. Base on this setup, we presented methods to approximating the order-zero and order-one proxy kernels, and verified the correctness and effectiveness of our results.

Paper Structure

This paper contains 34 sections, 19 theorems, 149 equations, 20 figures, 1 table.

Key Result

Proposition 10

The flatten-operator is homomorphic

Figures (20)

  • Figure 1: Differences between addition type and subtraction type.
  • Figure 2: A brief preview of the $\text{diag}(\cdot)$ operator.
  • Figure 3: Examples of outer Convolution.
  • Figure 4: Flattening a two-dimensional signal to a one-dimensional signal.
  • Figure 5: The number of terms that added to proxy kernels. The horizontal axis is the order and the vertical axis is the number of terms.
  • ...and 15 more figures

Theorems & Definitions (31)

  • Remark 1
  • Definition 2
  • Definition 3
  • Definition 4: Continuous outer convolution of kernels
  • Remark 5
  • Remark 6
  • Remark 7
  • Remark 8
  • Definition 9: Flatten-operator
  • Proposition 10
  • ...and 21 more