Table of Contents
Fetching ...

RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer

Jiawei Zhang

TL;DR

By incorporating data and structural interdependence functions, RPN 2 explicitly models data interdependence via new component functions in its architecture, enabling it to encompass a broader range of contemporary dominant backbone models within its canonical representation.

Abstract

This paper builds upon our previous work on the Reconciled Polynomial Network (RPN). The original RPN model was designed under the assumption of input data independence, presuming the independence among both individual instances within data batches and attributes in each data instance. However, this assumption often proves invalid for function learning tasks involving complex, interdependent data such as language, images, time series, and graphs. Ignoring such data interdependence may inevitably lead to significant performance degradation. To overcome these limitations, we introduce the new Reconciled Polynomial Network (version 2), namely RPN 2, in this paper. By incorporating data and structural interdependence functions, RPN 2 explicitly models data interdependence via new component functions in its architecture. This enhancement not only significantly improves RPN 2's learning performance but also substantially expands its unifying potential, enabling it to encompass a broader range of contemporary dominant backbone models within its canonical representation. These backbones include, but are not limited to, convolutional neural networks (CNNs), recurrent neural networks (RNNs), graph neural networks (GNNs), and Transformers. Our analysis reveals that the fundamental distinctions among these backbone models primarily stem from their diverse approaches to defining the interdependence functions. Furthermore, this unified representation opens up new opportunities for designing innovative architectures with the potential to surpass the performance of these dominant backbones.

RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer

TL;DR

By incorporating data and structural interdependence functions, RPN 2 explicitly models data interdependence via new component functions in its architecture, enabling it to encompass a broader range of contemporary dominant backbone models within its canonical representation.

Abstract

This paper builds upon our previous work on the Reconciled Polynomial Network (RPN). The original RPN model was designed under the assumption of input data independence, presuming the independence among both individual instances within data batches and attributes in each data instance. However, this assumption often proves invalid for function learning tasks involving complex, interdependent data such as language, images, time series, and graphs. Ignoring such data interdependence may inevitably lead to significant performance degradation. To overcome these limitations, we introduce the new Reconciled Polynomial Network (version 2), namely RPN 2, in this paper. By incorporating data and structural interdependence functions, RPN 2 explicitly models data interdependence via new component functions in its architecture. This enhancement not only significantly improves RPN 2's learning performance but also substantially expands its unifying potential, enabling it to encompass a broader range of contemporary dominant backbone models within its canonical representation. These backbones include, but are not limited to, convolutional neural networks (CNNs), recurrent neural networks (RNNs), graph neural networks (GNNs), and Transformers. Our analysis reveals that the fundamental distinctions among these backbone models primarily stem from their diverse approaches to defining the interdependence functions. Furthermore, this unified representation opens up new opportunities for designing innovative architectures with the potential to surpass the performance of these dominant backbones.

Paper Structure

This paper contains 129 sections, 192 equations, 37 figures, 6 tables.

Figures (37)

  • Figure 1: An illustration of data interdependence modeling in RPN 2. Plots (a)-(d) show some examples of interdependent data examples: (a) a colored image of a hummingbird, (b) a sentence and its parsing structure, (c) time-series price data of six stocks, and (d) the Myricetin molecular graph. These data instances in different modalities can all be fed as inputs to the RPN 2 model. Plots (e)-(g) provide the matrices representations of the input data in RPN 2: (e) input data batch, (f) calculated (instance) interdependence matrix, and (g) output data batch after transformation. Plots (h)-(j) indicate the learning space of RPN 2: (h) input data space, (i) interdependence space, and (j) data transformation space used for defining the interdependence function and data transformation function.
  • Figure 2: Interdependence in Images.
  • Figure 3: Interdependence in Language.
  • Figure 4: Interdependency in Time-Series.
  • Figure 5: Interdependence in Graph.
  • ...and 32 more figures

Theorems & Definitions (4)

  • Definition 1
  • Definition 2
  • Definition 3
  • Example 1