Table of Contents
Fetching ...

Generalized vec trick for fast learning of pairwise kernel models

Markus Viljanen, Antti Airola, Tapio Pahikkala

TL;DR

This work presents a comprehensive review of pairwise kernels, that have been proposed for incorporating prior knowledge about the relationship between the objects, and shows how the reviewed kernels can be expressed as sums of Kronecker products, allowing the use of generalized vec trick for speeding up their computation.

Abstract

Pairwise learning corresponds to the supervised learning setting where the goal is to make predictions for pairs of objects. Prominent applications include predicting drug-target or protein-protein interactions, or customer-product preferences. In this work, we present a comprehensive review of pairwise kernels, that have been proposed for incorporating prior knowledge about the relationship between the objects. Specifically, we consider the standard, symmetric and anti-symmetric Kronecker product kernels, metric-learning, Cartesian, ranking, as well as linear, polynomial and Gaussian kernels. Recently, a O(nm + nq) time generalized vec trick algorithm, where n, m, and q denote the number of pairs, drugs and targets, was introduced for training kernel methods with the Kronecker product kernel. This was a significant improvement over previous O(n^2) training methods, since in most real-world applications m,q << n. In this work we show how all the reviewed kernels can be expressed as sums of Kronecker products, allowing the use of generalized vec trick for speeding up their computation. In the experiments, we demonstrate how the introduced approach allows scaling pairwise kernels to much larger data sets than previously feasible, and provide an extensive comparison of the kernels on a number of biological interaction prediction tasks.

Generalized vec trick for fast learning of pairwise kernel models

TL;DR

This work presents a comprehensive review of pairwise kernels, that have been proposed for incorporating prior knowledge about the relationship between the objects, and shows how the reviewed kernels can be expressed as sums of Kronecker products, allowing the use of generalized vec trick for speeding up their computation.

Abstract

Pairwise learning corresponds to the supervised learning setting where the goal is to make predictions for pairs of objects. Prominent applications include predicting drug-target or protein-protein interactions, or customer-product preferences. In this work, we present a comprehensive review of pairwise kernels, that have been proposed for incorporating prior knowledge about the relationship between the objects. Specifically, we consider the standard, symmetric and anti-symmetric Kronecker product kernels, metric-learning, Cartesian, ranking, as well as linear, polynomial and Gaussian kernels. Recently, a O(nm + nq) time generalized vec trick algorithm, where n, m, and q denote the number of pairs, drugs and targets, was introduced for training kernel methods with the Kronecker product kernel. This was a significant improvement over previous O(n^2) training methods, since in most real-world applications m,q << n. In this work we show how all the reviewed kernels can be expressed as sums of Kronecker products, allowing the use of generalized vec trick for speeding up their computation. In the experiments, we demonstrate how the introduced approach allows scaling pairwise kernels to much larger data sets than previously feasible, and provide an extensive comparison of the kernels on a number of biological interaction prediction tasks.

Paper Structure

This paper contains 13 sections, 1 theorem, 14 equations, 2 figures, 4 tables.

Key Result

theorem 1

Let Then, the operation can be carried out in $O(\textnormal{min}(\overline{q}n+m\overline{n},\overline{m}n+q\overline{n}))$ time using a sparse Kronecker product multiplication algorithm known as the generalized vec-trick (GVT).

Figures (2)

  • Figure 1: Illustration of pairwise data. The 'chessboard' is a XOR-function of drug and target parities, whereas 'tablecloth' is a SUM-function of the parities.
  • Figure 2: Illustration of a pairwise data set with (drug,target)-pairs and sparse labels. Different types of test sets corresponding to different settings are illustrated with different colors.

Theorems & Definitions (2)

  • theorem 1: Airola2017gvt
  • definition thmcounterdefinition: Commutation and unification operators