Table of Contents
Fetching ...

Learning Is a Kan Extension

Matthew Pugh, Jo Grundy, Corina Cirstea, Nick Harris

TL;DR

This work formalises error minimisation within a category-theoretic framework by proving that all such problems can be represented as a left Kan extension, specifically $Lan_iota d$, which acts as a global minimiser. It introduces $S$ flavoured error as a lax 2-functor $Err:D\to S$ and shows that when $Inf:M\to D$ has a left adjoint $Alg:D\to M$, then $Alg(d)$ is a global minimiser for each $d$, while left Kan extensions provide a universal minimising construction. The paper also presents a universal representation: any set-theoretic error minimisation problem can be recast as a 2-category extension problem whose left Kan extension yields the global minimisers. Overall, the framework offers an algebraic lens for interpreting and guiding optimization in data transformations, with inference treated as a pseudo-inverse and Kan extensions serving as a unifying tool for existence and construction of optimal solutions.

Abstract

Previous work has demonstrated that efficient algorithms exist for computing Kan extensions and that some Kan extensions have interesting similarities to various machine learning algorithms. This paper closes the gap by proving that all error minimisation algorithms may be presented as a Kan extension. This result provides a foundation for future work to investigate the optimisation of machine learning algorithms through their presentation as Kan extensions. A corollary of this representation of error-minimising algorithms is a presentation of error from the perspective of lossy and lossless transformations of data.

Learning Is a Kan Extension

TL;DR

This work formalises error minimisation within a category-theoretic framework by proving that all such problems can be represented as a left Kan extension, specifically , which acts as a global minimiser. It introduces flavoured error as a lax 2-functor and shows that when has a left adjoint , then is a global minimiser for each , while left Kan extensions provide a universal minimising construction. The paper also presents a universal representation: any set-theoretic error minimisation problem can be recast as a 2-category extension problem whose left Kan extension yields the global minimisers. Overall, the framework offers an algebraic lens for interpreting and guiding optimization in data transformations, with inference treated as a pseudo-inverse and Kan extensions serving as a unifying tool for existence and construction of optimal solutions.

Abstract

Previous work has demonstrated that efficient algorithms exist for computing Kan extensions and that some Kan extensions have interesting similarities to various machine learning algorithms. This paper closes the gap by proving that all error minimisation algorithms may be presented as a Kan extension. This result provides a foundation for future work to investigate the optimisation of machine learning algorithms through their presentation as Kan extensions. A corollary of this representation of error-minimising algorithms is a presentation of error from the perspective of lossy and lossless transformations of data.

Paper Structure

This paper contains 7 sections, 8 theorems, 29 equations.

Key Result

Proposition 2.13

Given a monoidal preorder $S$ and a lax functor $F : P \rightarrow S$ then for composable morphisms $f$ and $g$ in $P$.

Theorems & Definitions (34)

  • Definition 2.1: Category
  • Definition 2.2: Functor
  • Definition 2.3: Natural Transform
  • Definition 2.4: 2-category
  • Definition 2.5: Adjoint Functors (triangle)
  • Definition 2.6: Left Kan Extension (local)
  • Definition 2.7: Monoid
  • Definition 2.8: Preorder
  • Remark 2.9
  • Definition 2.10: Bottom Element
  • ...and 24 more