Table of Contents
Fetching ...

Notes on the Mathematical Structure of GPT LLM Architectures

Spencer Becker-Kahn

Abstract

An exposition of the mathematics underpinning the neural network architecture of a GPT-3-style LLM.

Notes on the Mathematical Structure of GPT LLM Architectures

Abstract

An exposition of the mathematics underpinning the neural network architecture of a GPT-3-style LLM.

Paper Structure

This paper contains 16 sections, 43 equations.

Theorems & Definitions (6)

  • Remark 1.1
  • Remark 1.2
  • Remark 2.1
  • Remark 2.2
  • Remark 3.1
  • Remark 4.1