A Tutorial on Principal Component Analysis
Jonathon Shlens
TL;DR
PCA addresses how to reveal simple structure in high-dimensional data by seeking a linear change of basis that concentrates variance into a few directions. The paper presents both intuitive toy- examples and rigorous linear-algebra derivations, showing that PCA is tightly linked to eigenvector decompositions of the covariance matrix and to the singular value decomposition (SVD). It provides practical steps for centering data, computing PCA via eigen decomposition or SVD, and interpreting the resulting principal components and variances. It also discusses limitations, including nonlinearity and higher-order dependencies, and points to kernel PCA and ICA as extensions when linear, second-order assumptions are insufficient.
Abstract
Principal component analysis (PCA) is a mainstay of modern data analysis - a black box that is widely used but (sometimes) poorly understood. The goal of this paper is to dispel the magic behind this black box. This manuscript focuses on building a solid intuition for how and why principal component analysis works. This manuscript crystallizes this knowledge by deriving from simple intuitions, the mathematics behind PCA. This tutorial does not shy away from explaining the ideas informally, nor does it shy away from the mathematics. The hope is that by addressing both aspects, readers of all levels will be able to gain a better understanding of PCA as well as the when, the how and the why of applying this technique.
