t-SNE Exaggerates Clusters, Provably
Noah Bergam, Szymon Snoeck, Nakul Verma
TL;DR
This work proves that t-SNE visualizations can misrepresent input cluster structures and outlier extremity, showing that (i) similarly clustered outputs can come from arbitrarily unclustered inputs and (ii) tiny input perturbations can yield vastly different visualizations. It introduces impostor datasets that share t-SNE embeddings with the original data, exploiting additive and multiplicative invariances to distance, and proves a fundamental limit: stationary t-SNE outputs cannot depict extreme outliers beyond a small constant. The results are supported by formal theorems and empirical demonstrations on real and synthetic data (e.g., single-cell and BBC news embeddings), revealing substantial risks of false positives and misinterpretation. Overall, the paper establishes principled limits on what can be inferred from t-SNE plots and motivates the search for more reliable visualization guarantees and tools for exploratory data analysis.
Abstract
Central to the widespread use of t-distributed stochastic neighbor embedding (t-SNE) is the conviction that it produces visualizations whose structure roughly matches that of the input. To the contrary, we prove that (1) the strength of the input clustering, and (2) the extremity of outlier points, cannot be reliably inferred from the t-SNE output. We demonstrate the prevalence of these failure modes in practice as well.
