An Analytical Model for Overparameterized Learning Under Class Imbalance
Eliav Mor, Yair Carmon
TL;DR
A tight, closed form approximation is developed for the test error of several practical learning methods, including logit adjustment and class dependent temperature, in a high-dimensional Gaussian mixture model.
Abstract
We study class-imbalanced linear classification in a high-dimensional Gaussian mixture model. We develop a tight, closed form approximation for the test error of several practical learning methods, including logit adjustment and class dependent temperature. Our approximation allows us to analytically tune and compare these methods, highlighting how and when they overcome the pitfalls of standard cross-entropy minimization. We test our theoretical findings on simulated data and imbalanced CIFAR10, MNIST and FashionMNIST datasets.
