Exploring Weight Balancing on Long-Tailed Recognition Problem
Naoya Hasegawa, Issei Sato
TL;DR
This work investigates weight balancing (WB) for long-tailed recognition, linking its effectiveness to neural collapse and the cone effect. It decomposes WB into five components—WD, MaxNorm, CE, CB, and two-stage learning—and shows that stage-1 WD+CE raise Fisher's discriminant ratio and suppress inter-class cosine similarities, while stage-2 WD+CB induces implicit logit adjustment by reallocating classifier weight norms toward tail classes. The authors prove that, under neural-collapse-like conditions, WB can be simplified to a one-stage approach using WD, feature regularization, and an ETF classifier with multiplicative LA, yielding comparable or superior performance with reduced training complexity. The findings provide a principled guideline for designing LTR training and demonstrate practical simplifications that improve accuracy and efficiency across multiple datasets and model families.
Abstract
Recognition problems in long-tailed data, in which the sample size per class is heavily skewed, have gained importance because the distribution of the sample size per class in a dataset is generally exponential unless the sample size is intentionally adjusted. Various methods have been devised to address these problems.Recently, weight balancing, which combines well-known classical regularization techniques with two-stage training, has been proposed. Despite its simplicity, it is known for its high performance compared with existing methods devised in various ways. However, there is a lack of understanding as to why this method is effective for long-tailed data. In this study, we analyze weight balancing by focusing on neural collapse and the cone effect at each training stage and found that it can be decomposed into an increase in Fisher's discriminant ratio of the feature extractor caused by weight decay and cross entropy loss and implicit logit adjustment caused by weight decay and class-balanced loss. Our analysis enables the training method to be further simplified by reducing the number of training stages to one while increasing accuracy. Code is available at https://github.com/HN410/Exploring-Weight-Balancing-on-Long-Tailed-Recognition-Problem.
