Mitigating Negative Flips via Margin Preserving Training
Simone Ricci, Niccolò Biondi, Federico Pernici, Alberto Del Bimbo
TL;DR
The paper addresses negative flips that occur when updating image classifiers as new classes are added, noting that margin shrinkage exacerbates this issue. It introduces Margin Preserving Training (MPT), which combines a margin-calibrated softmax loss with a positive logit bias for new classes and a double-source focal distillation from both the old model and a fully trained new model to balance learning across old and new classes. The approach achieves consistent reductions in Negative Flip Rate while maintaining or improving overall accuracy on CIFAR100 and ImageNet1K, with qualitative analyses confirming margin preservation. The method provides a practical upgrade mechanism that mitigates prediction churn without resorting to expensive ensembles, albeit with some computational overhead and hyperparameter tuning.
Abstract
Minimizing inconsistencies across successive versions of an AI system is as crucial as reducing the overall error. In image classification, such inconsistencies manifest as negative flips, where an updated model misclassifies test samples that were previously classified correctly. This issue becomes increasingly pronounced as the number of training classes grows over time, since adding new categories reduces the margin of each class and may introduce conflicting patterns that undermine their learning process, thereby degrading performance on the original subset. To mitigate negative flips, we propose a novel approach that preserves the margins of the original model while learning an improved one. Our method encourages a larger relative margin between the previously learned and newly introduced classes by introducing an explicit margin-calibration term on the logits. However, overly constraining the logit margin for the new classes can significantly degrade their accuracy compared to a new independently trained model. To address this, we integrate a double-source focal distillation loss with the previous model and a new independently trained model, learning an appropriate decision margin from both old and new data, even under a logit margin calibration. Extensive experiments on image classification benchmarks demonstrate that our approach consistently reduces the negative flip rate with high overall accuracy.
