The Effects of Mixed Sample Data Augmentation are Class Dependent
Haeil Lee, Hansang Lee, Junmo Kim
TL;DR
This work shows that Mixed Sample Data Augmentation (MSDA) methods like Mixup, CutMix, and PuzzleMix induce class-dependent effects, improving some classes while degrading others despite overall gains. The authors define class-level metrics, including $R(m)$, $ΔR_{MSDA}(m)$, $N_{DC}$, and $\overline{ΔR_{DC}}$, to quantify degradation and then introduce DropMix, a simple strategy that randomly excludes MSDA samples in a controlled way to blend MSDA with non-MSDA data during training. Across CIFAR-100 and ImageNet, DropMix reduces the number of degraded classes and mitigates average recall degradation, while often improving overall accuracy; results are demonstrated for multiple MSDA methods and network architectures. The work highlights the non-uniform impact of data augmentation on class performance and offers a practical, low-cost mitigation with potential implications for fairness and reliability in AI systems; it also opens avenues for deeper analysis of when and why MSDA harms certain classes and how to tailor augmentation to minimize bias.
Abstract
Mixed Sample Data Augmentation (MSDA) techniques, such as Mixup, CutMix, and PuzzleMix, have been widely acknowledged for enhancing performance in a variety of tasks. A previous study reported the class dependency of traditional data augmentation (DA), where certain classes benefit disproportionately compared to others. This paper reveals a class dependent effect of MSDA, where some classes experience improved performance while others experience degraded performance. This research addresses the issue of class dependency in MSDA and proposes an algorithm to mitigate it. The approach involves training on a mixture of MSDA and non-MSDA data, which not only mitigates the negative impact on the affected classes, but also improves overall accuracy. Furthermore, we provide in-depth analysis and discussion of why MSDA introduced class dependencies and which classes are most likely to have them.
